Teaching self-driving cars to spot the unknown

Kari Jaaskelainen

11 Nov 2025 — 1 min read

Real roads are messy. To stay safe, autonomous vehicles must notice unexpected hazards, from a toppled ladder to a runaway stroller. That is the challenge of out-of-distribution (OOD) segmentation: marking the pixels that belong to objects outside the classes a model was trained on.

Seungheon Song and Jaekoo Lee propose a simple idea with big impact: use language to help vision. Their model aligns what a camera sees with rich text descriptions, so it can recognize when something “doesn’t belong.”

Text-driven learning: A vision–language encoder and a transformer decoder learn from both images and words.
Distance-based prompts: Text prompts set at varying semantic distances from known road classes teach the model what “other” can look like.
Semantic augmentation: Extra OOD descriptors diversify training without needing new images.
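
To make the idea of "semantic distance" concrete, here is a minimal sketch, not the authors' implementation. It assumes that pixels and text prompts already live in a shared embedding space, uses hand-written 3-D toy vectors in place of a real vision-language encoder, and treats cosine distance to the nearest known class as the measure of how far something is from the road vocabulary. The function names `ood_score` and `prompts_by_distance` and all vectors are hypothetical.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        raise ValueError("cosine similarity is undefined for zero vectors")
    return dot / (norm_u * norm_v)


def ood_score(embedding, known_text_embs):
    """Distance to the nearest known class: 1 - max cosine similarity.

    Close to 0 means the embedding matches some known class prompt;
    larger values mean it is far from every known class.
    """
    return 1.0 - max(cosine(embedding, t) for t in known_text_embs.values())


def prompts_by_distance(candidate_embs, known_text_embs):
    """Order candidate 'other' prompts from nearest to farthest
    relative to the known classes."""
    scored = [(name, ood_score(emb, known_text_embs))
              for name, emb in candidate_embs.items()]
    return sorted(scored, key=lambda pair: pair[1])


# Toy embeddings (assumption: a real system would get these from a text encoder).
known = {"road": [1.0, 0.0, 0.0], "car": [0.0, 1.0, 0.0]}
candidates = {
    "ladder": [0.2, 0.2, 0.9],  # far from both known classes
    "truck": [0.1, 0.9, 0.1],   # close to "car"
}

ranked = prompts_by_distance(candidates, known)
for name, dist in ranked:
    print(f"{name}: distance {dist:.3f}")
```

In this toy setup "truck" lands near the known class "car" while "ladder" lands far away, which is the kind of near-to-far spread the distance-based prompts are described as covering. The same `ood_score` function can be applied to a per-pixel embedding to get a simple anomaly score.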

The result? The authors report strong generalization to unseen objects and state-of-the-art performance on Fishyscapes, Segment-Me-If-You-Can, and Road Anomaly, at both the pixel and object levels.

Why it matters: Better OOD segmentation means fewer blind spots and more reliable decision-making for robots and self-driving cars.

Paper: http://arxiv.org/abs/2511.07238v1

AutonomousDriving ComputerVision AI Safety Segmentation OOD VisionLanguage Robotics SelfDriving DeepLearning
