Teaching robots to ask for clarification in 3D

Teaching robots to ask for clarification in 3D

When robots should ask: Which one?

In safety-critical places like operating rooms, a vague command like "Pass me the vial" can be dangerous. This paper introduces a simple idea with big impact: teach AI to detect when an instruction is ambiguous in a 3D scene and pause to ask for clarification.

  • New task: Open-Vocabulary 3D Instruction Ambiguity Detection — decide if a command has exactly one clear target in a scene.
  • New dataset: Ambi3D with 700+ diverse scenes and ~22k instructions to stress-test models.
  • Key finding: Today’s leading 3D LLMs often miss ambiguity.
  • New method: AmbiVer, a two-stage system that gathers visual evidence from multiple views and uses it to judge clarity more reliably.

Why it matters: More cautious, trustworthy assistants — from hospitals and labs to warehouses and homes.

Read more: https://arxiv.org/abs/2601.05991 and project/code: https://jiayuding031020.github.io/ambi3d/

Paper: https://arxiv.org/abs/2601.05991v1

Register: https://www.AiFeta.com

#AI #Robotics #Safety #ComputerVision #3D #LLM #VLM #HRI #EmbodiedAI

Read more