Hidden Triggers in Robot Vision: A Backdoor Risk for MLLM-Powered Agents

New study warns that vision-powered AI agents can hide backdoors. Multimodal large language models (MLLMs) let robots see, reason, and act — but a specific object in view can secretly flip them into an attacker’s plan.

The authors introduce BEAT, the first framework to plant such visual backdoors using everyday objects as triggers. Because objects look different across angles and lighting, BEAT trains models to recognize the trigger robustly, pairing standard fine-tuning with a new Contrastive Trigger Learning step that sharpens the boundary between trigger-present and trigger-free inputs.

Results: up to 80% attack success while keeping normal task performance, and reliable activation even when the trigger appears in new places; under limited data, the contrastive step boosts activation by up to 39%.

Why it matters: embodied agents in homes, factories, and AR could be steered by innocuous-looking items. The community needs defenses now — model auditing, trigger detection, dataset hygiene, and rigorous red-teaming — before real-world deployment.

Paper: http://arxiv.org/abs/2510.27623v1

Paper: http://arxiv.org/abs/2510.27623v1

Register: https://www.AiFeta.com

#AI #Security #Robotics #ComputerVision #Safety #MLLM #Backdoor

Hidden Triggers in Robot Vision: A Backdoor Risk for MLLM-Powered Agents

Read more

Tekoälyapuria ei kannata valita pelkän esittelytekstin perusteella

Hakutulosten kannattaa olla hyödyllisiä, ei vain samankaltaisia

Yksi malli voi pian puhua, soittaa ja kolista – pelkillä tekstiohjeilla

Tekoälyn kanssa pärjäämme paremmin sopimalla kuin komentamalla