AI
Can AI Spot Nonsense in Pictures? Meet UAIT
Can AI spot nonsense in pictures? Humans can tell that “A sandwich cuts a chef” is grammatically fine but semantically absurd. Many vision-language models (VLMs) can’t. UAIT (Uncommon-sense Action Image-Text) is a new benchmark that stress-tests whether VLMs truly understand who is doing what to whom—and what’s