TAMAS: Stress-testing Multi‑Agent AI for Safety
AI agents are starting to work in teams. That unlocks power—and new ways things can go wrong. TAMAS is a benchmark that stress‑tests multi‑agent LLM systems against adversarial tricks and coordination failures. * 5 realistic scenarios, 300 attack instances across 6 attack types * 211 tools, plus 100 harmless