AI - AI Feta, the news about scientific AI research (Page 10)

AI

Escaping the Verifier: Learning to Reason via Demonstrations

LLMs can learn to reason—without task verifiers Many real-world problems don’t have automatic checkers to grade answers, even though we have lots of expert solutions. RARO (Relativistic Adversarial Reasoning Optimization) shows how to train reasoning skills from those examples alone. How it works: * A policy (the model) tries

Robotics

VacuumVLA: A Two-in-One Robot Hand That Grips and Suctions

Robots guided by Vision-Language-Action (VLA) AI are getting better at everyday tasks—but most still use simple two-finger grippers. That limits them on smooth, flat, or handleless objects. VacuumVLA is a low-cost robot hand that merges a standard two-finger gripper with a vacuum suction cup. The robot can switch between

DrugRepurposing

ChatDRex: No‑Code, Conversational AI for Drug Repurposing

Meet ChatDRex Finding new uses for approved drugs can save years and millions. But making those predictions usually takes teams of specialists and a tangle of tools. ChatDRex changes that. It’s a conversation-based, no‑code, multi‑agent system that lets clinicians and researchers ask complex bioinformatics questions in plain

AI

ToolOrchestra: Small AI that smartly manages tools

Large models are powerful, but solving deep, multi-step problems is pricey. ToolOrchestra trains a small “orchestrator” that decides which tools to use, when, and how—like a smart project manager for AI. It uses reinforcement learning with rewards for outcomes, efficiency, and user preferences. The result: an 8B Orchestrator that

AI

ToolOrchestra: Small Maestros, Big Intelligence

Small model, big wins Large language models are great generalists, but really tough, multi-step problems still strain both brains and budgets. ToolOrchestra flips the script: instead of one giant model, a small “orchestrator” coordinates other models and specialized tools. Trained with reinforcement learning that rewards outcomes, efficiency, and user preferences,

AI

BAMAS: Budget-Aware AI Teams That Deliver

What’s new As AI "teams" of LLM agents grow, the cloud bill can explode. BAMAS is a framework that designs these teams with a dollar cap in mind. How it works * Picks the right mix of models: Uses integer linear programming to balance accuracy vs. price. * Plans

AI

Meet EWE: An AI Weather Expert for Extreme Events

Extreme weather is getting worse, but understanding the physical “why” behind each event still takes weeks of expert detective work. Forecasting AI has improved, yet diagnostic reasoning — explaining causes — has lagged. Enter EWE (Extreme Weather Expert), a new AI agent that acts like a meteorologist-in-the-loop. EWE plans analyses, reasons in

AI

EWE: an AI Weather Detective for Extreme Events

Extreme weather is hitting harder and more often, but diagnosing the “why” behind each event is slow and expert-heavy. Meet EWE (Extreme Weather Expert): an AI agent that acts like a weather detective. * Thinks like experts: Plans analyses, reasons step-by-step, and uses a meteorology toolkit. * Works end-to-end: Turns raw atmospheric

cybersecurity

Can AI spot phishing? A new email dataset puts it to the test

Can AI spot phishing? This new email dataset puts it to the test Phishing and spam are getting smarter—often written by large language models. Rebeka Toth, Tamas Bisztray, and Richard Dubniczky release a labeled email dataset that separates phishing, spam, and legitimate messages, and flags whether they were written

AI

Teach Your Phone’s AI to Fix Itself—in Seconds

AI on our phones still makes awkward mistakes—and fixing them usually means heavy retraining in the cloud. This research shows a lightweight way to correct errors on-device, in seconds. What’s new: * One-shot fixes: Show the app a single example, and it updates its prototype for that class—no

cybersecurity

Smarter Email Dataset to Tackle Phishing and Spam

Smarter email defenses, grounded in real messages Phishing and spam are evolving fast—often with help from AI. This study releases a large, carefully labeled email dataset spanning phishing, spam, and legitimate messages, with a key twist: it marks whether each message was written by a human or an LLM.

Robotics

VacuumVLA: Two skills, one robot hand

Robots guided by Vision-Language-Action (VLA) models are getting good at everyday tasks — but most still grab with simple two-finger claws. That limits them on flat, slippery, or handle-less surfaces. VacuumVLA adds a low-cost twist: a single end-effector that combines a standard gripper with a vacuum suction tool. It can switch