When Leaderboards Mislead: Annotation Errors in Text-to-SQL
Leaderboards drive Text-to-SQL progress, but what if the test sets are wrong? This study audits two popular benchmarks and finds widespread annotation errors that can flip who looks best.

* Error rates: 52.8% in BIRD Mini-Dev, 62.8% in Spider 2.0-Snow.
* After correcting a BIRD Dev subset, open-source agents