LLMs
Elo-Rated LLM Reviewers: Can Rankings Improve Peer Review?
Can we make peer review fairer by rating reviewers like chess players? This study simulates a conference where multiple LLM agent reviewers with distinct personas evaluate papers across several rounds, guided by an Area Chair (AC). Researchers compared a baseline setup to versions that add Elo ratings (to track reviewer