Reward models are metrics in disguise
Different labels, same pitfalls. This position paper argues that reward models (used in RL-based LLM training) and evaluation metrics face overlapping challenges: spurious correlations, reward hacking, data quality, and meta-evaluation. On some tasks, metrics even outperform reward models. Why it matters: Aligning these research communities could improve preference elicitation, robustness to