LLM
LLM-as-a-Judge: Can AI pick the best slate for you?
Can an LLM judge the best playlist, not just the next song? Recommender systems often serve slates—ordered lists like your home feed or a playlist. Modeling what a person prefers across domains is hard. This study tests Large Language Models as a 'world model' of user preferences: