Reward models today struggle with personalization: they cannot distinguish between equally good responses on the basis of individual user preferences. This benchmark provides a way to measure and improve that capability.
This paper introduces Personalized RewardBench, a benchmark for testing whether reward models can capture individual user preferences rather than just general quality.
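To make the idea of "capturing individual user preferences" concrete, here is a minimal sketch of how a reward model could be scored on personalized preference pairs. It assumes a pairwise setup (one preferred and one dispreferred response per user), which is common for reward model benchmarks but is not confirmed by the text above as this benchmark's protocol. The names `PreferencePair`, `pairwise_accuracy`, and `length_reward`, as well as the example data, are hypothetical and chosen only for illustration.

```python
from typing import Callable, NamedTuple, Sequence


class PreferencePair(NamedTuple):
    """One hypothetical benchmark item: two responses judged by one user."""
    user_profile: str  # description of the user's preferences (assumed field)
    prompt: str
    chosen: str        # response this user prefers
    rejected: str      # response this user does not prefer


# A reward model is treated here as any function that maps
# (user_profile, prompt, response) to a scalar score.
RewardFn = Callable[[str, str, str], float]


def pairwise_accuracy(reward_fn: RewardFn, pairs: Sequence[PreferencePair]) -> float:
    """Fraction of pairs where the chosen response scores strictly higher."""
    if not pairs:
        raise ValueError("pairs must not be empty")
    correct = sum(
        reward_fn(p.user_profile, p.prompt, p.chosen)
        > reward_fn(p.user_profile, p.prompt, p.rejected)
        for p in pairs
    )
    return correct / len(pairs)


def length_reward(user_profile: str, prompt: str, response: str) -> float:
    """Toy stand-in for a non-personalized reward model: longer is better."""
    return float(len(response))


SHORT = "DNS maps names to IP addresses."
LONG = ("DNS, the Domain Name System, is a hierarchical naming system "
        "that maps human-readable names to IP addresses.")

# Two users with opposite preferences over the same two responses.
pairs = [
    PreferencePair("prefers brief answers", "Explain DNS.", SHORT, LONG),
    PreferencePair("prefers detailed answers", "Explain DNS.", LONG, SHORT),
]

accuracy = pairwise_accuracy(length_reward, pairs)
print(f"pairwise accuracy: {accuracy:.2f}")
```

Because the toy scorer ignores the user profile, it ranks the same response higher for both users and is right for only one of them. That is the failure mode the summary describes: both responses are acceptable in general, and only the user's preference separates them.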