A specialized language model trained to assess and score the quality of outputs from other AI models, acting as an automated judge.