A simple classifier trained on top of a model's internal representations to detect specific properties.