Evaluating model performance separately for rare, medium-frequency, and common classes to reveal patterns, such as weak accuracy on rare classes, that overall metrics hide.
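
One way this kind of breakdown might be computed is sketched below. It is a minimal illustration, not a definitive implementation: it assumes classes are grouped by how often they appear in the training set, and the thresholds (`rare_max=20`, `common_min=100`), the function names, and the toy labels are all hypothetical choices made for this example. Per-class accuracy is averaged within each bucket and reported next to overall accuracy.

```python
from collections import Counter, defaultdict


def frequency_bucket(count, rare_max=20, common_min=100):
    """Map a training-set count to a bucket. Thresholds are illustrative assumptions."""
    if count < rare_max:
        return "rare"
    if count <= common_min:
        return "medium"
    return "common"


def accuracy_by_bucket(y_true, y_pred, train_labels, rare_max=20, common_min=100):
    """Return overall accuracy plus mean per-class accuracy within each bucket."""
    # Classes absent from train_labels get a count of 0 and fall into "rare".
    train_counts = Counter(train_labels)

    # Tally correct and total predictions per true class.
    correct = defaultdict(int)
    total = defaultdict(int)
    for t, p in zip(y_true, y_pred):
        total[t] += 1
        correct[t] += int(t == p)

    # Group per-class accuracies by the frequency bucket of each class.
    per_bucket = defaultdict(list)
    for cls, n in total.items():
        bucket = frequency_bucket(train_counts[cls], rare_max, common_min)
        per_bucket[bucket].append(correct[cls] / n)

    report = {b: sum(accs) / len(accs) for b, accs in per_bucket.items()}
    report["overall"] = sum(correct.values()) / len(y_true)
    return report


# Toy data (hypothetical): one common, one medium, and one rare class.
train_labels = ["cat"] * 500 + ["fox"] * 50 + ["lynx"] * 5
y_true = ["cat"] * 90 + ["fox"] * 8 + ["lynx"] * 2
# The model gets every "cat" right, 6 of 8 "fox", and no "lynx".
y_pred = ["cat"] * 90 + ["fox"] * 6 + ["cat"] * 2 + ["cat"] * 2

report = accuracy_by_bucket(y_true, y_pred, train_labels)
for name in ("overall", "common", "medium", "rare"):
    print(f"{name}: {report[name]:.2f}")
```

In this toy setup the overall accuracy looks high because the common class dominates the test set, while the rare bucket shows that the single rare class is never predicted correctly. Averaging per class within a bucket, rather than per sample, is a design choice here so that a large class does not dominate its own bucket.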