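A metric measuring how well a model ranks positive cases above negative cases, ranging from 0 to 1, where 0.5 corresponds to random ranking and 1.0 to perfect ranking; values below 0.5 indicate a ranking that is worse than random.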
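The ranking interpretation can be made concrete with a small sketch. It assumes the metric described here is the area under the ROC curve (AUC), which the entry does not name, and computes it as the fraction of (positive, negative) pairs in which the positive case receives the higher score, counting ties as half. The function name `pairwise_ranking_score` and the sample data are illustrative, not taken from the source.

```python
def pairwise_ranking_score(labels, scores):
    """Fraction of (positive, negative) pairs ranked correctly.

    labels: iterable of 0/1 ground-truth values (1 = positive).
    scores: iterable of model scores, higher meaning "more likely positive".
    Ties between a positive and a negative score count as half a correct pair.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative case")

    correct = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                correct += 1.0   # positive ranked above negative
            elif p == n:
                correct += 0.5   # tie: counted as a coin flip
    return correct / (len(pos) * len(neg))


# Hypothetical example data: 2 positives, 2 negatives.
labels = [1, 0, 1, 0]
scores = [0.9, 0.8, 0.4, 0.1]
result = pairwise_ranking_score(labels, scores)  # 3 of 4 pairs are ordered correctly
```

This quadratic loop is meant only to show the definition. For real datasets a library routine such as scikit-learn's `roc_auc_score` would normally be used instead. A model that gives every case the same score lands at 0.5 under this definition, and one that ranks every positive above every negative reaches 1.0.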