A sequence of checks replacing ground-truth labels: responsiveness to safe/unsafe contrasts, dominance of target variance, and stability across reruns.