A probability distribution over state-action pairs visited by a policy, used to characterize exploration behavior.
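As a rough illustration, such a distribution can be estimated empirically by counting how often each state-action pair appears in sampled rollouts and normalizing the counts. The sketch below assumes discrete, hashable states and actions and uses undiscounted visit frequencies; the function names `empirical_visitation` and `visitation_entropy` and the sample rollouts are hypothetical, and the entropy is only one possible way to summarize how broadly a policy explores.

```python
import math
from collections import Counter


def empirical_visitation(trajectories):
    """Estimate a state-action visitation distribution from rollouts.

    `trajectories` is a list of episodes, each a list of (state, action)
    pairs. Returns a dict mapping (state, action) to its relative visit
    frequency (undiscounted; this weighting is an assumption).
    """
    counts = Counter(pair for episode in trajectories for pair in episode)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {pair: n / total for pair, n in counts.items()}


def visitation_entropy(dist):
    """Shannon entropy (in nats) of the distribution.

    Higher values mean visits are spread over more state-action pairs.
    """
    return -sum(p * math.log(p) for p in dist.values() if p > 0)


# Hypothetical rollouts collected from some policy.
rollouts = [
    [("s0", "left"), ("s1", "right"), ("s0", "left")],
    [("s0", "right")],
]
dist = empirical_visitation(rollouts)
entropy = visitation_entropy(dist)
```

A policy that keeps revisiting the same few pairs yields a peaked distribution with low entropy, while one that spreads its visits evenly yields a flatter distribution with higher entropy.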