A measure of randomness or diversity in a policy's action distribution; lower entropy means more concentrated, deterministic behavior.