A reward signal that encourages an agent to explore and discover new states, separate from task-specific rewards.