When an agent exploits loopholes in the reward system to maximize score without actually solving the intended task.