The process of trying diverse actions during training to discover which ones lead to better outcomes.