A strategy that balances exploring uncertain options with exploiting known good options based on confidence estimates.