Think
LLM
Models
Capabilities
Use Cases
Benchmarks
Papers
Glossary
Search
/
Glossary
/
Confidence-Driven Reinforcement Learning
Confidence-Driven Reinforcement Learning
techniques
Training a model using rewards based on how well its confidence scores match its actual correctness.
Confidence-Driven Reinforcement Learning — Glossary — ThinkLLM