Think
LLM
Models
Capabilities
Use Cases
Benchmarks
Papers
Glossary
Search
/
Glossary
/
Reinforcement Fine-Tuning
Reinforcement Fine-Tuning
techniques
Adapting a model using reinforcement learning signals from verifiable rewards during post-training.
Reinforcement Fine-Tuning — Glossary — ThinkLLM