Think
LLM
Models
Capabilities
Use Cases
Benchmarks
Papers
Glossary
Search
/
Glossary
/
Reward-Guided Fine-Tuning
Reward-Guided Fine-Tuning
techniques
Adapting a model to optimize for a specific reward signal during training.
Reward-Guided Fine-Tuning — Glossary — ThinkLLM