Differentiable Reward Model
techniques
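A reward function built so that its output is differentiable with respect to its input. Because the reward is differentiable, its gradient can be backpropagated through the reward model into the parameters of the model being trained, so the reward can be optimized directly by gradient ascent rather than only through sampled, score-based estimates.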
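To make the idea of gradients flowing through the reward concrete, here is a minimal sketch in plain Python. Everything in it is a hypothetical toy chosen for illustration and is not taken from any particular library or paper: `generate` stands in for the model being trained, `reward` is a smooth quadratic reward, and `reward_grad` is its hand-written derivative. In practice the reward would usually be a neural network and the derivative would come from an automatic differentiation framework.

```python
# Hypothetical toy setup: these names and values are illustrative assumptions.

def generate(theta: float, x: float) -> float:
    """Toy 'model': a one-parameter linear map from input x to output y."""
    return theta * x


def reward(y: float, target: float = 2.0) -> float:
    """Toy differentiable reward: maximal (zero) when y equals the target."""
    return -(y - target) ** 2


def reward_grad(y: float, target: float = 2.0) -> float:
    """Analytic derivative d(reward)/dy, which exists because reward is smooth."""
    return -2.0 * (y - target)


def train(theta: float, x: float, lr: float = 0.1, steps: int = 50) -> float:
    """Gradient ascent on reward(generate(theta, x)) using the chain rule."""
    for _ in range(steps):
        y = generate(theta, x)
        # Chain rule: d(reward)/d(theta) = d(reward)/dy * dy/d(theta),
        # and dy/d(theta) = x for this linear toy model.
        grad_theta = reward_grad(y) * x
        theta += lr * grad_theta
    return theta


theta_final = train(theta=0.0, x=1.0)
final_reward = reward(generate(theta_final, 1.0))
```

The key step is the chain-rule line: the reward's gradient with respect to the model output is passed back to the model parameter. If `reward` were not differentiable (for example, a hard pass/fail check), that line could not be written, and training would have to rely on sampling-based estimates instead.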