Think
LLM
Models
Capabilities
Use Cases
Benchmarks
Papers
Glossary
Search
/
Glossary
/
Label-Free Reward
Label-Free Reward
techniques
A training signal derived from model behavior itself rather than human-annotated labels.
Label-Free Reward — Glossary — ThinkLLM