Preference-based Reinforcement Learning
techniques
Learning reward models from pairwise comparisons of behaviors instead of explicit reward signals.
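As a rough illustration of the definition above, the sketch below fits a reward model from pairwise comparisons alone. It assumes a Bradley-Terry preference model, in which the probability that behavior A is preferred over behavior B is the sigmoid of the difference of their rewards, and a linear reward over hand-made feature vectors. Both are common choices rather than part of the definition. All names here (`fit_reward_model`, `make_synthetic_pairs`, `TRUE_W`) are hypothetical, and the preference data is synthetic.

```python
import math
import random


def reward(w, x):
    """Linear reward: dot product of weights and behavior features (an assumption)."""
    return sum(wi * xi for wi, xi in zip(w, x))


def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def fit_reward_model(pairs, dim, lr=0.1, epochs=200):
    """Gradient ascent on the Bradley-Terry log-likelihood.

    Each element of `pairs` is (preferred, rejected). No numeric reward
    is ever observed; only which of the two behaviors was preferred.
    """
    w = [0.0] * dim
    for _ in range(epochs):
        grad = [0.0] * dim
        for preferred, rejected in pairs:
            # Model's current probability that `preferred` wins.
            p = sigmoid(reward(w, preferred) - reward(w, rejected))
            for i in range(dim):
                # d/dw log(sigmoid(r_pref - r_rej)) = (1 - p) * (x_pref - x_rej)
                grad[i] += (1.0 - p) * (preferred[i] - rejected[i])
        w = [wi + lr * g / len(pairs) for wi, g in zip(w, grad)]
    return w


def make_synthetic_pairs(true_w, n_pairs, seed=0):
    """Simulate a noiseless annotator who prefers the higher true reward."""
    rng = random.Random(seed)
    pairs = []
    for _ in range(n_pairs):
        a = [rng.uniform(-1, 1) for _ in true_w]
        b = [rng.uniform(-1, 1) for _ in true_w]
        if reward(true_w, a) >= reward(true_w, b):
            pairs.append((a, b))
        else:
            pairs.append((b, a))
    return pairs


# Hidden "true" reward used only to generate the comparisons.
TRUE_W = [2.0, -1.0]
PAIRS = make_synthetic_pairs(TRUE_W, n_pairs=200)
LEARNED_W = fit_reward_model(PAIRS, dim=2)
```

Comparisons only identify the reward up to scale, so `LEARNED_W` should be expected to match the direction of `TRUE_W` rather than its magnitude. In a full pipeline, the learned reward would then be handed to a standard reinforcement learning algorithm to optimize a policy; that step is omitted here.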