ThinkLLM

Reward Uncertainty

techniques

Treating the reward as a probability distribution over possible values (or over candidate reward functions) rather than a single fixed scalar, reflecting ambiguity in what behavior is actually desired.
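As a rough illustration of the definition above, the sketch below represents the reward for a response as a set of samples rather than one number, then summarizes it by its mean and spread. Everything here is an assumption made for illustration: the sample scores are invented, the idea that they come from several independently trained reward models is just one common way to obtain such samples, and the helper names `reward_distribution` and `pessimistic_reward` and the penalty weight `k` are hypothetical.

```python
import statistics


def reward_distribution(scores):
    """Summarize reward samples as (mean, standard deviation)."""
    mean = statistics.fmean(scores)
    std = statistics.pstdev(scores)  # population std over the samples
    return mean, std


def pessimistic_reward(scores, k=1.0):
    """Penalize uncertain rewards: mean minus k standard deviations.

    `k` is a hypothetical knob controlling how strongly disagreement
    among the samples lowers the reward that is acted on.
    """
    mean, std = reward_distribution(scores)
    return mean - k * std


# Invented scores for two candidate responses, e.g. from four reward models.
confident = [0.70, 0.72, 0.68, 0.70]  # samples agree closely
ambiguous = [0.20, 1.00, 0.40, 1.20]  # same mean, large disagreement

for name, scores in [("confident", confident), ("ambiguous", ambiguous)]:
    mean, std = reward_distribution(scores)
    print(f"{name}: mean={mean:.3f} std={std:.3f} "
          f"pessimistic={pessimistic_reward(scores):.3f}")
```

Both responses have the same average reward, so a fixed scalar would rank them equally. Keeping the spread makes it possible to prefer the response whose reward is less ambiguous; subtracting a multiple of the standard deviation is only one of several ways to use that information.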