Models Capabilities Use Cases Benchmarks Papers Glossary

Models Capabilities Use Cases Benchmarks Papers Glossary

About Privacy Terms RSS

ThinkLLM

Spot an error in our data? Let us know.

Glossary/Advantage Estimation

Advantage Estimation

techniques

Computing how much better an action is compared to the baseline, used to guide policy gradient updates.

Advantage Estimation — Glossary — ThinkLLM