Models Capabilities Use Cases Benchmarks Papers Glossary

ThinkLLM

Reward-hackable

techniques

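Describes a benchmark task on which an agent can achieve a high score without actually solving the intended problem, typically by exploiting a gap between what the scoring function checks and what the task is meant to measure.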
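As a minimal illustration (a toy sketch, not drawn from any real benchmark), consider a sorting task scored by a grader that only checks whether the output is in order. Every name here (`weak_grader`, `strict_grader`, `honest_agent`, `hacking_agent`, `score`, `TASKS`) is hypothetical and chosen for this example only.

```python
def weak_grader(task_input, output):
    """Flawed check: only verifies that the output is in non-decreasing order."""
    return all(a <= b for a, b in zip(output, output[1:]))


def strict_grader(task_input, output):
    """Stricter check: the output must be exactly the sorted input."""
    return output == sorted(task_input)


def honest_agent(task_input):
    """Actually solves the intended problem."""
    return sorted(task_input)


def hacking_agent(task_input):
    """Ignores the input; an empty list is trivially 'in order'."""
    return []


def score(agent, grader, tasks):
    """Fraction of tasks on which the grader accepts the agent's output."""
    return sum(grader(t, agent(t)) for t in tasks) / len(tasks)


# Hypothetical task set for the illustration.
TASKS = [[3, 1, 2], [5, 4], [9, 7, 8, 6]]

if __name__ == "__main__":
    for agent in (honest_agent, hacking_agent):
        print(agent.__name__,
              "weak:", score(agent, weak_grader, TASKS),
              "strict:", score(agent, strict_grader, TASKS))
```

Under the weak grader the task is reward-hackable: the agent that returns an empty list scores as well as the one that sorts. Tightening the check so that it also compares against the expected contents removes that particular shortcut.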
