Reward-hackable — Glossary — ThinkLLM