Action-Value Estimation — Glossary — ThinkLLM