Agentic Reinforcement Learning — Glossary — ThinkLLM