Delayed Reward — Glossary — ThinkLLM