Reward Signal — Glossary — ThinkLLM