Reward Function — Glossary — ThinkLLM