Bandit Feedback — Glossary — ThinkLLM