Vector-valued Reward — Glossary — ThinkLLM