Multi-Objective Reinforcement Learning (MORL) — Glossary — ThinkLLM