Preference Optimization — Glossary — ThinkLLM