ORPO (Odds Ratio Preference Optimization) — Glossary — ThinkLLM