Reward-Confidence Covariance — Glossary — ThinkLLM