Toward Calibrated Mixture-of-Experts Under Distribution Shift — ThinkLLM