How Fast Should a Model Commit to Supervision? Training Reasoning Models on the Tsallis Loss Continuum — ThinkLLM