SafeSteer: Localized On-Policy Distillation for Efficient Safety Alignment — ThinkLLM