Rethinking the Divergence Regularization in LLM RL — ThinkLLM