Skill-RM: Unifying Heterogeneous Evaluation Criteria via Agent Skill — ThinkLLM