LLMSurgeon: Diagnosing Data Mixture of Large Language Models — ThinkLLM