LLMs Improving LLMs: Agentic Discovery for Test-Time Scaling — ThinkLLM