Humanity's Last Exam — Benchmark — ThinkLLM