FutureSim: Replaying World Events to Evaluate Adaptive Agents — ThinkLLM