ReContext: Recursive Evidence Replay as LLM Harness for Long-Context Reasoning

Yanjun Zhao, Ruizhong Qiu, Tianxin Wei, Yuanchen Bei, Zhining Liu et al. | July 2, 2026 | arXiv

Key Takeaway

You can boost long-context reasoning without retraining by identifying relevant evidence through attention patterns and replaying it before generation. It is a simple inference-time technique that works across different model sizes.

Summary

ReContext improves how LLMs use information in long documents by replaying relevant evidence before generating answers. Instead of retraining the model or pruning the context, it uses the model's internal attention signals to identify and reorder important passages, helping the model focus on what matters for each question.
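The replay idea can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the prompt layout, and the per-passage scores are all assumptions made for illustration. In particular, the scores are supplied by hand here, whereas the summary describes them as coming from the model's internal attention signals, and the sketch performs a single replay pass rather than anything recursive.

```python
from typing import List, Sequence


def select_evidence(scores: Sequence[float], k: int) -> List[int]:
    """Return the indices of the k highest-scoring passages, in document order."""
    if k <= 0:
        return []
    ranked = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    return sorted(ranked[:k])


def build_replay_prompt(passages: Sequence[str], scores: Sequence[float],
                        question: str, k: int = 2) -> str:
    """Keep the full context, then repeat the top-k passages just before the question.

    Hypothetical layout: the section labels below are illustrative only.
    """
    if len(passages) != len(scores):
        raise ValueError("need exactly one score per passage")
    evidence = [passages[i] for i in select_evidence(scores, k)]
    parts = [
        "Context:",
        *passages,                        # nothing is pruned from the original context
        "Relevant evidence (replayed):",
        *evidence,                        # selected passages appear a second time
        f"Question: {question}",
        "Answer:",
    ]
    return "\n".join(parts)


passages = [
    "The bridge opened in 1932.",
    "Tickets cost five dollars.",
    "The architect was Ada Reyes.",
]
# Assumed relevance scores; a real system would derive these from attention
# between the question and each passage.
scores = [0.10, 0.05, 0.85]
prompt = build_replay_prompt(passages, scores, "Who designed the bridge?", k=1)
```

In this sketch the selected passage appears twice in the prompt: once in its original position and once immediately before the question, which is the sense in which evidence is replayed rather than the context being shortened.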

reasoning

Key Terms

long-context-handling, evidence-grounding, attention-based-grounding, inference-time-compute, associative-memory