Retrieving problems with similar reasoning strategies—not just semantic similarity—helps language models solve complex reasoning tasks better, and this works alongside other training improvements.
This paper teaches language models to solve complex problems by learning from analogous examples rather than just semantically similar ones. It introduces a system that retrieves problems with similar reasoning patterns (not just similar wording) and uses reinforcement learning to help models learn from these examples, improving performance on math reasoning tasks.