LLMs perform better when trained on and prompted with frequently occurring textual patterns, much as humans read common words faster; this simple principle can boost performance across multiple tasks.
This paper studies how the frequency of textual patterns affects large language model performance. The authors propose three techniques: rephrasing prompts with more frequent wording, generating training data that uses common expressions, and training models on progressively more frequent text. Experiments on math, translation, reasoning, and tool-use tasks show that these frequency-based approaches improve results.
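The first technique, preferring more frequent phrasings in prompts, can be illustrated with a minimal sketch. The toy corpus, the unigram scoring function, and the candidate prompts below are all illustrative assumptions, not the paper's actual method or data; the idea is simply to rank candidate phrasings by how often their words appear in a reference corpus.

```python
from collections import Counter

# Illustrative toy corpus; a real system would use large-scale n-gram counts.
corpus = (
    "what is the sum of two and three "
    "what is the sum of four and five "
    "compute the aggregate of two and three"
).split()
freq = Counter(corpus)

def frequency_score(phrasing: str) -> float:
    """Mean corpus frequency of the phrasing's words (unseen words score 0)."""
    words = phrasing.lower().split()
    return sum(freq[w] for w in words) / len(words)

# Two hypothetical phrasings of the same question.
candidates = [
    "Compute the aggregate of 2 and 3.",
    "What is the sum of 2 and 3?",
]

# Pick the phrasing built from more common words to use as the prompt.
best = max(candidates, key=frequency_score)
```

In this sketch `best` is the "sum" phrasing, since its words dominate the toy corpus; the same ranking idea extends naturally to n-gram or language-model scores over the whole phrasing.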