This paper analyzes looped transformers (models that iterate a shared block at test time to solve harder problems) and asks when they generalize versus memorize. Its central finding: looped transformers need recall mechanisms combined with outer normalization to generalize reliably to harder problems; without these, they memorize training solutions and fail at test time.
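As a rough illustrative sketch (not the paper's architecture), the two ingredients can be mocked in plain Python: a `step` function stands in for one transformer block, "recall" re-injects the original input at every iteration, and "outer normalization" rescales the final state after the loop. The function name and signature here are hypothetical.

```python
def run_looped(step, x0, n_loops, recall=True, normalize=True):
    """Iterate `step` n_loops times; optionally re-inject the
    input (recall) and rescale the output (outer normalization)."""
    h = list(x0)
    for _ in range(n_loops):
        h = step(h)
        if recall:
            # Recall: add the original input back each iteration,
            # so the loop cannot drift away from the problem instance.
            h = [a + b for a, b in zip(h, x0)]
    if normalize:
        # Outer normalization keeps the output scale stable even when
        # the test-time loop count exceeds the training loop count.
        norm = sum(v * v for v in h) ** 0.5 or 1.0
        h = [v / norm for v in h]
    return h
```

With a toy contraction such as `step = lambda h: [0.5 * v for v in h]`, the normalized output stays on the unit sphere no matter how many extra loops are run at test time, which is the intuition behind pairing recall with outer normalization.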