The Kitchen Loop: User-Spec-Driven Development for a Self-Evolving Codebase

Yannick Roy|March 26, 2026arXiv

Key Takeaway

You can safely automate continuous code improvement by combining LLM agents that act as power users, unbeatable verification tests, and automated pause gates that catch quality degradation before it ships.

Summary

A framework for autonomous software development where LLM agents continuously test and improve code against a specification. The system uses synthetic user testing at 1,000x human speed, ground-truth verification tests, and automated quality gates to safely evolve codebases without human intervention—validated on production systems with 1,000+ merged changes and zero regressions.

agents

Key Terms

agentic-workflows specification-driven synthetic-user-testing regression-detection self-evolution