AI-generated refactoring often improves code but frequently introduces new quality and security issues that developers accept anyway, highlighting the need for automated quality checks before merging AI contributions.
This study examines Python refactoring pull requests created by AI agents, measuring their impact on code quality and security.