Systematically improving agent skills through an external optimizer that suggests bounded edits validated against held-out test performance.