Redefining AI Red Teaming in the Agentic Era: From Weeks to Hours

Raja Sekhar Rao Dheekonda, Will Pearce, Nick Landers|May 5, 2026arXiv

Key Takeaway

Agentic red teaming can dramatically speed up security testing of AI systems by automating workflow construction, letting security teams focus on what vulnerabilities to test rather than how to implement each test.

Summary

This paper introduces an AI red teaming agent that automates adversarial testing of AI systems. Instead of manually building attack workflows over weeks, operators describe their testing goals in natural language, and the agent automatically selects attacks, applies transformations, and scores results—compressing the process from weeks to hours.

safety agents evaluation

Key Terms

red-team adversarial-attack jailbreaking agentic-ai