AVISE: Framework for Evaluating the Security of AI Systems

Mikko Lempinen, Joni Kemppainen, Niklas Raesalmi|April 22, 2026arXiv

Key Takeaway

AI security evaluation needs standardized, automated testing frameworks like AVISE to identify vulnerabilities before deployment—the authors show all tested language models can be jailbroken, highlighting the need for systematic security assessment.

Summary

AVISE is an open-source framework for systematically testing AI systems for security vulnerabilities. The researchers demonstrate it by creating an automated test suite that discovers jailbreak attacks on language models, finding that all nine tested models are vulnerable to varying degrees.

safety evaluation

Key Terms

jailbreaking red-team theory-of-mind adversarial-attack