PIArena: A Platform for Prompt Injection Evaluation

Runpeng Geng, Chenlong Yin, Yanting Wang, Ying Chen, Jinyuan Jia|April 9, 2026arXiv

Key Takeaway

Most prompt injection defenses are weaker than claimed—they fail to generalize across tasks and break down against adaptive attacks, highlighting the need for more robust security approaches.

Summary

PIArena is a unified platform for testing prompt injection attacks and defenses in AI systems. It reveals that current defenses have serious weaknesses: they don't work well across different tasks, fail against adaptive attacks, and struggle when injected instructions align with the model's original purpose.

safety evaluation

Key Terms

prompt-injection adversarial-attack robustness-evaluation adaptive-attack