Strong LLM reasoning doesn't guarantee cooperation in multi-agent settings, but game-theoretic mechanisms such as contracts and third-party mediation can reliably restore cooperative behavior, an important result for safe AI deployment.
This paper tests whether large language models cooperate with other agents in classic game-theoretic scenarios such as the prisoner's dilemma. It finds that stronger LLMs actually defect more often, then evaluates four mechanisms for encouraging cooperation: repeated games, reputation systems, mediators, and contracts.
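To make the underlying tension concrete, here is a minimal sketch of the prisoner's dilemma: in a one-shot game defection is the dominant strategy, while repeated play (one of the four mechanisms above) can sustain cooperation via strategies like tit-for-tat. The payoff values and strategies here are standard textbook assumptions, not taken from the paper.

```python
# Payoffs as (row player, column player), with the standard ordering
# temptation > reward > punishment > sucker (T > R > P > S).
PAYOFFS = {
    ("C", "C"): (3, 3),  # mutual cooperation (R)
    ("C", "D"): (0, 5),  # sucker vs. temptation (S, T)
    ("D", "C"): (5, 0),
    ("D", "D"): (1, 1),  # mutual defection (P)
}

def best_response(opponent_move):
    """Move maximizing the row player's payoff against a fixed opponent move."""
    return max(["C", "D"], key=lambda m: PAYOFFS[(m, opponent_move)][0])

def play_iterated(strategy_a, strategy_b, rounds=10):
    """Play an iterated game; each strategy sees the opponent's history."""
    history_a, history_b = [], []
    score_a = score_b = 0
    for _ in range(rounds):
        move_a = strategy_a(history_b)
        move_b = strategy_b(history_a)
        pa, pb = PAYOFFS[(move_a, move_b)]
        score_a, score_b = score_a + pa, score_b + pb
        history_a.append(move_a)
        history_b.append(move_b)
    return score_a, score_b

def tit_for_tat(opponent_history):
    # Cooperate first, then mirror the opponent's previous move.
    return opponent_history[-1] if opponent_history else "C"

def always_defect(opponent_history):
    return "D"

if __name__ == "__main__":
    # One shot: defecting is best no matter what the opponent does.
    assert best_response("C") == "D" and best_response("D") == "D"
    # Repetition: two tit-for-tat players sustain mutual cooperation...
    print(play_iterated(tit_for_tat, tit_for_tat))      # → (30, 30)
    # ...and each earns far more than under mutual defection.
    print(play_iterated(always_defect, always_defect))  # → (10, 10)
```

This captures why raw strategic competence can cut against cooperation: a stronger reasoner is better at finding the dominant defect strategy, so extra structure (repetition, reputation, mediation, or contracts) is needed to make cooperation individually rational.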