Safety monitors that check only individual user sessions miss coordinated attacks split across accounts. Catching distributed agent misuse requires tracking patterns across groups of users.
Language models can help attackers find vulnerabilities, and attackers are increasingly splitting harmful tasks across multiple accounts to evade detection.
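
To make the contrast between per-session and cross-account monitoring concrete, here is a minimal sketch in Python. Everything in it is an illustrative assumption rather than a method described above: the event tuple `(account_id, task_key, risk)`, the function name `flag_groups`, the thresholds, and especially `task_key`, which stands in for whatever signal links related requests across accounts.

```python
from collections import defaultdict

# Hypothetical event record: (account_id, task_key, risk).
# task_key is an assumed linking signal (e.g. a shared target);
# how to derive it is outside the scope of this sketch.

def flag_groups(events, per_account_threshold=1.0,
                group_threshold=1.0, min_accounts=2):
    """Return task_keys whose combined risk crosses group_threshold
    across at least min_accounts accounts, while every individual
    account stays below per_account_threshold."""
    # Sum risk per (task_key, account_id), as a per-account monitor would.
    per_account = defaultdict(float)
    for account_id, task_key, risk in events:
        per_account[(task_key, account_id)] += risk

    # Regroup the per-account totals by task_key.
    by_task = defaultdict(dict)
    for (task_key, account_id), total in per_account.items():
        by_task[task_key][account_id] = total

    flagged = []
    for task_key, accounts in by_task.items():
        group_total = sum(accounts.values())
        # True when a per-account check alone would raise no alarm.
        evades_per_account = all(
            total < per_account_threshold for total in accounts.values()
        )
        if (len(accounts) >= min_accounts
                and group_total >= group_threshold
                and evades_per_account):
            flagged.append(task_key)
    return sorted(flagged)


events = [
    ("a1", "target-x", 0.4),
    ("a2", "target-x", 0.4),
    ("a3", "target-x", 0.4),
    ("a4", "target-y", 0.3),
]
print(flag_groups(events))
```

In this toy input, no single account reaches the per-account threshold, yet the three accounts working on `target-x` together exceed the group threshold, so only the group-level check flags it. A real system would need a far more careful way to link accounts and score risk than this sketch assumes.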