Words Speak Louder Than Code: Investigating Cognitive Heuristics in LLM-Based Code Vulnerability Detection

Asif Shahriar, Hongyu Cai, Hadjer Benkraouda, Gang Wang, Z. Berkay Celik|June 29, 2026arXiv

Key Takeaway

LLM-based code vulnerability detectors can be manipulated through cognitive heuristics without changing the actual code, making them unreliable for security-critical tasks and vulnerable to adversarial attacks that suppress vulnerability detection.

Summary

This paper reveals that LLMs used for detecting code vulnerabilities are susceptible to cognitive biases—the same mental shortcuts that affect human judgment.

safety evaluation

Key Terms

cognitive-heuristics halo-effect framing-effect anchoring-effect adversarial-attack