LLM-based code vulnerability detectors can be manipulated through cognitive heuristics without changing the actual code, making them unreliable for security-critical tasks and vulnerable to adversarial attacks that suppress vulnerability detection.
This paper reveals that LLMs used for detecting code vulnerabilities are susceptible to cognitive biases—the same mental shortcuts that affect human judgment.