Hallucinations in vision-language models stem primarily from over-reliance on textual instructions rather than from limitations in visual perception, and preference-based fine-tuning can effectively reduce them by teaching models to prioritize visual grounding.
Vision-language models often generate false descriptions that are not supported by the image, especially when the text instructions are misleading. This paper introduces HalluScope, a benchmark for measuring when and why this happens, and HalluVL-DPO, a preference-based fine-tuning method that teaches models to trust the image over the text instruction by learning from pairs of correct and hallucinated responses.
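The summary does not spell out the training objective, but "learning from pairs of correct and hallucinated responses" is the setup of a standard Direct Preference Optimization (DPO) loss. The sketch below is a minimal, illustrative PyTorch version of that objective under the assumption that HalluVL-DPO uses the vanilla DPO formulation; the function name `dpo_loss` and the `beta` value are placeholders of mine, not details taken from the paper.

```python
import torch
import torch.nn.functional as F

def dpo_loss(policy_chosen_logps, policy_rejected_logps,
             ref_chosen_logps, ref_rejected_logps, beta=0.1):
    """Standard DPO loss over (chosen, rejected) response pairs.

    Each argument holds per-response sequence log-probabilities; here
    'chosen' would be the visually grounded answer and 'rejected' the
    hallucinated one.
    """
    # Implicit rewards: how far the trainable policy has shifted probability
    # mass toward each response relative to the frozen reference model.
    chosen_margin = policy_chosen_logps - ref_chosen_logps
    rejected_margin = policy_rejected_logps - ref_rejected_logps

    # Push the grounded response above the hallucinated one; beta scales
    # how strongly the policy is allowed to deviate from the reference.
    return -F.logsigmoid(beta * (chosen_margin - rejected_margin)).mean()

# Toy usage: summed token log-probabilities for a batch of two pairs.
loss = dpo_loss(torch.tensor([-12.3, -8.1]), torch.tensor([-15.0, -9.4]),
                torch.tensor([-13.0, -8.5]), torch.tensor([-14.2, -9.0]))
```

In this reading, the grounded response plays the "chosen" role and the hallucinated one the "rejected" role, so minimizing the loss widens the policy's preference margin toward image-supported answers; any additional terms specific to visual grounding would sit on top of this objective.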