Self-Study Reconsidered: The Hidden Fragility of Learning from Self-Generated QA

Ekaterina Alimaskina, Denis Shveykin, Gleb Molodtsov, Igor Shalygin, Alexey Kadeishvili et al.|June 30, 2026arXiv

Key Takeaway

Synthetic QA generation for model training has hidden failure modes: biased coverage of documents and susceptibility to instruction injection. Simple fixes like anchoring questions to specific targets and filtering instruction-like text can substantially reduce these problems.

Summary

This paper reveals that using synthetic question-answer pairs to train language models is riskier than assumed. Models generating QA pairs don't uniformly cover documents—they focus on salient regions and can be hijacked by artifacts like markup.

training data safety

Key Terms

synthetic-conversation-dataset training-data-curation instruction-following coverage