Neuron-Aware Data Selection for Annotation-Free LLM Self-Distillation

Zhuowei Chen, Xiang Lorraine Li|July 2, 2026arXiv

Key Takeaway

By analyzing which neurons activate during model predictions, you can automatically select better training data and improve self-supervised learning without any human annotations—useful when expert labels are expensive or unavailable.

Summary

This paper proposes Neuron-OPSD, a method for improving large language models without human labels by using the model's internal neuron activations to select which training examples to learn from and how to construct better teacher models. The approach trains the model on its own predictions, achieving better performance on specialized tasks while maintaining general knowledge.

training efficiency data

Key Terms

self-distillation neuron-activation on-policy-learning calibration pseudo-labels