Your UnEmbedding Matrix is Secretly a Feature Lens for Text Embeddings

Songhao Wu, Zhongxin Chen, Yuxuan Liu, Heng Cui, Cong Li et al.|June 5, 2026arXiv

Key Takeaway

LLM embeddings can be significantly improved by filtering out a specific subspace encoded in the unembedding matrix that captures frequent tokens—this also enables dimensionality reduction without quality loss.

Summary

This paper reveals that LLM embeddings are dominated by frequent but meaningless tokens, which hurts their quality for text search tasks. The authors propose EmbedFilter, a simple linear transformation that removes this noise by filtering out the subspace where the model's unembedding matrix writes high-frequency tokens.

efficiency evaluation

Key Terms

unembedding-matrix text-embeddings dimensionality-reduction semantic-representation zero-shot-learning