The process of using a model to convert raw input text into numerical representations (features) that capture the meaning of the text.