A model designed to convert inputs (such as images or text) into numerical representations (embeddings) that capture their content for downstream tasks such as classification or search, rather than to generate new content.
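
To make the idea of "input in, numbers out" concrete, here is a minimal sketch in Python. It is not a real learned model: it uses a simple hashing trick as a stand-in, and the names `encode`, `cosine`, and `dim` are hypothetical choices made for this illustration. The point is only that the function returns a fixed-length vector that can be compared with other vectors, and never produces new text.

```python
import hashlib
import math


def encode(text: str, dim: int = 8) -> list[float]:
    """Toy encoder: map text to a fixed-length, L2-normalised vector.

    Assumption: a hashed bag-of-words stands in for a trained model.
    """
    vec = [0.0] * dim
    for token in text.lower().split():
        # Use a stable hash so the same token always maps to the same slot.
        digest = hashlib.md5(token.encode("utf-8")).digest()
        index = int.from_bytes(digest[:4], "big") % dim
        vec[index] += 1.0
    norm = math.sqrt(sum(v * v for v in vec))
    # Leave the all-zero vector untouched to avoid dividing by zero.
    return [v / norm for v in vec] if norm else vec


def cosine(a: list[float], b: list[float]) -> float:
    """Dot product of two vectors; equals cosine similarity when both are normalised."""
    return sum(x * y for x, y in zip(a, b))


if __name__ == "__main__":
    a = encode("a photo of a cat")
    b = encode("a cat photo")
    print(len(a), round(cosine(a, b), 3))
```

A trained encoder would replace the hashing step with learned parameters, but the interface would look similar: variable-sized input, fixed-size numeric output, with similarity computed on the vectors.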