A model that has been compressed by training a smaller model to mimic a larger, more capable model, reducing size and computational requirements while retaining performance.