The range of topics, styles, and types of text a model was trained on; the model performs best on content similar to this distribution and may struggle outside it.