A training stage between pretraining and post-training in which a model is trained on curated, large-scale data mixtures to strengthen specific capabilities.