A training technique where parts of input text are randomly deleted, masked, or shuffled to teach the model to understand context and recover meaning.
World knowledge accuracy, recall of facts and relationships