A neural network design that combines masked language modeling with permutation language modeling to better understand relationships between words in text.