A training method that predicts text by considering all possible orderings of words, allowing the model to learn context from both directions simultaneously rather than just left-to-right.