A neural network architecture based on BERT (Bidirectional Encoder Representations from Transformers) that uses Transformer encoder layers to model relationships between words in text, building each word's representation from both its left and right context.
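
As a rough illustration of what using context on both sides of a word means, the sketch below computes dot-product attention weights over three toy tokens, once with every position visible (BERT-style) and once with a left-to-right mask for contrast. It is a minimal sketch under stated assumptions, not BERT itself: the embeddings are made-up values, the function names are hypothetical, and the learned query/key/value projections and multiple heads of a real Transformer layer are omitted.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_weights(vectors, causal=False):
    """Return one row of attention weights per token.

    causal=False: each token attends to every token (bidirectional).
    causal=True:  token i attends only to tokens 0..i (left-to-right).
    Assumption: queries and keys are the raw vectors, with no learned projections.
    """
    dim = len(vectors[0])
    rows = []
    for i, query in enumerate(vectors):
        visible = range(i + 1) if causal else range(len(vectors))
        scores = [
            sum(q * k for q, k in zip(query, vectors[j])) / math.sqrt(dim)
            for j in visible
        ]
        row = softmax(scores)
        # Masked (invisible) positions receive zero weight.
        row += [0.0] * (len(vectors) - len(row))
        rows.append(row)
    return rows


# Hypothetical 2-dimensional embeddings, chosen only for illustration.
tokens = ["the", "bank", "river"]
embeddings = [[1.0, 0.0], [0.5, 0.5], [0.0, 1.0]]

bidirectional = attention_weights(embeddings)
left_to_right = attention_weights(embeddings, causal=True)

for name, row_bi, row_l2r in zip(tokens, bidirectional, left_to_right):
    print(name, [round(w, 3) for w in row_bi], [round(w, 3) for w in row_l2r])
```

In the bidirectional case the row for "bank" gives nonzero weight to "river", a word that comes after it; under the left-to-right mask that weight is zero, so the later word cannot influence the representation.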