A neural network trained on large collections of protein sequences to learn statistical patterns in how amino acids follow and co-occur with one another, much as language models learn patterns in text.
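
To make the analogy with text concrete, here is a minimal sketch of how a protein sequence might be prepared for training. It assumes a masked-residue objective, which is one common choice but is not stated in the definition above. The names `tokenize`, `mask_tokens`, and `mask_rate`, and the example sequence, are hypothetical and used only for illustration. No network is trained here: the code only shows that each amino acid plays the role of a token, and that the model's task would be to recover the hidden ones from their context.

```python
import random

# One-letter codes for the 20 standard amino acids.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
MASK = "<mask>"


def tokenize(sequence):
    """Split a protein sequence into one token per residue."""
    tokens = list(sequence.upper())
    unknown = set(tokens) - set(AMINO_ACIDS)
    if unknown:
        raise ValueError(f"unexpected residue codes: {sorted(unknown)}")
    return tokens


def mask_tokens(tokens, mask_rate=0.15, seed=0):
    """Hide a random subset of residues.

    Returns the masked token list and a dict mapping each masked
    position to the residue the model would be asked to predict.
    The 15% default is an assumption borrowed from text models.
    """
    if not tokens:
        return [], {}
    rng = random.Random(seed)
    n_mask = max(1, round(len(tokens) * mask_rate))
    positions = rng.sample(range(len(tokens)), n_mask)
    masked = list(tokens)
    targets = {}
    for pos in positions:
        targets[pos] = tokens[pos]
        masked[pos] = MASK
    return masked, targets


# Hypothetical sequence, for illustration only.
example = "MKTAYIAKQRQISFVKSHFSRQ"
masked, targets = mask_tokens(tokenize(example))
print(" ".join(masked))
print(targets)
```

In a real training setup the masked tokens would be mapped to integer IDs and fed to the network, with the loss computed only at the masked positions. That part is omitted here.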