The underlying structural design of a neural network that determines how it processes and learns from data, distinct from standard transformer designs.