A state-space model architecture designed to process long sequences faster and with less memory than standard Transformer models, whose self-attention cost grows quadratically with sequence length.
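
To give a rough sense of where the speed and memory savings could come from, here is a minimal sketch of a linear state-space recurrence. It assumes a diagonal, time-invariant model over a scalar input sequence. The function name `ssm_scan` and the parameters `a`, `b`, and `c` are hypothetical and chosen only for illustration; this is not the architecture's actual implementation, which would typically learn these parameters and use a more efficient scan.

```python
def ssm_scan(xs, a, b, c):
    """Run a diagonal linear state-space recurrence over a scalar sequence.

    Illustrative assumption, per state dimension i:
        h_t[i] = a[i] * h_{t-1}[i] + b[i] * x_t
        y_t    = sum_i c[i] * h_t[i]
    """
    h = [0.0] * len(a)  # hidden state: its size does not depend on sequence length
    ys = []
    for x in xs:  # a single pass, so time grows linearly with len(xs)
        # Update every state dimension from the previous state and the new input.
        h = [a_i * h_i + b_i * x for a_i, h_i, b_i in zip(a, h, b)]
        # Read out one output value from the current state.
        ys.append(sum(c_i * h_i for c_i, h_i in zip(c, h)))
    return ys


if __name__ == "__main__":
    # An impulse input decays geometrically through a one-dimensional state.
    print(ssm_scan([1.0, 0.0, 0.0], a=[0.5], b=[1.0], c=[1.0]))
```

In this sketch each step touches only the fixed-size state `h`, so the working memory stays constant as the sequence grows, whereas self-attention compares every position with every other position.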