A neural network design that combines Mamba (a fast, efficient sequence model) with Transformer components to balance speed and capability.
Performance retention over long documents and conversations