Researchers have unveiled Mamba-3, a significant evolution in State Space Models (SSM) that shifts the optimization focus squarely onto inference efficiency. This marks a departure from its predecessor, Mamba-2, which prioritized training speed. The latest iteration aims to tackle the growing demand for faster LLM deployment and agentic workflows.
Developed through a collaboration between Carnegie Mellon University, Princeton University, Cartesia AI, and Together AI, Mamba-3 introduces a more expressive recurrence formula, complex-valued state tracking, and a multi-input, multi-output (MIMO) variant. These enhancements reportedly boost accuracy without compromising decoding speed.
