An attention mechanism with linear computational complexity instead of quadratic, enabling faster inference.