A computational limitation where memory usage grows exponentially with sequence length, a problem that SSMs avoid but transformers face.
Performance retention over long documents and conversations