Sequence Models
This chapter explores sequence modeling methods beyond Transformers, including state space models and long-sequence modeling techniques.
Contents:
- State Space Models — S4, Mamba, Selective SSM, Hardware-Aware Scan
- Long Sequence Modeling — xLSTM, RWKV, Hyena, Linear Attention, Long Range Arena Benchmark