RWKV: Reinventing RNNs for the Transformer Era

https://github.com/BlinkDL/RWKV-LM
An RNN built on AFT-type (Attention Free Transformer) linear attention.
Reported performance is comparable to S4.
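
To make "AFT-type linear attention as an RNN" concrete, here is a minimal single-channel sketch. It is not the code from the repository above: the function names, the scalar decay `w`, and the current-token bonus `u` are assumptions chosen for illustration, and the numerical stabilisation a real implementation would need for large keys is left out. It shows that an exponentially decayed, key-weighted average of past values can be computed either as an explicit sum over the history or as a recurrence carrying two running numbers.

```python
import math


def wkv_recurrent(k, v, w, u):
    """Recurrent form: O(T) time, constant-size state (a, b).

    k, v: lists of floats (keys and values for one channel).
    w:    assumed non-negative decay rate applied once per time step.
    u:    assumed bonus added to the key of the current token.
    """
    a = 0.0  # decayed running sum of exp(k_i) * v_i over past tokens
    b = 0.0  # decayed running sum of exp(k_i) over past tokens
    decay = math.exp(-w)
    out = []
    for k_t, v_t in zip(k, v):
        cur = math.exp(u + k_t)  # weight of the current token
        out.append((a + cur * v_t) / (b + cur))
        # Fold the current token into the state for the next step.
        a = decay * a + math.exp(k_t) * v_t
        b = decay * b + math.exp(k_t)
    return out


def wkv_direct(k, v, w, u):
    """Same quantity as an explicit sum over the history: O(T^2) time."""
    out = []
    for t in range(len(k)):
        num = math.exp(u + k[t]) * v[t]
        den = math.exp(u + k[t])
        for i in range(t):
            # Token i is (t - 1 - i) decay steps old when seen from step t.
            weight = math.exp(-(t - 1 - i) * w + k[i])
            num += weight * v[i]
            den += weight
        out.append(num / den)
    return out


if __name__ == "__main__":
    keys = [0.1, -0.3, 0.5, 0.0]
    values = [1.0, 2.0, -1.0, 0.5]
    print(wkv_recurrent(keys, values, w=0.5, u=0.2))
    print(wkv_direct(keys, values, w=0.5, u=0.2))
```

The two functions should agree up to floating-point rounding. The recurrent form is what allows RNN-style inference with a fixed-size state, while the direct sum corresponds to the attention-like view over the whole sequence.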