or: A Pen Is All You Need

Online handwriting recognition with character-level accuracy of 92%. An autoregressive model that uses a 2-layer convnet followed by a transformer with 8 attention heads. 50-minute training on a small cluster.