Charformer
Charformer: Fast Character Transformers Via Gradient-Based Subword Tokenization
A BERT-style language model handles tokens generated by a character-to-tokens transformer encoder.
from a laptop in Sunnyvale
Charformer: Fast Character Transformers Via Gradient-Based Subword Tokenization
A BERT-style language model handles tokens generated by a character-to-tokens transformer encoder.