BERT
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
An encoder-only transformer, like a bidirectional version of GPT

from a laptop in Sunnyvale
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
An encoder-only transformer, like a bidirectional version of GPT