A Deep Learning Lexicon
Brief notes on significant deep learning papers
- Absolute Orientation 1 min read - Jul 20, 2023
- Adadelta 1 min read - Jul 23, 2023
- Adafactor 1 min read - Jul 23, 2023
- Adam 1 min read - Jul 25, 2023
- AdamW 1 min read - Jul 25, 2023
- Adaptive-Span 1 min read - Jul 23, 2023
- Alignment and Uniformity Losses 1 min read - Jul 22, 2023
- All-Attention 1 min read - Jul 24, 2023
- AlphaGo 1 min read - Jul 23, 2023
- AlphaGo Zero 1 min read - Jul 23, 2023
- AmoebaNet 1 min read - Jul 24, 2023
- Attention Is All You Need 1 min read - Jul 22, 2023
- AudioLM 1 min read - Jul 24, 2023
- AWE-SoME 1 min read - Jul 22, 2023
- BagNet 1 min read - Jul 24, 2023
- Batch Augmentation 1 min read - Jul 25, 2023
- BatchNorm vs WeightNorm 1 min read - Jul 25, 2023
- BEiT-3 1 min read - Jul 24, 2023
- BERT 1 min read - Jul 26, 2023
- BIG-Bench Hard 1 min read - Jul 25, 2023
- BigBird 1 min read - Jul 25, 2023
- Bit Diffusion 1 min read - Jul 25, 2023
- BoTNet 1 min read - Jul 26, 2023
- ByT5 1 min read - Jul 23, 2023
- Byte Pair Encoding 1 min read - Jul 25, 2023
- ByteNet 1 min read - Jul 23, 2023
- CANINE 1 min read - Jul 24, 2023
- Cascaded Diffusion 1 min read - Jul 24, 2023
- ChamNet 1 min read - Jul 24, 2023
- Charformer 1 min read - Jul 24, 2023
- Chinchilla 1 min read - Jul 24, 2023
- Chip Placement With Deep RL 1 min read - Jul 24, 2023
- Classifier-Free Guidance 1 min read - Jul 24, 2023
- CLIP 1 min read - Jul 24, 2023
- CoAtNet 1 min read - Jul 25, 2023
- Conditional Bits Back Coding 1 min read - Jul 25, 2023
- CutMix 1 min read - Jul 25, 2023
- DACL 1 min read - Jul 25, 2023
- DeiT 1 min read - Jul 25, 2023
- DeLighT 1 min read - Jul 24, 2023
- Demon 1 min read - Jul 24, 2023
- DSS 1 min read - Jul 25, 2023
- Efficient-VDVAE 1 min read - Jul 26, 2023
- EfficientDet 1 min read - Jul 25, 2023
- EfficientNet-V2 1 min read - Jul 25, 2023
- ELBO Surgery 1 min read - Jul 24, 2023
- ESeg 1 min read - Jul 25, 2023
- Evolved Transformer 1 min read - Jul 25, 2023
- F-VAEs 1 min read - Jul 25, 2023
- FaceFormer 1 min read - Jul 26, 2023
- FastText 1 min read - Jul 25, 2023
- FastText Advances 1 min read - Jul 25, 2023
- FastText Subword 1 min read - Jul 25, 2023
- FastText.Zip 1 min read - Jul 25, 2023
- Flow Models 1 min read - Jul 25, 2023
- Focal Loss 1 min read - Jul 25, 2023
- Funnel Transformer 1 min read - Jul 24, 2023
- G2p-kd 1 min read - Jul 25, 2023
- GPT 1 min read - Jul 26, 2023
- Gradient Descent Overview 1 min read - Jul 24, 2023
- GRU 1 min read - Jul 26, 2023
- GSS 1 min read - Jul 25, 2023
- IGLOO 1 min read - Jul 25, 2023
- Improved Diffusion 1 min read - Jul 25, 2023
- Improving Distributional Similarity 1 min read - Jul 23, 2023
- Intrinsic Dimension 1 min read - Jul 26, 2023
- L0-Normed Sparse Networks 1 min read - Jul 24, 2023
- LambdaNetworks 1 min read - Jul 26, 2023
- LayerNorm 1 min read - Jul 25, 2023
- Linear Identifiability 1 min read - Jul 24, 2023
- Linear Transformers 1 min read - Jul 26, 2023
- Lite Transformer 1 min read - Jul 24, 2023
- Long Range Arena 1 min read - Jul 25, 2023
- Low-Resource mBERT 1 min read - Jul 24, 2023
- LSTM Variants 1 min read - Jul 26, 2023
- LSTMN 1 min read - Jul 26, 2023
- MEGA 1 min read - Jul 26, 2023
- MeshGraphNets 1 min read - Jul 24, 2023
- Mixup 1 min read - Jul 25, 2023
- MobileNet-V3 1 min read - Jul 25, 2023
- MultiModel 1 min read - Jul 23, 2023
- Multiplicative Interactions 1 min read - Jul 26, 2023
- Nesterov's Accelerated Gradient 1 min read - Jul 24, 2023
- NNCP 1 min read - Jul 26, 2023
- NNCP v2 1 min read - Jul 26, 2023
- Noisy Back-Translation 1 min read - Jul 25, 2023
- Noisy Student for ASR 1 min read - Jul 25, 2023
- Noisy Student for ImageNet 1 min read - Jul 25, 2023
- Normalized Word Embeddings 1 min read - Jul 24, 2023
- NVAE 1 min read - Jul 25, 2023
- OnHW-Transformer 1 min read - Jul 26, 2023
- Optimizers Compared 3 min read - Jul 24, 2023
- PaLM 1 min read - Jul 23, 2023
- Primer 1 min read - Jul 26, 2023
- QKNorm 1 min read - Jul 26, 2023
- Quant-Noise 1 min read - Jul 26, 2023
- RandAugment 1 min read - Jul 26, 2023
- Random Erasing 1 min read - Jul 26, 2023
- RASP 1 min read - Jul 26, 2023
- Recurrent Cell Architecture Search 1 min read - Jul 24, 2023
- Recurrent Units Compared 5 min read - Jul 24, 2023
- Relative Representations 1 min read - Jul 20, 2023
- RepLKNet 1 min read - Jul 26, 2023
- RoBERTa 1 min read - Jul 25, 2023
- Routing Transformer 1 min read - Jul 26, 2023
- RWKV 1 min read - Jul 26, 2023
- S4 1 min read - Jul 25, 2023
- ScribbleNet 1 min read - Jul 25, 2023
- SCRN 1 min read - Jul 26, 2023
- Selfie 1 min read - Jul 25, 2023
- Semi-Supervised VAEs 1 min read - Jul 24, 2023
- SENet 1 min read - Jul 25, 2023
- Sharpness-Aware Minimization 1 min read - Jul 23, 2023
- ShiftNet 1 min read - Jul 25, 2023
- ShuffleNet 1 min read - Jul 25, 2023
- ShuffleNet-v2 1 min read - Jul 25, 2023
- SliceNet 1 min read - Jul 23, 2023
- Spatial Gating Unit 1 min read - Jul 25, 2023
- SRU 1 min read - Jul 26, 2023
- Stochastic Depth 1 min read - Jul 25, 2023
- Subspace Diffusion 1 min read - Jul 23, 2023
- Survey of Transformers 1 min read - Jul 24, 2023
- T5 1 min read - Jul 23, 2023
- TCN 1 min read - Jul 26, 2023
- Transformer Decoders 1 min read - Jul 24, 2023
- Transformer-XL 1 min read - Jul 26, 2023
- Transformer++ 1 min read - Jul 26, 2023
- UL2R 1 min read - Jul 24, 2023
- Universal Music Translation 1 min read - Jul 20, 2023
- Variational Diffusion 1 min read - Jul 25, 2023
- Variational Lossy Autoencoder 1 min read - Jul 24, 2023
- VDVAE 1 min read - Jul 26, 2023
- Visualizing RNNs 1 min read - Jul 26, 2023
- ViT 1 min read - Jul 26, 2023
- ViT-C 1 min read - Jul 26, 2023
- Wav2vec 2.0 1 min read - Jul 24, 2023
- WaveGlow 1 min read - Jul 25, 2023
- WaveNet 1 min read - Jul 25, 2023
- Web Resources 4 min read - Jul 24, 2023
- WeightNorm 1 min read - Jul 24, 2023
- Word2vec 1 min read - Jul 25, 2023
- Word2vec 2 1 min read - Jul 25, 2023
- Word2vec Explained 1 min read - Jul 25, 2023