or: Very Deep VAEs Generalize Autoregressive Models And Can Outperform Them On Images

Deep VAEs are competitive on image synthesis benchmarks.
Use coarse-to-fine synthesis, which authors describe as top-down.
For stability in training, use stochastic layers and ignore any updates involving large gradients.