or: Semi-supervised Learning with Deep Generative Models

Unsupervised clustering of data using a supervised VAE model.
It works on MNIST, but is pretty fragile, as others report (see https://github.com/ruishu/vae-clustering)

ruishu finds that the model often crams the data into the wrong cluster, as long as it can model it accurately with an oddly-shaped distribution within that cluster.