PT Unknown
AU Ishaan Gulrajani
   Kundan Kumar
   Faruk Ahmed
   Adrien Ali Taiga
   Francesco Visin
   David Vazquez
   Aaron Courville
TI PixelVAE: A Latent Variable Model for Natural Images
BT 5th International Conference on Learning Representations
PY 2017
DE Deep Learning; Unsupervised Learning
AB Natural image modeling is a landmark challenge of unsupervised learning. Variational Autoencoders (VAEs) learn a useful latent representation and generate samples that preserve global structure but tend to suffer from image blurriness. PixelCNNs model sharp contours and details very well, but lack an explicit latent representation and have difficulty modeling large-scale structure in a computationally efficient way. In this paper, we present PixelVAE, a VAE model with an autoregressive decoder based on PixelCNN. The resulting architecture achieves state-of-the-art log-likelihood on binarized MNIST. We extend PixelVAE to a hierarchy of multiple latent variables at different scales; this hierarchical model achieves competitive likelihood on 64x64 ImageNet and generates high-quality samples on LSUN bedrooms.
ER