PT Unknown AU Ishaan Gulrajani Kundan Kumar Faruk Ahmed Adrien Ali Taiga Francesco Visin David Vazquez Aaron Courville TI PixelVAE: A Latent Variable Model for Natural Images BT 5th International Conference on Learning Representations PY 2017 DE Deep Learning; Unsupervised Learning AB Natural image modeling is a landmark challenge of unsupervised learning. Variational Autoencoders (VAEs) learn a useful latent representation and generate samples that preserve global structure but tend to suffer from image blurriness. PixelCNNs model sharp contours and details very well, but lack an explicit latent representation and have difficulty modeling large-scale structure in a computationally efficient way. In this paper, we present PixelVAE, a VAE model with an autoregressive decoder based on PixelCNN. The resulting architecture achieves state-of-the-art log-likelihood on binarized MNIST. We extend PixelVAE to a hierarchy of multiple latent variables at different scales; this hierarchical model achieves competitive likelihood on 64x64 ImageNet and generates high-quality samples on LSUN bedrooms. ER