%0 Conference Proceedings
%T PixelVAE: A Latent Variable Model for Natural Images
%A Ishaan Gulrajani
%A Kundan Kumar
%A Faruk Ahmed
%A Adrien Ali Taiga
%A Francesco Visin
%A David Vazquez
%A Aaron Courville
%B 5th International Conference on Learning Representations
%D 2017
%F Ishaan Gulrajani2017
%O ADAS; 600.085; 600.076; 601.281; 600.118
%O exported from refbase (http://refbase.cvc.uab.es/show.php?record=2815), last updated on Thu, 04 Apr 2019 12:38:22 +0200
%X Natural image modeling is a landmark challenge of unsupervised learning. Variational Autoencoders (VAEs) learn a useful latent representation and generate samples that preserve global structure but tend to suffer from image blurriness. PixelCNNs model sharp contours and details very well, but lack an explicit latent representation and have difficulty modeling large-scale structure in a computationally efficient way. In this paper, we present PixelVAE, a VAE model with an autoregressive decoder based on PixelCNN. The resulting architecture achieves state-of-the-art log-likelihood on binarized MNIST. We extend PixelVAE to a hierarchy of multiple latent variables at different scales; this hierarchical model achieves competitive likelihood on 64x64 ImageNet and generates high-quality samples on LSUN bedrooms.
%K Deep Learning
%K Unsupervised Learning
%U http://104.155.136.4:3000/pdf?id=BJKYvt5lg
%U http://refbase.cvc.uab.es/files/gka2016.pdf