Structured and Efficient Variational Deep Learning with Matrix Gaussian Posteriors

Open Access
Authors C. Louizos, M. Welling
Publication date 2016
Journal JMLR Workshop and Conference Proceedings
Event International Conference on Machine Learning
Volume 48
Pages (from-to) 1708-1716
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
  • Faculty of Science (FNWI)
Abstract
We introduce a variational Bayesian neural network where the parameters are governed by a probability distribution on random matrices. Specifically, we employ a matrix variate Gaussian (Gupta & Nagar ’99) parameter posterior distribution in which we explicitly model the covariance among the input and output dimensions of each layer. Furthermore, by using approximate covariance matrices we obtain a more efficient way to represent those correlations, one that is also cheaper than fully factorized parameter posteriors. We further show that applying the “local reparametrization trick” (Kingma et al. ’15) to this posterior distribution yields a Gaussian process (Rasmussen ’06) interpretation of the hidden units in each layer, and, similarly to Gal & Ghahramani (’15), we provide connections with deep Gaussian processes. We then take advantage of this duality and incorporate “pseudo-data” (Snelson & Ghahramani ’05) into our model, which in turn allows for more efficient posterior sampling while maintaining the properties of the original model. The validity of the proposed approach is verified through extensive experiments.
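To make the layer posterior in the abstract concrete, the following is a minimal NumPy sketch (not the authors’ code; the sizes and names M, Lu, Lv, x are hypothetical) of the local reparametrization under a matrix variate Gaussian posterior: for W ~ MN(M, U, V), the pre-activation b = Wᵀx is Gaussian with mean Mᵀx and covariance (xᵀUx)·V, so b can be sampled directly without materializing W.

```python
import numpy as np

rng = np.random.default_rng(0)

n_in, n_out = 4, 3                         # hypothetical layer sizes
M = rng.normal(size=(n_in, n_out))         # posterior mean of the weights W

# Lower-triangular factors of the row/column covariances:
# U = Lu @ Lu.T (input dims), V = Lv @ Lv.T (output dims).
Lu = np.tril(rng.normal(size=(n_in, n_in))) + n_in * np.eye(n_in)
Lv = np.tril(rng.normal(size=(n_out, n_out))) + n_out * np.eye(n_out)
U, V = Lu @ Lu.T, Lv @ Lv.T

x = rng.normal(size=n_in)                  # a single input to the layer

# Local reparametrization: b = W.T @ x has mean M.T @ x and
# covariance (x.T @ U @ x) * V, so only an n_out-dimensional
# Gaussian draw is needed per example.
b = M.T @ x + np.sqrt(x @ U @ x) * (Lv @ rng.normal(size=n_out))

# Equivalent (but more expensive) explicit weight sample:
# W = M + Lu @ E @ Lv.T with E i.i.d. standard normal is MN(M, U, V).
E = rng.normal(size=(n_in, n_out))
b_explicit = (M + Lu @ E @ Lv.T).T @ x
```

Sampling the pre-activation directly in this way is what makes the scheme cheap: the per-example cost scales with the layer widths rather than with the full n_in × n_out weight matrix.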
Document type Article
Note With supplementary material. - International Conference on Machine Learning, 20-22 June 2016, New York, New York, USA.
Language English
Published at http://proceedings.mlr.press/v48/louizos16.html
Downloads
louizos16 (Final published version)
Supplementary materials