The Role of the Information Bottleneck in Representation Learning

Vera Matias; Pablo Piantanida; Leonardo Rey Vega

doi:10.1109/isit.2018.8437679

Communication Dans Un Congrès Année : 2018

The Role of the Information Bottleneck in Representation Learning

(1) , (2) , (1)

1
2

Vera Matias

Fonction : Auteur

Consejo Nacional de Investigaciones Científicas y Técnicas [Buenos Aires]

Pablo Piantanida

Fonction : Auteur
PersonId : 736967
IdHAL : pablo-piantanida
ORCID : 0000-0002-8717-2117

Laboratoire des signaux et systèmes

Leonardo Rey Vega

Fonction : Auteur

Consejo Nacional de Investigaciones Científicas y Técnicas [Buenos Aires]

Résumé

A grand challenge in representation learning is the development of computational algorithms that learn the different explanatory factors of variation behind high-dimensional data. Encoder models are usually determined to optimize performance on training data when the real objective is to generalize well to other (unseen) data. Although numerical evidence suggests that noise injection at the level of representations might improve the generalization ability of the resulting encoders, an information-theoretic justification of this principle remains elusive. In this work, we derive an upper bound to the so-called generalization gap corresponding to the cross-entropy loss and show that when this bound times a suitable multiplier and the empirical risk are minimized jointly, the problem is equivalent to optimizing the Information Bottleneck objective with respect to the empirical data-distribution. We specialize our general conclusions to analyze the dropout regularization method in deep neural networks, explaining how this regularizer helps to decrease the generalization gap.

Domaines

Théorie de l'information [cs.IT] Théorie de l'information et codage [math.IT] Statistiques [math.ST] Intelligence artificielle [cs.AI]

Pablo Piantanida : Connectez-vous pour contacter le contributeur

https://centralesupelec.hal.science/hal-01756003

Soumis le : samedi 31 mars 2018-10:47:36

Dernière modification le : lundi 18 mars 2024-03:05:32

Dates et versions

hal-01756003 , version 1 (31-03-2018)

Identifiants

HAL Id : hal-01756003 , version 1
DOI : 10.1109/isit.2018.8437679

Citer

Vera Matias, Pablo Piantanida, Leonardo Rey Vega. The Role of the Information Bottleneck in Representation Learning. IEEE International Symposium on Information Theory (ISIT 2018), Jun 2018, Vail, United States. ⟨10.1109/isit.2018.8437679⟩. ⟨hal-01756003⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS SUP_LSS SUP_TELECOMS CENTRALESUPELEC UNIV-PARIS-SACLAY GS-ENGINEERING GS-COMPUTER-SCIENCE

351 Consultations

0 Téléchargements

The Role of the Information Bottleneck in Representation Learning

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Altmetric

Partager