Conference paper, Year: 2019

Interpreting Neural Networks as Majority Votes through the PAC-Bayesian Theory

Paul Viallard
Rémi Emonet
Pascal Germain
Amaury Habrard
Emilie Morvant

Abstract

We propose a PAC-Bayesian theoretical study of the two-phase learning procedure of a neural network introduced by Kawaguchi et al. (2017). In this procedure, a network is expressed as a weighted combination of all the paths of the network (from the input layer to the output layer), which we reformulate as a PAC-Bayesian majority vote. Starting from this observation, their learning procedure consists of (1) learning a "prior" network to fix some parameters, then (2) learning a "posterior" network by allowing only the weights over the paths of the prior network to be modified. This allows us to derive a PAC-Bayesian generalization bound that involves the empirical individual risks of the paths (known as the Gibbs risk) and the empirical diversity between pairs of paths. Note that, as in classical PAC-Bayesian bounds, our result involves a KL-divergence term between the "prior" network and the "posterior" network. We show that this term is computable by dynamic programming without assuming any distribution on the network weights.
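The dynamic-programming claim in the last sentence can be illustrated concretely. The sketch below (in Python; the function names are hypothetical and this is not the paper's actual KL computation) shows the general principle it relies on: a quantity defined as a sum over all input-to-output paths of a feed-forward network factorizes across layers, so it can be accumulated layer by layer instead of enumerating the exponentially many paths.

```python
import itertools
import numpy as np

# Minimal illustration (not the paper's algorithm): for a feed-forward
# network with weight matrices W_1, ..., W_L, the sum over every
# input-to-output path of the product of the edge weights along that path
# factorizes across layers. The number of paths is the product of the
# layer widths, but the dynamic program below never enumerates them.

def path_sum_dp(weights):
    """Sum of path-weight products, computed layer by layer.

    weights: list of (n_out, n_in) numpy arrays, one per layer.
    """
    # acc[j] = sum, over all partial paths reaching unit j, of the
    # product of the edge weights along that partial path
    acc = np.ones(weights[0].shape[1])
    for W in weights:
        acc = W @ acc  # extend every partial path by one layer
    return acc.sum()

def path_sum_bruteforce(weights):
    """Same quantity by enumerating every path explicitly (exponential)."""
    sizes = [weights[0].shape[1]] + [W.shape[0] for W in weights]
    total = 0.0
    for path in itertools.product(*(range(s) for s in sizes)):
        prod = 1.0
        for layer, W in enumerate(weights):
            prod *= W[path[layer + 1], path[layer]]
        total += prod
    return total

rng = np.random.default_rng(0)
ws = [rng.normal(size=(4, 3)), rng.normal(size=(2, 4))]
assert np.isclose(path_sum_dp(ws), path_sum_bruteforce(ws))
```

Here `path_sum_dp` runs in time linear in the number of edges, while the brute-force version visits every path. The KL-divergence term in the paper is a different path-indexed quantity, but a similar layer-by-layer recursion is presumably what makes such sums over paths tractable.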
No file deposited

Dates and versions

hal-02335762, version 1 (28-10-2019)

Identifiers

  • HAL Id: hal-02335762, version 1

Cite

Paul Viallard, Rémi Emonet, Pascal Germain, Amaury Habrard, Emilie Morvant. Interpreting Neural Networks as Majority Votes through the PAC-Bayesian Theory. Workshop on Machine Learning with guarantees @ NeurIPS 2019, Dec 2019, Vancouver, Canada. ⟨hal-02335762⟩
