Conference paper, Year: 2019

Interpreting Neural Networks as Majority Votes through the PAC-Bayesian Theory

Paul Viallard
Rémi Emonet
Pascal Germain
Amaury Habrard
Emilie Morvant

Abstract

We propose a PAC-Bayesian theoretical study of the two-phase learning procedure of a neural network introduced by Kawaguchi et al. (2017). In this procedure, a network is expressed as a weighted combination of all the paths of the network (from the input layer to the output layer), which we reformulate as a PAC-Bayesian majority vote. Starting from this observation, their learning procedure consists of (1) learning a "prior" network to fix some parameters, then (2) learning a "posterior" network by allowing only the weights over the paths of the prior network to be modified. This allows us to derive a PAC-Bayesian generalization bound that involves the empirical individual risks of the paths (known as the Gibbs risk) and the empirical diversity between pairs of paths. Note that, as in classical PAC-Bayesian bounds, our result involves a KL-divergence term between the "prior" network and the "posterior" network. We show that this term is computable by dynamic programming without assuming any distribution on the network weights.
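The dynamic-programming claim in the last sentence can be illustrated concretely. The sketch below (in Python; the function names are hypothetical and this is not the paper's actual KL computation) shows the general principle it relies on: a quantity defined as a sum over all input-to-output paths of a feed-forward network factorizes across layers, so it can be accumulated layer by layer instead of enumerating the exponentially many paths.

```python
import itertools
import numpy as np

# Minimal illustration (not the paper's algorithm): for a feed-forward
# network with weight matrices W_1, ..., W_L, the sum over every
# input-to-output path of the product of the edge weights along that path
# factorizes across layers. The number of paths is the product of the
# layer widths, but the dynamic program below never enumerates them.

def path_sum_dp(weights):
    """Sum of path-weight products, computed layer by layer.

    weights: list of (n_out, n_in) numpy arrays, one per layer.
    """
    # acc[j] = sum, over all partial paths reaching unit j, of the
    # product of the edge weights along that partial path
    acc = np.ones(weights[0].shape[1])
    for W in weights:
        acc = W @ acc  # extend every partial path by one layer
    return acc.sum()

def path_sum_bruteforce(weights):
    """Same quantity by enumerating every path explicitly (exponential)."""
    sizes = [weights[0].shape[1]] + [W.shape[0] for W in weights]
    total = 0.0
    for path in itertools.product(*(range(s) for s in sizes)):
        prod = 1.0
        for layer, W in enumerate(weights):
            prod *= W[path[layer + 1], path[layer]]
        total += prod
    return total

rng = np.random.default_rng(0)
ws = [rng.normal(size=(4, 3)), rng.normal(size=(2, 4))]
assert np.isclose(path_sum_dp(ws), path_sum_bruteforce(ws))
```

Here `path_sum_dp` runs in time linear in the number of edges, while the brute-force version visits every path. The KL-divergence term in the paper is a different path-indexed quantity, but a similar layer-by-layer recursion is presumably what makes such sums over paths tractable.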
No file deposited

Dates and versions

hal-02335762, version 1 (28-10-2019)

Identifiers

  • HAL Id: hal-02335762, version 1

Cite

Paul Viallard, Rémi Emonet, Pascal Germain, Amaury Habrard, Emilie Morvant. Interpreting Neural Networks as Majority Votes through the PAC-Bayesian Theory. Workshop on Machine Learning with guarantees @ NeurIPS 2019, Dec 2019, Vancouver, Canada. ⟨hal-02335762⟩
