Adversarial attacks via backward error analysis

Théo Beuzeville; Pierre Boudier; Alfredo Buttari; Serge Gratton; Théo Mary; Stéphane Pralet

Preprint, Working Paper. Year: 2021

Théo Beuzeville

Role: Author
PersonId: 1106146
IdHAL: theo-beuzeville
ORCID: 0009-0002-6031-0900

Algorithmes Parallèles et Optimisation

Atos [Bezons]

Pierre Boudier

Role: Author
PersonId: 1106147

NVIDIA

Alfredo Buttari

Role: Author
PersonId: 170442
IdHAL: alfredo-buttari
ORCID: 0000-0003-3207-7021
IdRef: 167548999

Algorithmes Parallèles et Optimisation

Centre National de la Recherche Scientifique

Serge Gratton

Role: Author
PersonId: 745245
IdHAL: serge-gratton
ORCID: 0000-0002-5021-2357
IdRef: 059873736

Algorithmes Parallèles et Optimisation

Théo Mary

Role: Author
PersonId: 178018
IdHAL: tmary
ORCID: 0000-0001-9949-4634
IdRef: 230009417

Performance et Qualité des Algorithmes Numériques

Stéphane Pralet

Role: Author
PersonId: 1106148

Atos [Bezons]

Abstract

Backward error (BE) analysis was developed and popularized by James Wilkinson in the 1950s and 1960s, with origins in the works of von Neumann and Goldstine (1947) and Turing (1948). It is a fundamental notion in numerical linear algebra software, serving as both a theoretical and a practical tool for the rounding error analysis of numerical algorithms. Broadly speaking, the backward error quantifies, in terms of a perturbation of the input data, by how much the output of an algorithm fails to equal an expected quantity. For a given computed solution y, this amounts to computing the norm of the smallest perturbation ∆x of the input data x such that y is an exact solution of a perturbed system: f(x + ∆x) = y. Until now, BE analysis has been applied to numerous linear algebra problems, always with the objective of quantifying the robustness of algebraic processes with respect to rounding errors stemming from finite-precision computations. While deep neural networks (DNNs) have achieved unprecedented success in numerous machine learning tasks across various domains, their robustness to adversarial attacks, rounding errors, and quantization processes has raised considerable concern in the machine learning community. In this work, we generalize BE analysis to DNNs. This enables us to obtain closed-form formulas and a numerical algorithm for computing adversarial attacks. By construction, these attacks are optimal, and therefore smaller in norm than perturbations obtained with existing gradient-based approaches. We present numerical results that support our theoretical findings and illustrate the relevance of our approach on well-known datasets.
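
To make the definition f(x + ∆x) = y concrete, the following minimal sketch treats the simplest case, where f is linear, f(x) = Wx. This is an illustrative assumption and not the DNN setting addressed in the paper. When the target y is reachable, the smallest 2-norm perturbation is the minimum-norm solution of W ∆x = y − Wx, which the Moore-Penrose pseudoinverse provides. The names W, x, y and the helper min_norm_perturbation are hypothetical.

```python
import numpy as np

def min_norm_perturbation(W, x, y):
    """Smallest 2-norm dx with W @ (x + dx) == y.

    Assumes y - W @ x lies in the range of W (illustrative linear case only).
    """
    residual = y - W @ x
    # The pseudoinverse returns the minimum-norm solution of W @ dx = residual.
    return np.linalg.pinv(W) @ residual

rng = np.random.default_rng(0)
W = rng.standard_normal((3, 5))  # wide matrix: full row rank almost surely
x = rng.standard_normal(5)       # input data
y = rng.standard_normal(3)       # output we want to reach exactly

dx = min_norm_perturbation(W, x, y)

# Any other exact perturbation differs from dx by a null-space vector of W
# and is therefore at least as large in the 2-norm.
null_vec = np.linalg.svd(W)[2][-1]  # last right singular vector, in the null space of W
other = dx + 0.5 * null_vec

backward_error = np.linalg.norm(dx)
```

Here backward_error is the norm of the smallest input perturbation that makes y an exact output, which is the quantity the abstract describes; other is a second valid perturbation kept only for comparison.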

Domains

Artificial intelligence [cs.AI]; Numerical analysis [math.NA]

Main file

Adversarial_BE.pdf (483.66 KB)

Origin: Files produced by the author(s)

https://ut3-toulouseinp.hal.science/hal-03296180

Submitted on: Thursday, December 9, 2021, 09:56:55

Last modified on: Wednesday, April 17, 2024, 14:27:57

Dates and versions

hal-03296180, version 1 (22-07-2021)

hal-03296180, version 2 (07-12-2021)

hal-03296180, version 3 (09-12-2021)

Identifiers

HAL Id: hal-03296180, version 3

Cite

Théo Beuzeville, Pierre Boudier, Alfredo Buttari, Serge Gratton, Théo Mary, et al. Adversarial attacks via backward error analysis. 2021. ⟨hal-03296180v3⟩

Collections

UNIV-TLSE2 CNRS LIP6 UT1-CAPITOLE TDS-MACS SORBONNE-UNIVERSITE SU-SCIENCES IRIT IRIT-APO ANR ANITI IRIT-CISO IRIT-CNRS IRIT-INPT TOULOUSE-INP UNIV-UT3 UT3-TOULOUSEINP

691 views

228 downloads
