Walking on the Edge: Fast, Low-Distortion Adversarial Examples - INRIA - Institut National de Recherche en Informatique et en Automatique Accéder directement au contenu
Pré-Publication, Document De Travail Année : 2019

Walking on the Edge: Fast, Low-Distortion Adversarial Examples

Résumé

Adversarial examples of deep neural networks are receiving ever increasing attention because they help in understanding and reducing the sensitivity to their input. This is natural given the increasing applications of deep neural networks in our everyday lives. When white-box attacks are almost always successful, it is typically only the distortion of the perturbations that matters in their evaluation. In this work, we argue that speed is important as well, especially when considering that fast attacks are required by adversarial training. Given more time, iterative methods can always find better solutions. We investigate this speed-distortion trade-off in some depth and introduce a new attack called boundary projection (BP) that improves upon existing methods by a large margin. Our key idea is that the classification boundary is a manifold in the image space: we therefore quickly reach the boundary and then optimize distortion on this manifold.

Dates et versions

hal-02404216 , version 1 (11-12-2019)

Identifiants

Citer

Hanwei Zhang, Yannis Avrithis, Teddy Furon, Laurent Amsaleg. Walking on the Edge: Fast, Low-Distortion Adversarial Examples. 2019. ⟨hal-02404216⟩
78 Consultations
0 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More