Combinatorial Tiling for Sparse Neural Networks - INRIA - Institut National de Recherche en Informatique et en Automatique Accéder directement au contenu
Rapport (Rapport De Recherche) Année : 2020

Combinatorial Tiling for Sparse Neural Networks

Résumé

Sparse deep neural networks (DNNs) emerged as the result of search for networks with less storage and lower computational complexity. The sparse DNN inference is the task of using such trained DNN networks to classify a batch of input data. We propose an efficient, hybrid model- and data-parallel DNN inference using hypergraph models and partitioners. We exploit tiling and weak synchronization to increase cache reuse, hide load imbalance, and hide synchronisation costs. Finally, a blocking approach allows application of this new hybrid inference procedure for deep neural networks. We initially experiment using the hybrid tiled inference approach only, using the first five layers of networks from the IEEE HPEC 2019 Graph Challenge, and attain up to 2x speedup versus a data-parallel baseline.
Fichier principal
Vignette du fichier
RR-9357 (1).pdf (818.99 Ko) Télécharger le fichier

Dates et versions

hal-02910997 , version 1 (03-08-2020)
hal-02910997 , version 2 (03-08-2020)
hal-02910997 , version 3 (03-09-2020)

Identifiants

  • HAL Id : hal-02910997 , version 1

Citer

Filip Pawłowski, Rob H Bisseling, Bora Uçar, Albert-Jan Yzelman. Combinatorial Tiling for Sparse Neural Networks. [Research Report] RR-9357, Inria - Research Centre Grenoble – Rhône-Alpes. 2020. ⟨hal-02910997v1⟩
294 Consultations
505 Téléchargements

Partager

Gmail Facebook X LinkedIn More