Pyramid Scene Parsing Network in 3D: improving semantic segmentation of point clouds with multi-scale contextual information

Hao Fang; Florent Lafarge

Journal article, ISPRS Journal of Photogrammetry and Remote Sensing, Year: 2019


Hao Fang

Role: Author

Geometric Modeling of 3D Environments

Florent Lafarge

Role: Author

Geometric Modeling of 3D Environments

Abstract

Analyzing and extracting geometric features from 3D data is a fundamental step in 3D scene understanding. Recent work has demonstrated that deep learning architectures can operate directly on raw point clouds, i.e. without the use of intermediate grid-like structures. These architectures are, however, not designed to efficiently encode contextual information between objects. Inspired by a global feature aggregation algorithm designed for images, we propose a 3D pyramid module to enrich pointwise features with multi-scale contextual information. Our module can be easily coupled with semantic segmentation methods operating on 3D point clouds. We evaluated our method on three large-scale datasets with four baseline models. Experimental results show that the use of enriched features brings significant improvements to the semantic segmentation of indoor and outdoor scenes.
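The idea of enriching pointwise features with multi-scale context can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not specify how the pyramid is built, so the sketch assumes a regular grid partition of the bounding box at several resolutions, max pooling within each cell, and concatenation of the pooled vectors back onto each point. The function name `pyramid_context_features` and the `grid_sizes` values are hypothetical.

```python
import numpy as np


def pyramid_context_features(points, features, grid_sizes=(1, 2, 4)):
    """Append multi-scale pooled context to per-point features.

    points:     (N, 3) array of xyz coordinates
    features:   (N, C) array of per-point features
    grid_sizes: cells per axis at each pyramid level (assumed values)
    Returns an (N, C * (1 + len(grid_sizes))) array.
    """
    points = np.asarray(points, dtype=float)
    features = np.asarray(features, dtype=float)

    # Normalize coordinates to [0, 1] over the bounding box.
    mins = points.min(axis=0)
    extent = points.max(axis=0) - mins
    extent[extent == 0] = 1.0  # avoid division by zero on flat axes
    normalized = (points - mins) / extent

    enriched = [features]
    for g in grid_sizes:
        # Integer cell index per axis; clip so points at 1.0 land in the last cell.
        cells = np.minimum((normalized * g).astype(int), g - 1)
        cell_ids = cells[:, 0] * g * g + cells[:, 1] * g + cells[:, 2]

        # Max-pool features within each occupied cell.
        unique_ids, inverse = np.unique(cell_ids, return_inverse=True)
        inverse = inverse.ravel()
        pooled = np.full((len(unique_ids), features.shape[1]), -np.inf)
        np.maximum.at(pooled, inverse, features)

        # Broadcast each cell's pooled vector back to the points it contains.
        enriched.append(pooled[inverse])

    return np.concatenate(enriched, axis=1)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    pts = rng.random((1000, 3))
    feats = rng.random((1000, 8))
    out = pyramid_context_features(pts, feats)
    print(out.shape)
```

With a grid size of 1 the pooled vector is a single global descriptor shared by all points; finer grids add progressively more local context. In a learned model the concatenated features would typically pass through further layers before classification, which this sketch leaves out.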

Keywords

Point Cloud; Semantic Segmentation; Deep Learning; Multi-scale Contextual Information

Domains

Discrete Mathematics [cs.DM]

Main file

elsarticle-template.pdf (1.68 MB)

jprs19.png (41.26 KB)

Origin: Files produced by the author(s)


https://inria.hal.science/hal-02159279

Submitted on: Tuesday, June 18, 2019, 16:00:48

Last modified on: Wednesday, March 15, 2023, 08:58:09

Dates and versions

hal-02159279 , version 1 (18-06-2019)

Identifiers

HAL Id: hal-02159279, version 1

Cite

Hao Fang, Florent Lafarge. Pyramid Scene Parsing Network in 3D: improving semantic segmentation of point clouds with multi-scale contextual information. ISPRS Journal of Photogrammetry and Remote Sensing, In press. ⟨hal-02159279⟩


Collections

INRIA INRIA2 TDS-MACS UNIV-COTEDAZUR

273 views

1585 downloads
