Immersive Video Coding: Should Geometry Information be Transmitted as Depth Maps?

Patrick Garus; Felix Henry; Joël Jung; Thomas Maugey; Christine Guillemot

doi:10.1109/TCSVT.2021.3100006

Journal article, IEEE Transactions on Circuits and Systems for Video Technology, Year: 2022

Patrick Garus

Role: Author

Orange Labs [Cesson-Sévigné]

Felix Henry

Role: Author

Orange Labs [Cesson-Sévigné]

Joël Jung

Role: Author
PersonId: 184915
IdHAL: joel-jung
ORCID: 0000-0002-3878-6454
IdRef: 060868821

Tencent Media Lab [US]

Thomas Maugey

Role: Author
PersonId: 2482
IdHAL: tmaugey
ORCID: 0000-0002-7149-0823
IdRef: 179788108

Analysis representation, compression and communication of visual data

Christine Guillemot

Role: Author

Analysis representation, compression and communication of visual data

Abstract

Immersive video often refers to multiple views with texture and scene geometry information, from which different viewports can be synthesized on the client side. To design efficient immersive video coding solutions, it is desirable to minimize bitrate, pixel rate and complexity. We investigate whether the classical approach of sending the geometry of a scene as depth maps is appropriate for this purpose. Previous work shows that bypassing depth transmission entirely and estimating depth at the client side improves the synthesis performance while saving bitrate and pixel rate. To understand whether the encoder side depth maps contain information that is worth transmitting, we first explore a hybrid approach that enables partial depth map transmission using a block-based rate-distortion (RD) decision in the depth coding process. This approach reveals that partial depth map transmission may improve the rendering performance but does not offer a good compromise in terms of compression efficiency. This led us to address the remaining drawbacks of decoder side depth estimation: complexity and depth map inaccuracy. We propose a novel system that takes advantage of high quality depth maps at the server side by encoding them into lightweight features that support the depth estimator at the client side. These features reduce the amount of data that has to be handled during decoder side depth estimation by 88%, which significantly speeds up the cost computation and the energy minimization of the depth estimator. Furthermore, average synthesis BD-Rate gains of -46.0% and -37.9% are achieved compared to the classical approach with depth maps estimated at the encoder.
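
The block-based RD decision mentioned in the abstract is not detailed on this page. As a minimal sketch, assuming the usual Lagrangian cost J = D + λR and hypothetical per-block distortion and rate estimates, a per-block choice between transmitting and skipping a depth block could look like the following. The names `BlockOption`, `rd_cost` and `transmission_mask` are illustrative and are not taken from the paper.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class BlockOption:
    """Hypothetical cost pair for one way of handling a depth block."""
    distortion: float  # assumed error measure if this option is chosen
    rate: float        # assumed number of bits spent by this option


def rd_cost(option: BlockOption, lam: float) -> float:
    """Lagrangian rate-distortion cost J = D + lambda * R."""
    return option.distortion + lam * option.rate


def transmission_mask(
    blocks: List[Tuple[BlockOption, BlockOption]], lam: float
) -> List[bool]:
    """For each (send, skip) pair, return True when sending the block is cheaper.

    'send' models transmitting the encoder side depth block; 'skip' models
    leaving the block to be estimated at the decoder (illustrative only).
    """
    return [rd_cost(send, lam) < rd_cost(skip, lam) for send, skip in blocks]


# Two made-up blocks: sending pays off for the first one only.
example_blocks = [
    (BlockOption(distortion=10.0, rate=100.0), BlockOption(distortion=50.0, rate=0.0)),
    (BlockOption(distortion=10.0, rate=100.0), BlockOption(distortion=12.0, rate=0.0)),
]
mask = transmission_mask(example_blocks, lam=0.1)
```

A larger λ penalizes rate more heavily, so fewer blocks are transmitted. The numbers above are invented for illustration and do not reflect results from the article.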

Keywords

MPEG; decoder side depth estimation; Feature-Driven Depth Estimation; Immersive Video

Domains

Signal and Image Processing [eess.SP]

Main file

CSVT2021_FINAL_VERSION.pdf (40.77 MB)

Origin: Files produced by the author(s)

Contributor: Christine Guillemot

https://hal.science/hal-03303040

Submitted on: Tuesday, July 27, 2021, 21:29:43

Last modified on: Friday, March 24, 2023, 14:53:22

Long-term archiving on: Thursday, October 28, 2021, 18:43:39

Dates and versions

hal-03303040, version 1 (27-07-2021)

Identifiers

HAL Id: hal-03303040, version 1
DOI: 10.1109/TCSVT.2021.3100006

Cite

Patrick Garus, Felix Henry, Joël Jung, Thomas Maugey, Christine Guillemot. Immersive Video Coding: Should Geometry Information be Transmitted as Depth Maps? IEEE Transactions on Circuits and Systems for Video Technology, 2022, 32 (5), pp. 3250-3264. ⟨10.1109/TCSVT.2021.3100006⟩. ⟨hal-03303040⟩

Collections

UNIV-RENNES1 CNRS INRIA INSA-RENNES IRISA CENTRALESUPELEC INRIA2 UR1-MATH-STIC UR1-UFR-ISTIC UNIV-RENNES UR1-MATH-NUM
