A Multi-stage deep architecture for summary generation of soccer videos

Melissa Sanabria; Thomas Menguy; Frédéric Precioso; Pierre-Alexandre Mattei

Preprint, Working Paper. Year: 2022

Melissa Sanabria

Role: Author
PersonId : 736102
IdHAL : melissa-sanabria
ORCID : 0000-0003-4345-1074

Modèles et algorithmes pour l’intelligence artificielle

Thomas Menguy

Role: Author

Wildmoka

Frédéric Precioso

Role: Author
PersonId : 9244
IdHAL : frederic-precioso
ORCID : 0000-0001-8712-1443
IdRef : 087273934

Modèles et algorithmes pour l’intelligence artificielle

Université Côte d'Azur

Laboratoire d'Informatique, Signaux, et Systèmes de Sophia Antipolis

Pierre-Alexandre Mattei

Role: Author
PersonId : 8469
IdHAL : pierre-alexandre-mattei
IdRef : 224920278

Modèles et algorithmes pour l’intelligence artificielle

Université Côte d'Azur

Abstract

Video content is present in an ever-increasing number of fields, both scientific and commercial. Sports, and soccer in particular, is one of the industries that has invested the most in video analytics, owing to the massive popularity of the game and the emergence of new markets. Previous state-of-the-art methods for soccer match video summarization rely on handcrafted heuristics to generate summaries, and these heuristics generalize poorly; these works have nonetheless shown that multiple modalities help detect the best actions of the game. On the other hand, machine learning models with higher generalization potential have entered the field of general-purpose video summarization, offering several deep learning approaches. However, most of them exploit content specificities that are not appropriate for whole-match sports videos. Although video content has for many years been the main source for automating knowledge extraction in soccer, the data that records all the events happening on the field has recently become very important in sports analytics, since this event data provides richer context information and requires less processing. We propose a method to generate the summary of a soccer match that exploits both the audio and the event metadata. The results show that our method can detect the actions of the match, identify which of these actions should belong to the summary, and then propose multiple candidate summaries that are similar enough, yet with relevant variability, to provide different options to the final editor. Furthermore, we show the generalization capability of our work, since it can transfer knowledge between datasets from different broadcasting companies, different competitions, acquired in different conditions, and corresponding to summaries of different lengths.

Domains

Machine Learning [cs.LG]

https://inria.hal.science/hal-03662328

Submitted on: Monday, May 9, 2022, 11:41:50

Last modified on: Monday, March 11, 2024, 15:14:04

Dates and versions

hal-03662328 , version 1 (09-05-2022)

Identifiers

HAL Id : hal-03662328 , version 1
ARXIV : 2205.00694

Cite

Melissa Sanabria, Thomas Menguy, Frédéric Precioso, Pierre-Alexandre Mattei. A Multi-stage deep architecture for summary generation of soccer videos. 2022. ⟨hal-03662328⟩

Collections

CNRS INRIA I3S DIEUDONNE INRIA2 UNIV-COTEDAZUR 3IA-COTEDAZUR ANR

27 views

0 downloads
