A Scalable Video Search Engine Based on Audio Content Indexing and Topic Segmentation - INRIA - Institut National de Recherche en Informatique et en Automatique
Conference paper. Year: 2011

Abstract

One important class of online videos is news broadcasts. Most news organisations provide near-immediate access to topical news broadcasts over the Internet, through RSS feeds or podcasts. Until recently, the technology did not allow a user to jump automatically to the shorter passages within a longer broadcast that might interest them. Recent advances in both speech recognition and natural language processing have produced a number of robust tools that let us give users quicker, more focussed access to relevant segments of one or more news broadcast videos. Here we present our new interface for browsing or searching news broadcasts (video/audio), which exploits these language processing tools to let users (i) access topical passages within news broadcasts immediately, (ii) browse news broadcasts by events as well as by people, places and organisations, (iii) perform cross-lingual search of news broadcasts, (iv) search for news through a map interface, (v) browse news by trending topics, and (vi) see automatically generated textual clues for news segments before listening. Our publicly searchable demonstrator currently indexes daily broadcast news content from 50 sources in English, French, Chinese, Arabic, Spanish, Dutch and Russian.
Main file
NEMSummit2011_final2.pdf (591.93 KB)
Origin: Files produced by the author(s)

Dates and versions

hal-00645228 , version 1 (27-11-2011)

Cite

Julien Lawto, Jean-Luc Gauvain, Lori Lamel, Gregory Grefenstette, Guillaume Gravier, et al. A Scalable Video Search Engine Based on Audio Content Indexing and Topic Segmentation. 2011 Networked and Electronic Media (NEM) Summit: Implementing Future Media Internet, Sep 2011, Torino, Italy. 160 p. ⟨hal-00645228⟩