Learning from Video and Text via Large-Scale Discriminative Clustering - INRIA - Institut National de Recherche en Informatique et en Automatique Accéder directement au contenu
Ouvrages Année : 2017

Learning from Video and Text via Large-Scale Discriminative Clustering

Résumé

Discriminative clustering has been successfully applied to a number of weakly-supervised learning tasks. Such applications include person and action recognition, text-to-video alignment, object co-segmentation and co-localization in videos and images. One drawback of dis-criminative clustering, however, is its limited scalability. We address this issue and propose an online optimization algorithm based on the Block-Coordinate Frank-Wolfe algorithm. We apply it to the problem of weakly-supervised learning of actions and actors from movies and corresponding movie scripts. The scaling up of the learning problem to 66 feature-length movies enables us to significantly improve weakly-supervised action recognition.
Fichier principal
Vignette du fichier
ICCV_arxiv.pdf (3.19 Mo) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01569540 , version 1 (27-07-2017)
hal-01569540 , version 2 (28-07-2017)

Identifiants

  • HAL Id : hal-01569540 , version 1

Citer

Antoine Miech, Jean-Baptiste Alayrac, Piotr Bojanowski, Ivan Laptev, Josef Sivic (Dir.). Learning from Video and Text via Large-Scale Discriminative Clustering. published by the authors, 2017. ⟨hal-01569540v1⟩
315 Consultations
1458 Téléchargements

Partager

Gmail Facebook X LinkedIn More