Natural language processing for usage based indexing of web resources - INRIA - Institut National de Recherche en Informatique et en Automatique Accéder directement au contenu
Communication Dans Un Congrès Année : 2007

Natural language processing for usage based indexing of web resources

Anne Boyer
Armelle Brun

Résumé

The identification of reliable and interesting items on Internet becomes more and more difficult and time consuming. This paper is a position paper describing our intended work in the framework of multimedia information retrieval by browsing techniques within web navigation. It relies on a usage-based indexing of resources: we ignore the nature, the content and the structure of resources. We describe a new approach taking advantage of the similarity between statistical modeling of language and document retrieval systems. A syntax of usage is computed that designs a Statistical Grammar of Usage (SGU). A SGU enables resources classification to perform a personalized navigation assistant tool. It relies both on collaborative filtering to compute virtual communities of users and classical statistical language models. The resulting SGU is a community dependent SGU.
Fichier principal
Vignette du fichier
BoyerBrun.pdf (98.77 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

inria-00172231 , version 1 (14-09-2007)

Identifiants

Citer

Anne Boyer, Armelle Brun. Natural language processing for usage based indexing of web resources. 29th European Conference on Information Retrieval - ECIR'07, Fondazione Ugo Bordoni; BCS-IRSG; ACM SIGIR, Apr 2007, Rome, Italy. pp.517-524, ⟨10.1007/978-3-540-71496-5_46⟩. ⟨inria-00172231⟩
134 Consultations
291 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More