How to match bilingual tweets ? - INRIA - Institut National de Recherche en Informatique et en Automatique Accéder directement au contenu
Communication Dans Un Congrès Année : 2017

How to match bilingual tweets ?

Résumé

In this paper, we propose a method that aligns comparable bilingual tweets which, not only takes into account the specificity of a Tweet, but treats also proper names, dates and numbers in two different languages. This permits to retrieve more relevant target tweets. The process of matching proper names between Arabic and English is a difficult task, because these two languages use different scripts. For that, we used an approach which projects the sounds of an English proper name into Arabic and aligns it with the most appropriate proper name. We evaluated the method with a classical measure and compared it to the one we developed. The experiments have been achieved on two parallel corpora and shows that our measure outperforms the baseline by 5.6% at R@1 recall.
Fichier principal
Vignette du fichier
articleABIDIandSmaili (1).pdf (542.42 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-01536078 , version 1 (10-06-2017)

Identifiants

  • HAL Id : hal-01536078 , version 1

Citer

Karima Abidi, Kamel Smaili. How to match bilingual tweets ? . 6th NLP 2017 - Computer Science Conference Proceedings in Computer Science & Information Technology (CS & IT) , Feb 2017, Sydney, Australia. ⟨hal-01536078⟩
210 Consultations
389 Téléchargements

Partager

Gmail Facebook X LinkedIn More