Impact of Time on Detecting Spammers in Twitter - Université Toulouse III - Paul Sabatier - Toulouse INP Accéder directement au contenu
Communication Dans Un Congrès Année : 2016

Impact of Time on Detecting Spammers in Twitter

Résumé

Twitter is one of the most popular microblogging social systems, which provides a set of distinctive posting services operating in real time manner. The flexibility in using these services has attracted unethical individuals, so-called ”spammers”, aiming at spreading malicious, phishing, and misleading information over the network. The spamming behavior results non-ignorable problems related to real-time search and user’s privacy. Although of Twitter’s community attempts in breaking up the spam phenomenon, researchers have dived far in fighting spammers through automating the detection process. To do so, they leverage the features concept combined with machine learning methods. However, the existing features are not effective enough to adapt the spammers’ tactics due to ease of manipulation, including the graph features which are not suitable for real-time filtering. In this paper, we introduce the design of novel features suited for real-time filtering. The features are distributed between robust statistical features considering explicitly the time of posting tweets and creation date of user’s account, and behavioral features which catch any potential posting behavior similarity between different instances (e.g. hash-tags) in the user’s tweets. The experimental results show that our new features are able to classify correctly the majority of spammers with an accuracy higher than 93% when using Random Forest learning algorithm, outperforming the accuracy of the state of features by about 6%.
Fichier principal
Vignette du fichier
Impact of Time on Detecting Spammers in Twitter.pdf (484 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03159076 , version 1 (05-03-2021)

Licence

Paternité - Pas d'utilisation commerciale - Pas de modification

Identifiants

  • HAL Id : hal-03159076 , version 1

Citer

Mahdi Washha, Aziz Qaroush, Florence Sèdes. Impact of Time on Detecting Spammers in Twitter. 32ème Conférence Gestion de Données : Principes, Technologies et Applications (BDA 2016), Laboratoire d’Informatique et d’Automatique pour les Systèmes (LIAS) - Université de Poitiers et ENSMA, Nov 2016, Poitiers, France. ⟨hal-03159076⟩
48 Consultations
51 Téléchargements

Partager

Gmail Facebook X LinkedIn More