Classification of Hate Speech Using Deep Neural Networks - INRIA - Institut National de Recherche en Informatique et en Automatique Accéder directement au contenu
Article Dans Une Revue Revue d'Information Scientifique & Technique Année : 2020

Classification of Hate Speech Using Deep Neural Networks

Résumé

In the Internet age where the information flow has grown rapidly, there is an increase in digital communication. The spread of hatred that was previously limited to verbal communications has quickly moved over the Internet. Social media and community forums that allow people to discuss and express their opinions are becoming platforms for the dissemination of hate messages. Many countries have developed laws to prevent online hate speech. They hold the companies that run the social media responsible for their failure to remove hate speech. However, manual analysis of hate speech on online platforms is infeasible due to the huge amount of data as it is expensive and time consuming. Thus, it is important to automatically process the online user contents to detect and remove hate speech from online media. Through this work, we propose some solutions for the problem of automatic detection of hate messages. We perform hate speech classification using embedding representations of words and Deep Neural Networks (DNN). We compare fastText and BERT (Bidirectional Encoder Representations from Transformers) embedding representations of words. Furthermore, we perform classification using two approaches: (a) using word embeddings as input to Support Vector Machines (SVM) and DNN-based classifiers; (b) fine-tuning of a BERT model for classification using a task-specific corpus. Among the DNNbased classifiers, we compare Convolutional Neural Networks (CNN), Bi-Directional Long Short Term Memory (Bi-LSTM) and Convolutional Recurrent Neural Network (CRNN). The classification was performed on a Twitter dataset using three classes: hate, offensive and neither classes. Compared to the feature-based approaches, the BERT fine-tuning approach obtained a relative improvement of 16% in terms of macro-average F1-measure and 5.3% in terms of weighted F1-measure.
Fichier principal
Vignette du fichier
SIIE_chap.pdf (434.45 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)

Dates et versions

hal-03101938 , version 1 (14-01-2021)

Identifiants

  • HAL Id : hal-03101938 , version 1

Citer

Ashwin Geet d'Sa, Irina Illina, Dominique Fohr. Classification of Hate Speech Using Deep Neural Networks. Revue d'Information Scientifique & Technique , 2020, From Data and Information Processing to Knowledge Organization : Architectures, Models and Systems, 25 (01). ⟨hal-03101938⟩
382 Consultations
2427 Téléchargements

Partager

Gmail Facebook X LinkedIn More