Classification of Hate Speech Using Deep Neural Networks

Ashwin Geet d'Sa; Irina Illina; Dominique Fohr

Journal article. Revue d'Information Scientifique & Technique, Year: 2020

Ashwin Geet d'Sa

Role: Author

Speech Modeling for Facilitating Oral-Based Communication

Irina Illina

Role: Author
PersonId : 15663
IdHAL : irina-illina
IdRef : 120731746

Speech Modeling for Facilitating Oral-Based Communication

Dominique Fohr

Role: Author
PersonId : 15652
IdHAL : dominique-fohr
IdRef : 031092942

Speech Modeling for Facilitating Oral-Based Communication

Abstract

In the Internet age, the flow of information has grown rapidly and digital communication has increased with it. The spread of hatred, previously limited to verbal communication, has quickly moved onto the Internet. Social media and community forums that allow people to discuss and express their opinions are becoming platforms for the dissemination of hate messages. Many countries have developed laws to prevent online hate speech, and they hold the companies that run social media platforms responsible for failing to remove it. However, manual analysis of hate speech on online platforms is infeasible: given the huge amount of data, it is expensive and time-consuming. It is therefore important to process online user content automatically in order to detect and remove hate speech from online media. In this work, we propose some solutions to the problem of automatic detection of hate messages. We perform hate speech classification using embedding representations of words and Deep Neural Networks (DNN). We compare fastText and BERT (Bidirectional Encoder Representations from Transformers) embedding representations of words. Furthermore, we perform classification using two approaches: (a) using word embeddings as input to Support Vector Machines (SVM) and DNN-based classifiers; (b) fine-tuning a BERT model for classification using a task-specific corpus. Among the DNN-based classifiers, we compare Convolutional Neural Networks (CNN), Bi-Directional Long Short-Term Memory (Bi-LSTM) and Convolutional Recurrent Neural Networks (CRNN). The classification was performed on a Twitter dataset using three classes: hate, offensive and neither. Compared to the feature-based approaches, the BERT fine-tuning approach obtained a relative improvement of 16% in terms of macro-average F1-measure and 5.3% in terms of weighted F1-measure.
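
The macro-average and weighted F1-measures cited in the abstract combine per-class scores differently, which matters when class sizes differ. The following is a minimal Python sketch on made-up labels; it is not the authors' code or data, and the function names, the toy predictions and the scores passed to `relative_improvement` are illustrative assumptions.

```python
from collections import Counter

# The three classes named in the abstract.
LABELS = ("hate", "offensive", "neither")


def per_class_f1(y_true, y_pred, labels=LABELS):
    """Return {label: F1}, computed one-vs-rest for each class."""
    scores = {}
    for label in labels:
        pairs = list(zip(y_true, y_pred))
        tp = sum(t == label and p == label for t, p in pairs)
        fp = sum(t != label and p == label for t, p in pairs)
        fn = sum(t == label and p != label for t, p in pairs)
        denom = 2 * tp + fp + fn
        # F1 = 2TP / (2TP + FP + FN); defined as 0 when the class never occurs.
        scores[label] = 2 * tp / denom if denom else 0.0
    return scores


def macro_f1(y_true, y_pred, labels=LABELS):
    """Unweighted mean of per-class F1: every class counts equally."""
    scores = per_class_f1(y_true, y_pred, labels)
    return sum(scores.values()) / len(labels)


def weighted_f1(y_true, y_pred, labels=LABELS):
    """Mean of per-class F1 weighted by the number of true examples per class."""
    scores = per_class_f1(y_true, y_pred, labels)
    support = Counter(y_true)
    return sum(scores[label] * support[label] for label in labels) / len(y_true)


def relative_improvement(baseline, improved):
    """Relative gain of `improved` over `baseline`, in percent."""
    return 100.0 * (improved - baseline) / baseline


# Toy example (hypothetical data): the rare "hate" class is always
# misclassified as "offensive"; the other two classes are predicted correctly.
y_true = ["hate"] * 2 + ["offensive"] * 6 + ["neither"] * 2
y_pred = ["offensive"] * 2 + ["offensive"] * 6 + ["neither"] * 2

print(f"macro F1    = {macro_f1(y_true, y_pred):.3f}")
print(f"weighted F1 = {weighted_f1(y_true, y_pred):.3f}")
```

In this toy case the macro-average is lower than the weighted score because the completely missed minority class drags down the unweighted mean, while the weighted mean is dominated by the larger classes. A relative improvement is computed against the baseline score, so `relative_improvement(0.50, 0.58)` describes a hypothetical 16% relative gain, not a gain of 16 F1 points.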

Keywords

natural language processing; classification; deep neural network; embedding; hate speech

Domains

Human-Computer Interaction [cs.HC]

Main file

SIIE_chap.pdf (434.45 KB)

Origin: Files produced by the author(s)

https://hal.science/hal-03101938

Submitted on: Thursday, January 14, 2021, 10:05:33

Last modified on: Monday, September 11, 2023, 17:41:19

Long-term archiving on: Thursday, April 15, 2021, 18:08:13

Dates and versions

hal-03101938, version 1 (14-01-2021)

Identifiers

HAL Id: hal-03101938, version 1

Cite

Ashwin Geet d'Sa, Irina Illina, Dominique Fohr. Classification of Hate Speech Using Deep Neural Networks. Revue d'Information Scientifique & Technique, 2020, From Data and Information Processing to Knowledge Organization: Architectures, Models and Systems, 25 (01). ⟨hal-03101938⟩

Collections

CNRS INRIA UNIV-LORRAINE INRIA2 LORIA LORIA-NLPKD ANR

382 views

2427 downloads
