Detection of Hate Tweets using Machine Learning and Deep Learning

Lida Ketsbaia, Biju Issac, Xiaomin Chen

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

22 Citations (Scopus)
492 Downloads (Pure)

Abstract

Cyberbullying has become a highly problematic occurrence due to its potential of anonymity and its ease for others to join in the harassment of victims. The distancing effect that technological devices have, has led to cyberbullies say and do harsher things compared to what is typical in a traditional face-to-face bullying situation. Given the great importance of the problem, detection is becoming a key area of cyberbullying research. Therefore, it is highly necessary for a framework to accurately detect new cyberbullying instances automatically. To review the machine learning and deep learning approaches, two datasets were used. The first dataset was provided by the University of Maryland consisting of over 30,000 tweets, whereas the second dataset was based on the article ‘Automated Hate Speech Detection and the Problem of Offensive Language’ by Davidson et al., containing roughly 25,000 tweets. The paper explores machine learning approaches using word embeddings such as DBOW (Distributed Bag of Words) and DMM (Distributed Memory Mean) and the performance of Word2vec Convolutional Neural Networks (CNNs) to classify online hate.
Original languageEnglish
Title of host publication2020 IEEE 19th International Conference on Trust, Security and Privacy in Computing and Communications (TrustCom)
Subtitle of host publication29 December 2020 – 1 January 2021 Guangzhou, China
EditorsGuojun Wang, Ryan Ko, Md Zakirul Alam Bhuiyan, Yi Pan
Place of PublicationPiscataway, NJ
PublisherIEEE
Pages751-758
Number of pages8
ISBN (Electronic)9781665403924
ISBN (Print)9781665403931
DOIs
Publication statusPublished - Dec 2020
Event19th IEEE International Conference on Trust, Security and Privacy in Computing and Communications (TrustCom 2020): 4th International Workshop on Cyberspace Security (IWCSS 2020) - Guangzhou University, Guangzhou, China
Duration: 29 Dec 20201 Jan 2021
http://ieee-trustcom.org/TrustCom2020/

Conference

Conference19th IEEE International Conference on Trust, Security and Privacy in Computing and Communications (TrustCom 2020)
Country/TerritoryChina
CityGuangzhou
Period29/12/201/01/21
Internet address

Keywords

  • hate speech
  • CNN
  • machine learning
  • Word2Vec
  • Doc2Vec

Fingerprint

Dive into the research topics of 'Detection of Hate Tweets using Machine Learning and Deep Learning'. Together they form a unique fingerprint.

Cite this