Abstract
Cyberbullying has become a highly problematic occurrence due to its potential of anonymity and its ease for others to join in the harassment of victims. The distancing effect that technological devices have, has led to cyberbullies say and do harsher things compared to what is typical in a traditional face-to-face bullying situation. Given the great importance of the problem, detection is becoming a key area of cyberbullying research. Therefore, it is highly necessary for a framework to accurately detect new cyberbullying instances automatically. To review the machine learning and deep learning approaches, two datasets were used. The first dataset was provided by the University of Maryland consisting of over 30,000 tweets, whereas the second dataset was based on the article ‘Automated Hate Speech Detection and the Problem of Offensive Language’ by Davidson et al., containing roughly 25,000 tweets. The paper explores machine learning approaches using word embeddings such as DBOW (Distributed Bag of Words) and DMM (Distributed Memory Mean) and the performance of Word2vec Convolutional Neural Networks (CNNs) to classify online hate.
Original language | English |
---|---|
Title of host publication | 2020 IEEE 19th International Conference on Trust, Security and Privacy in Computing and Communications (TrustCom) |
Subtitle of host publication | 29 December 2020 – 1 January 2021 Guangzhou, China |
Editors | Guojun Wang, Ryan Ko, Md Zakirul Alam Bhuiyan, Yi Pan |
Place of Publication | Piscataway, NJ |
Publisher | IEEE |
Pages | 751-758 |
Number of pages | 8 |
ISBN (Electronic) | 9781665403924 |
ISBN (Print) | 9781665403931 |
DOIs | |
Publication status | Published - Dec 2020 |
Event | 19th IEEE International Conference on Trust, Security and Privacy in Computing and Communications (TrustCom 2020): 4th International Workshop on Cyberspace Security (IWCSS 2020) - Guangzhou University, Guangzhou, China Duration: 29 Dec 2020 → 1 Jan 2021 http://ieee-trustcom.org/TrustCom2020/ |
Conference
Conference | 19th IEEE International Conference on Trust, Security and Privacy in Computing and Communications (TrustCom 2020) |
---|---|
Country/Territory | China |
City | Guangzhou |
Period | 29/12/20 → 1/01/21 |
Internet address |
Keywords
- hate speech
- CNN
- machine learning
- Word2Vec
- Doc2Vec