A Semi-Supervised Learning Approach for Tackling Twitter Spam Drift

Niddal Imam, Biju Issac, Seibu Mary Jacob

Research output: Contribution to journalArticlepeer-review

13 Citations (Scopus)
27 Downloads (Pure)

Abstract

Twitter has changed the way people get information by allowing them to express their opinion and comments on the daily tweets. Unfortunately, due to the high popularity of Twitter, it has become very attractive to spammers. Unlike other types of spam, Twitter spam has become a serious issue in the last few years. The large number of users and the high amount of information being shared on Twitter play an important role in accelerating the spread of spam. In order to protect the users, Twitter and the research community have been developing different spam detection systems by applying different machine-learning techniques. However, a recent study showed that the current machine learning-based detection systems are not able to detect spam accurately because spam tweet characteristics vary over time. This issue is called “Twitter Spam Drift”. In this paper, a semi-supervised learning approach (SSLA) has been proposed to tackle this. The new approach uses the unlabeled data to learn the structure of the domain. Different experiments were performed on English and Arabic datasets to test and evaluate the proposed approach and the results show that the proposed SSLA can reduce the effect of Twitter spam drift and outperform the existing techniques.
Original languageEnglish
Article number1950010
JournalInternational Journal of Computational Intelligence and Applications
Volume18
Issue number02
Early online date13 Jun 2019
DOIs
Publication statusPublished - Jun 2019

Keywords

  • Semi-supervised learning
  • twitter spam
  • machine learning
  • spam drift

Fingerprint

Dive into the research topics of 'A Semi-Supervised Learning Approach for Tackling Twitter Spam Drift'. Together they form a unique fingerprint.

Cite this