Spam Detection on Arabic Twitter

Hamdy Mubarak, Ahmed Abdelali, Sabit Hassan*, Kareem Darwish

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

27 Citations (Scopus)

Abstract

Twitter has become a popular social media platform in the Arab region. Some users exploit this popularity by posting unwanted advertisements for their own interest. In this paper, we present a large manually annotated dataset of advertisement (Spam) tweets in Arabic. We analyze the characteristics of these tweets that distinguish them from other tweets and identify their targets and topics. In addition, we analyze the characteristics of Spam accounts. We utilize Support Vector Machines (SVMs) and contextual embedding based models to identify these Spam tweets with macro averaged F1 score above 98%.

Original languageEnglish
Title of host publicationSocial Informatics - 12th International Conference, SocInfo 2020, Proceedings
EditorsSamin Aref, Kalina Bontcheva, Marco Braghieri, Frank Dignum, Fosca Giannotti, Francesco Grisolia, Dino Pedreschi
PublisherSpringer Science and Business Media Deutschland GmbH
Pages237-251
Number of pages15
ISBN (Print)9783030609740
DOIs
Publication statusPublished - 2020
Event12th International Conference on Social Informatics, SocInfo 2020 - Pisa, Italy
Duration: 6 Oct 20209 Oct 2020

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume12467 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference12th International Conference on Social Informatics, SocInfo 2020
Country/TerritoryItaly
CityPisa
Period6/10/209/10/20

Keywords

  • Advertisement detection
  • Arabic social media
  • Social media analysis
  • Spam filtering

Fingerprint

Dive into the research topics of 'Spam Detection on Arabic Twitter'. Together they form a unique fingerprint.

Cite this