Detecting deceptive tweets in arabic for cyber-security

Francisco Rangel, Paolo Rosso, Anis Charfi, Wajdi Zaghouani

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

12 Citations (Scopus)

Abstract

In the framework of the QNRF project on Arabic Author Profiling for Cyber-Security, we addressed deception detection in Arabic in order to discard those messages that do not really represent potential threats. We have applied the Low Dimensionality Statistical Embedding (LDSE) method to several corpora for Arabic including the Arabic credibility corpus and two new corpora that we created: the Qatar Twitter corpus and the Qatar News corpus. We achieved a performance of 0.797 Macro F-measure on the Arabic Credibility corpus. The obtained results with two well-known distributed representations, namely Continuous Bag of Words and Skip Grams, showed the competitiveness of our approach. The LDSE approach gave similar results on the two corpora that we created. We evaluated our work in a cross-genre scenario, showing the robustness of LDSE when there are enough data about similar topics.

Original languageEnglish
Title of host publication2019 IEEE International Conference on Intelligence and Security Informatics, ISI 2019
EditorsXiaolong Zheng, Ahmed Abbasi, Michael Chau, Alan Wang, Lina Zhou
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages86-91
Number of pages6
ISBN (Electronic)9781728125046
DOIs
Publication statusPublished - Jul 2019
Event17th IEEE International Conference on Intelligence and Security Informatics, ISI 2019 - Shenzhen, China
Duration: 1 Jul 20193 Jul 2019

Publication series

Name2019 IEEE International Conference on Intelligence and Security Informatics, ISI 2019

Conference

Conference17th IEEE International Conference on Intelligence and Security Informatics, ISI 2019
Country/TerritoryChina
CityShenzhen
Period1/07/193/07/19

Keywords

  • Arabic
  • Cyber-security
  • Deception detection
  • Twitter

Fingerprint

Dive into the research topics of 'Detecting deceptive tweets in arabic for cyber-security'. Together they form a unique fingerprint.

Cite this