Nullpointer at CheckThat! 2024: Identifying Subjectivity from Multilingual Text Sequence

Md Rafiul Biswas, Abrar Tasneem Abir, Wajdi Zaghouani*

*Corresponding author for this work

Research output: Contribution to journalConference articlepeer-review

Abstract

This study addresses a binary classification task to determine whether a text sequence, either a sentence or paragraph, is subjective or objective. The task spans five languages-Arabic, Bulgarian, English, German, and Italian-along with a multilingual category. Our approach involved several key techniques. Initially, we preprocessed the data through parts of speech (POS) tagging, identification of question marks, and application of attention masks. We fine-tuned the sentiment-based Transformer model 'MarieAngeA13/Sentiment-AnalysisBERT' on our dataset. Given the imbalance with more objective data, we implemented a custom classifier that assigned greater weight to objective data. Additionally, we translated non-English data into English to maintain consistency across the dataset. Our model achieved notable results, scoring top marks for the multilingual dataset (Macro F1-0.7121) and German (Macro F1-0.7908). It ranked second for Arabic (Macro F1-0.4908) and Bulgarian (Macro F1-0.7169), third for Italian (Macro F1-0.7430), and ninth for English (Macro F1-0.6893).

Original languageEnglish
Pages (from-to)361-368
Number of pages8
JournalCEUR Workshop Proceedings
Volume3740
Publication statusPublished - 2024
Externally publishedYes
Event25th Working Notes of the Conference and Labs of the Evaluation Forum, CLEF 2024 - Grenoble, France
Duration: 9 Sept 202412 Sept 2024

Keywords

  • fact checking
  • natural language processing
  • news articles
  • sentiment
  • subjectivity
  • text sequence

Fingerprint

Dive into the research topics of 'Nullpointer at CheckThat! 2024: Identifying Subjectivity from Multilingual Text Sequence'. Together they form a unique fingerprint.

Cite this