GenAI Content Detection Task 2: AI vs. Human - Academic Essay Authenticity Challenge

Shammur Absar Chowdhury, Hind Almerekhi, Mucahid Kutlu, Kaan Efe Keleş, Fatema Ahmad, Tasnim Mohiuddin, George Mikros, Firoj Alam

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; peer-review

1 Citation (Scopus)

Abstract

This paper presents a comprehensive overview of the first edition of the Academic Essay Authenticity Challenge, organized as part of the GenAI Content Detection shared tasks collocated with COLING 2025. This challenge focuses on detecting machine-generated vs human-authored essays for academic purposes. The task is defined as follows: “Given an essay, identify whether it is generated by a machine or authored by a human.” The challenge involves two languages: English and Arabic. During the evaluation phase, 25 teams submitted systems for English and 21 teams for Arabic, reflecting substantial interest in the task. Finally, five teams submitted system description papers. The majority of submissions utilized fine-tuned transformer-based models, with one team employing Large Language Models (LLMs) such as Llama 2 and Llama 3. This paper outlines the task formulation, details the dataset construction process, and explains the evaluation framework. Additionally, we present a summary of the approaches adopted by participating teams. Nearly all submitted systems outperformed the n-gram-based baseline, with the top-performing systems achieving F1 scores exceeding 0.98 for both languages, indicating significant progress in the detection of machine-generated text.

Original language: English
Title of host publication: GenAIDetect 2025 - Proceedings of the 1st Workshop on GenAI Content Detection, Proceedings of the Workshop - 31st International Conference on Computational Linguistics, COLING 2025
Editors: Firoj Alam, Preslav Nakov, Nizar Habash, Iryna Gurevych, Shammur Chowdhury, Artem Shelmanov, Yuxia Wang, Ekaterina Artemova, Mucahid Kutlu, George Mikros
Publisher: Association for Computational Linguistics (ACL)
Pages: 323-333
Number of pages: 11
ISBN (Electronic): 9798891762053
Publication status: Published - 19 Jan 2025
Event: 1st Workshop on GenAI Content Detection, GenAIDetect 2025 - Abu Dhabi, United Arab Emirates
Duration: 19 Jan 2025 → …

Publication series

Name: Proceedings - International Conference on Computational Linguistics, COLING
ISSN (Print): 2951-2093

Conference

Conference: 1st Workshop on GenAI Content Detection, GenAIDetect 2025
Country/Territory: United Arab Emirates
City: Abu Dhabi
Period: 19/01/25 → …
