Overview of the DagPap24 Shared Task on Detecting Automatically Generated Scientific Papers

Savvas Chamezopoulos, Drahomira Herrmannova, Anita de Waard, Domenic Rosati, Yury Kashnitsky

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This paper provides an overview of the 2024 ACL Scholarly Document Processing workshop shared task on the detection of automatically generated scientific papers. Unlike our previous task, which focused on the binary classification of whether scientific passages were machine-generated or not, one likely use case for text generation technology in scientific writing is to intersperse human-written text with passages of machine-generated text. We frame the detection problem as a multi-class span classification task: given an expert of text, label token spans in the text as human-written or machine-generated We shared a dataset containing excerpts from human-written papers as well as artificially generated content collected by Elsevier publishing and editorial teams. As a test set, the participants were provided with a corpus of openly accessible human-written as well as generated papers from the same scientific domains of documents. The shared task saw 457 submissions across 28 participating teams and resulted in three published technical reports. We discuss our findings from the shared task in this overview paper.

Original languageEnglish
Title of host publicationSDP 2024 - 4th Workshop on Scholarly Document Processing, Proceedings of the Workshop
EditorsTirthankar Ghosal, Amanpreet Singh, Anita de Waard, Philipp Mayr, Aakanksha Naik, Orion Weller, Yoonjoo Lee, Shannon Shen, Yanxia Qin
PublisherAssociation for Computational Linguistics (ACL)
Pages7-11
Number of pages5
ISBN (Electronic)9798891761513
StatePublished - 2024
Externally publishedYes
Event4th Workshop on Scholarly Document Processing, SDP 2024 at ACL 2024 - Bangkok, Thailand
Duration: Aug 16 2024 → …

Publication series

NameSDP 2024 - 4th Workshop on Scholarly Document Processing, Proceedings of the Workshop

Conference

Conference4th Workshop on Scholarly Document Processing, SDP 2024 at ACL 2024
Country/TerritoryThailand
CityBangkok
Period08/16/24 → …

Fingerprint

Dive into the research topics of 'Overview of the DagPap24 Shared Task on Detecting Automatically Generated Scientific Papers'. Together they form a unique fingerprint.

Cite this