Overview of ChEMU 2022 Evaluation Campaign: Information Extraction in Chemical Patents

Yuan Li, Biaoyan Fang, Jiayuan He, Hiyori Yoshikawa, Saber A. Akhondi, Christian Druckenbrodt, Camilo Thorne, Zubair Afzal, Zenan Zhai, Timothy Baldwin, Karin Verspoor

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Scopus citations

Abstract

In this paper, we provide an overview of the Cheminformatics Elsevier Melbourne University (ChEMU) evaluation lab 2022, part of the Conference and Labs of the Evaluation Forum 2022 (CLEF 2022). The ChEMU campaign focuses on information extraction tasks over chemical reactions in patents. The ChEMU 2020 lab provided two information extraction tasks, named entity recognition and event extraction. The ChEMU 2021 lab introduced one more task, anaphora resolution. This year, we re-run all the three tasks with new test data. Together, the tasks support comprehensive automatic chemical patent analysis. Herein, we describe the resources created for these tasks and the evaluation methodology adopted. We also provide a brief summary of the methods employed by participants of this lab and the results obtained across 22 runs from 3 teams, finding that several submissions achieve better results than the baseline methods prepared by the organizers.

Original languageEnglish
Title of host publicationExperimental IR Meets Multilinguality, Multimodality, and Interaction - 13th International Conference of the CLEF Association, CLEF 2022, Proceedings
EditorsAlberto Barrón-Cedeño, Giovanni Da San Martino, Guglielmo Faggioli, Nicola Ferro, Mirko Degli Esposti, Fabrizio Sebastiani, Craig Macdonald, Gabriella Pasi, Allan Hanbury, Martin Potthast
PublisherSpringer Science and Business Media Deutschland GmbH
Pages521-540
Number of pages20
ISBN (Print)9783031136429
DOIs
StatePublished - 2022
Externally publishedYes
Event13th International Conference of the Cross-Language Evaluation Forum for European Languages, CLEF 2022 - Bologna, Italy
Duration: Sep 5 2022Sep 8 2022

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume13390 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference13th International Conference of the Cross-Language Evaluation Forum for European Languages, CLEF 2022
Country/TerritoryItaly
CityBologna
Period09/5/2209/8/22

Keywords

  • Chemical patents
  • Information Extraction
  • Text mining

Fingerprint

Dive into the research topics of 'Overview of ChEMU 2022 Evaluation Campaign: Information Extraction in Chemical Patents'. Together they form a unique fingerprint.

Cite this