The ChEMU 2022 Evaluation Campaign: Information Extraction in Chemical Patents

Yuan Li, Biaoyan Fang, Jiayuan He, Hiyori Yoshikawa, Saber A A. Akhondi, Christian Druckenbrodt, Camilo Thorne, Zenan Zhai, Zubair Afzal, Trevor Cohn, Timothy Baldwin, Karin Verspoor

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

The discovery of new chemical compounds is a key driver of the chemistry and pharmaceutical industries, and many other industrial sectors. Patents serve as a critical source of information about new chemical compounds. The ChEMU (Cheminformatics Elsevier Melbourne Universities) lab addresses information extraction over chemical patents and aims to advance the state of the art on this topic. ChEMU lab 2022, as part of the 13th Conference and Labs of the Evaluation Forum (CLEF-2022), will be the third ChEMU lab. The ChEMU 2020 lab provided two information extraction tasks, named entity recognition and event extraction. The ChEMU 2021 lab introduced two more tasks, chemical reaction reference resolution and anaphora resolution. For ChEMU 2022, we plan to re-run all the four tasks with a new task on semantic classification for tables as the fifth one. In this paper, we introduce ChEMU 2022, including its motivation, goals, tasks, resources, and evaluation framework.

Original languageEnglish
Title of host publicationAdvances in Information Retrieval - 44th European Conference on IR Research, ECIR 2022, Proceedings
EditorsMatthias Hagen, Suzan Verberne, Craig Macdonald, Christin Seifert, Krisztian Balog, Kjetil Nørvåg, Vinay Setty
PublisherSpringer Science and Business Media Deutschland GmbH
Pages400-407
Number of pages8
ISBN (Print)9783030997380
DOIs
StatePublished - 2022
Externally publishedYes
Event44th European Conference on Information Retrieval, ECIR 2022 - Stavanger, Norway
Duration: Apr 10 2022Apr 14 2022

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume13186 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference44th European Conference on Information Retrieval, ECIR 2022
Country/TerritoryNorway
CityStavanger
Period04/10/2204/14/22

Keywords

  • Anaphora resolution
  • Chemical patents
  • Event extraction
  • Named entity recognition
  • Reaction reference resolution
  • Table classification
  • Text mining

Fingerprint

Dive into the research topics of 'The ChEMU 2022 Evaluation Campaign: Information Extraction in Chemical Patents'. Together they form a unique fingerprint.

Cite this