Annotating Research Infrastructure in Scientific Papers: An NLP-driven Approach

Seyed Amin Tabatabaei, Georgios Cheirmpos, Marius Doornenbal, Alberto Zigoni, Veronique Moore, Georgios Tsatsaronis

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution, peer-review

1 Scopus citation

Abstract

In this work, we present a natural language processing (NLP) pipeline for the identification, extraction and linking of Research Infrastructure (RI) used in scientific publications. Links between scientific equipment and publications where the equipment was used can support multiple use cases, such as evaluating the impact of RI investment, and supporting Open Science and research reproducibility. These links can also be used to establish a profile of the RI portfolio of each institution and associate each piece of equipment with scientific output. The system we are describing here is already in production, and has been used to address real business use cases, some of which we discuss in this paper. The computational pipeline at the heart of the system comprises both supervised and unsupervised modules to detect the usage of research equipment by processing the full text of the articles. Additionally, we have created a knowledge graph of RI, which is utilized to annotate the articles with metadata. Finally, examples of the business value of the insights made possible by this NLP pipeline are illustrated.

Original language: English
Title of host publication: Industry Track
Publisher: Association for Computational Linguistics (ACL)
Pages: 457-463
Number of pages: 7
ISBN (Electronic): 9781959429685
State: Published - 2023
Externally published: Yes
Event: 61st Annual Meeting of the Association for Computational Linguistics, ACL 2023 - Toronto, Canada
Duration: Jul 9 2023 to Jul 14 2023

Publication series

Name: Proceedings of the Annual Meeting of the Association for Computational Linguistics
Volume: 5
ISSN (Print): 0736-587X

Conference

Conference: 61st Annual Meeting of the Association for Computational Linguistics, ACL 2023
Country/Territory: Canada
City: Toronto
Period: 07/9/23 to 07/14/23
