Generating Topic Pages for Scientific Concepts Using Scientific Publications

Hosein Azarbonyad, Zubair Afzal, George Tsatsaronis

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

2 Scopus citations

Abstract

In this paper, we describe Topic Pages, an inventory of scientific concepts and information around them extracted from a large collection of scientific books and journals. The main aim of Topic Pages is to provide all the necessary information to the readers to understand scientific concepts they come across while reading scholarly content in any scientific domain. Topic Pages are a collection of automatically generated information pages using NLP and ML, each corresponding to a scientific concept. Each page contains three pieces of information: a definition, related concepts, and the most relevant snippets, all extracted from scientific peer-reviewed publications. In this paper, we discuss the details of different components to extract each of these elements. The collection of pages in production contains over 360, 000 Topic Pages across 20 different scientific domains with an average of 23 million unique visits per month, constituting it a popular source for scientific information.

Original languageEnglish
Title of host publicationAdvances in Information Retrieval - 45th European Conference on Information Retrieval, ECIR 2023, Proceedings
EditorsJaap Kamps, Lorraine Goeuriot, Fabio Crestani, Maria Maistro, Hideo Joho, Brian Davis, Cathal Gurrin, Annalina Caputo, Udo Kruschwitz
PublisherSpringer Science and Business Media Deutschland GmbH
Pages341-349
Number of pages9
ISBN (Print)9783031282379
DOIs
StatePublished - 2023
Externally publishedYes
Event45th European Conference on Information Retrieval, ECIR 2023 - Dublin, Ireland
Duration: Apr 2 2023Apr 6 2023

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume13981 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference45th European Conference on Information Retrieval, ECIR 2023
Country/TerritoryIreland
CityDublin
Period04/2/2304/6/23

Keywords

  • Definition extraction
  • Multi-document summarization
  • Scientific document processing

Fingerprint

Dive into the research topics of 'Generating Topic Pages for Scientific Concepts Using Scientific Publications'. Together they form a unique fingerprint.

Cite this