Disease Normalization with Graph Embeddings

D. Pujary, C. Thorne, W. Aziz

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

6 Scopus citations

Abstract

The detection and normalization of diseases in biomedical texts are key biomedical natural language processing tasks. Disease names need not only be identified, but also normalized or linked to clinical taxonomies describing diseases such as MeSH®. In this paper we describe deep learning methods that tackle both tasks. We train and test our methods on the known NCBI disease benchmark corpus. We propose to represent disease names by leveraging MeSH® ’s graphical structure together with the lexical information available in the taxonomy using graph embeddings. We also show that combining neural named entity recognition models with our graph-based entity linking methods via multitask learning leads to improved disease recognition in the NCBI corpus.

Original languageEnglish
Title of host publicationIntelligent Systems and Applications - Proceedings of the 2020 Intelligent Systems Conference IntelliSys Volume 2
EditorsKohei Arai, Supriya Kapoor, Rahul Bhatia
PublisherSpringer
Pages209-217
Number of pages9
ISBN (Print)9783030551865
DOIs
StatePublished - 2021
Externally publishedYes
EventIntelligent Systems Conference, IntelliSys 2020 - London, United Kingdom
Duration: Sep 3 2020Sep 4 2020

Publication series

NameAdvances in Intelligent Systems and Computing
Volume1251 AISC
ISSN (Print)2194-5357
ISSN (Electronic)2194-5365

Conference

ConferenceIntelligent Systems Conference, IntelliSys 2020
Country/TerritoryUnited Kingdom
CityLondon
Period09/3/2009/4/20

Keywords

  • biLSTM-CRF Models
  • Disease named entity normalization
  • Graph embeddings
  • Multi-task learning

Fingerprint

Dive into the research topics of 'Disease Normalization with Graph Embeddings'. Together they form a unique fingerprint.

Cite this