Understanding the Impact of Entity Linking on the Topology of Entity Co-occurrence Networks for Social Media Analysis

James Nevin, Pengyu Zhang, Dimitar Dimitrov, Michael Lees, Paul Groth, Stefan Dietze

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

A common form of analysis of textual data is entity co-occurrence, where networks of entities and their connections within the text are constructed and their topology analysed. As the analysis is focused on the entities and their relations, the tools used to extract them can have a potentially large effect on the results. A frequently used method as part of these analyses is entity linking, where extracted entities are mapped to a knowledge graph. Many established entity linking tools have been created for long text following standard spelling and grammar rules. As a result, the tools struggle on short, unstructured text such as tweets. On such text, it can be difficult to choose between tools and parameter settings, especially since ground truth is often unavailable. Given these challenges in entity linking on text and the direct influence of extracted entities on subsequent network analysis, we propose the need to apply multiple tools to create a more holistic set of results. We verify this assertion through a set of experiments. Using a dataset of approximately 21 million English-language tweets, we construct multiple entity co-occurrence networks using two tools (Fast Entity Linker and DBpedia Spotlight) and numerous confidence thresholds for each. We find that standard network analysis metrics, such as size, connectivity, and centrality are all heavily influenced by the choice of entity linking tool.

Original languageEnglish
Title of host publicationKnowledge Engineering and Knowledge Management - 24th International Conference, EKAW 2024, Proceedings
EditorsMehwish Alam, Marco Rospocher, Marieke van Erp, Laura Hollink, Genet Asefa Gesese
PublisherSpringer Science and Business Media Deutschland GmbH
Pages69-85
Number of pages17
ISBN (Print)9783031777912
DOIs
StatePublished - 2025
Externally publishedYes
Event24th International Conference on Knowledge Engineering and Knowledge Management, EKAW 2024 - Amsterdam, Netherlands
Duration: Nov 26 2024Nov 28 2024

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume15370 LNAI
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

Conference24th International Conference on Knowledge Engineering and Knowledge Management, EKAW 2024
Country/TerritoryNetherlands
CityAmsterdam
Period11/26/2411/28/24

Keywords

  • Co-occurrence networks
  • Entity linking
  • Network analysis
  • Social media

Fingerprint

Dive into the research topics of 'Understanding the Impact of Entity Linking on the Topology of Entity Co-occurrence Networks for Social Media Analysis'. Together they form a unique fingerprint.

Cite this