Evaluating Class Membership Relations in Knowledge Graphs Using Large Language Models

Bradley P. Allen, Paul T. Groth

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

A backbone of knowledge graphs are their class membership relations, which assign entities to a given class. As part of the knowledge engineering process, we propose a new method for evaluating the quality of these relations by processing descriptions of a given entity and class using a zero-shot chain-of-thought classifier that uses a natural language intensional definition of a class. We evaluate the method using two publicly available knowledge graphs, Wikidata and CaLiGraph, and 7 large language models. Using the gpt-4-0125-preview large language model, the method’s classification performance achieves a macro-averaged F1-score of 0.830 on data from Wikidata and 0.893 on data from CaLiGraph. Moreover, a manual analysis of the classification errors shows that 40.9% of errors were due to the knowledge graphs, with 16.0% due to missing relations and 24.9% due to incorrectly asserted relations. These results show how large language models can assist knowledge engineers in the process of knowledge graph refinement. The code and data are available on Github (https://github.com/bradleypallen/evaluating-kg-class-memberships-using-llms).

Original languageEnglish
Title of host publicationThe Semantic Web
Subtitle of host publicationESWC 2024 Satellite Events, Proceedings
EditorsAlbert Meroño Peñuela, Oscar Corcho, Paul Groth, Elena Simperl, Valentina Tamma, Andrea Giovanni Nuzzolese, Maria Poveda-Villalón, Marta Sabou, Valentina Presutti, Irene Celino, Artem Revenko, Joe Raad, Bruno Sartini, Pasquale Lisena
PublisherSpringer Science and Business Media Deutschland GmbH
Pages14-24
Number of pages11
ISBN (Print)9783031789519
DOIs
StatePublished - 2025
Externally publishedYes
EventEuropean Semantic Web Conference, ESWC 2024 - Hersonissos, Greece
Duration: May 26 2024May 30 2024

Publication series

NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume15344 LNCS
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Conference

ConferenceEuropean Semantic Web Conference, ESWC 2024
Country/TerritoryGreece
CityHersonissos
Period05/26/2405/30/24

Keywords

  • Knowledge engineering
  • knowledge graph refinement
  • large language models
  • natural language generation

Fingerprint

Dive into the research topics of 'Evaluating Class Membership Relations in Knowledge Graphs Using Large Language Models'. Together they form a unique fingerprint.

Cite this