SHROOM-INDElab at SemEval-2024 Task 6: Zero- and Few-Shot LLM-Based Classification for Hallucination Detection

Bradley P. Allen, Fina Polat, Paul Groth

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; peer-review

2 Scopus citations

Abstract

We describe the University of Amsterdam Intelligent Data Engineering Lab team's entry for the SemEval-2024 Task 6 competition. The SHROOM-INDElab system builds on previous work on using prompt programming and in-context learning with large language models (LLMs) to build classifiers for hallucination detection, and extends that work by incorporating context-specific definitions of the task, role, and target concept, and by automatically generating examples for use in a few-shot prompting approach. The resulting system achieved fourth-best and sixth-best performance in the model-agnostic and model-aware tracks for Task 6, respectively, and evaluation using the validation sets showed that the system's classification decisions were consistent with those of the crowd-sourced human labellers. We further found that a zero-shot approach provided better accuracy than a few-shot approach using automatically generated examples. Code for the system described in this paper is available on GitHub.
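
As an illustration of the prompting setup the abstract describes, the following is a minimal sketch of how a classification prompt might be assembled from a role, a task description, and a definition of the target concept, with optional examples for the few-shot case. It is not the authors' code: the names `PromptSpec` and `build_prompt`, the field labels, and the prompt layout are all assumptions made for this example, and the call to an actual LLM is deliberately left out.

```python
from dataclasses import dataclass, field


@dataclass
class PromptSpec:
    """Hypothetical container for the context-specific prompt parts."""
    role: str      # who the model is asked to act as
    task: str      # what the model is asked to do
    concept: str   # definition of the target concept (e.g. hallucination)
    # (text, label) pairs; an empty list means a zero-shot prompt
    examples: list[tuple[str, str]] = field(default_factory=list)


def build_prompt(spec: PromptSpec, item: str) -> str:
    """Assemble a classification prompt; zero-shot when spec.examples is empty."""
    parts = [
        f"Role: {spec.role}",
        f"Task: {spec.task}",
        f"Definition: {spec.concept}",
    ]
    # Few-shot case: append each example with its label (assumed layout).
    for text, label in spec.examples:
        parts.append(f"Example: {text}\nLabel: {label}")
    # The item to classify goes last, leaving the label for the model to fill in.
    parts.append(f"Input: {item}\nLabel:")
    return "\n\n".join(parts)
```

Under these assumptions, the zero-shot and few-shot variants differ only in whether `examples` is empty, which makes it straightforward to compare the two on the same inputs.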

Original language: English
Title of host publication: SemEval 2024 - 18th International Workshop on Semantic Evaluation, Proceedings of the Workshop
Editors: Atul Kr. Ojha, A. Seza Doğruöz, Harish Tayyar Madabushi, Giovanni Da San Martino, Sara Rosenthal, Aiala Rosá
Publisher: Association for Computational Linguistics (ACL)
Pages: 839-844
Number of pages: 6
ISBN (Electronic): 9798891761070
State: Published - 2024
Externally published: Yes
Event: 18th International Workshop on Semantic Evaluation, SemEval 2024, co-located with the 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics, NAACL 2024 - Hybrid, Mexico City, Mexico
Duration: Jun 20 2024 to Jun 21 2024

Publication series

Name: SemEval 2024 - 18th International Workshop on Semantic Evaluation, Proceedings of the Workshop

Conference

Conference: 18th International Workshop on Semantic Evaluation, SemEval 2024, co-located with the 2024 Annual Conference of the North American Chapter of the Association for Computational Linguistics, NAACL 2024
Country/Territory: Mexico
City: Hybrid, Mexico City
Period: 06/20/24 to 06/21/24
