SHROOM-INDElab at SemEval-2024 Task 6: Zero- and Few-Shot LLM-Based Classification for Hallucination Detection

B.P. Allen; F. Polat; P. Groth

doi:https://doi.org/10.18653/v1/2024.semeval-1.120

SHROOM-INDElab at SemEval-2024 Task 6: Zero- and Few-Shot LLM-Based Classification for Hallucination Detection

Authors	B.P. Allen F. Polat P. Groth
Publication date	2024
Host editors	A.K. Ojha A.S. Doğruöz H.T. Madabushi G. Da San Martino S. Rosenthal A. Rosá
Book title	The 18th International Workshop on Semantic Evaluation (SemEval-2024) : proceedings of the workshop
Book subtitle	SemEval 2024 : June 20-21, 2024
ISBN (electronic)	9798891761070
Event	18th International Workshop on Semantic Evaluation
Pages (from-to)	839-844
Publisher	Stroudsburg, PA: Association for Computational Linguistics
Organisations	Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract	We describe the University of Amsterdam Intelligent Data Engineering Lab team’s entry for the SemEval-2024 Task 6 competition. The SHROOM-INDElab system builds on previous work on using prompt programming and in-context learning with large language models (LLMs) to build classifiers for hallucination detection, and extends that work through the incorporation of context-specific definition of task, role, and target concept, and automated generation of examples for use in a few-shot prompting approach. The resulting system achieved fourth-best and sixth-best performance in the model-agnostic track and model-aware tracks for Task 6, respectively, and evaluation using the validation sets showed that the system’s classification decisions were consistent with those of the crowdsourced human labelers. We further found that a zero-shot approach provided better accuracy than a few-shot approach using automatically generated examples. Code for the system described in this paper is available on Github.
Document type	Conference contribution
Note	With supplementary materials
Language	English
Published at	https://doi.org/10.18653/v1/2024.semeval-1.120 (Final published version)
Other links	https://www.github.com/bradleypallen/shroom/
Downloads	2024.semeval-1.120 (Final published version)
Supplementary materials	2024.semeval-1.120.SupplementaryMaterial 2024.semeval-1.120.SupplementaryMaterial
Permalink to this page

Back

UvA-DARE

Digital Academic Repository

SHROOM-INDElab at SemEval-2024 Task 6: Zero- and Few-Shot LLM-Based Classification for Hallucination Detection