TY - GEN
T1 - Taxonomy Generation for Scientific Concepts Using Large Language Models
AU - Zhang, Yue
AU - Zhu, Zi Long
AU - Capari, Artemis
AU - Azarbonyad, Hosein
AU - Afzal, Zubair
AU - Tsatsaronis, George
N1 - Publisher Copyright:
© The Author(s), under exclusive license to Springer Nature Switzerland AG 2026.
PY - 2026
Y1 - 2026
N2 - Traditional data-driven automatic taxonomy generation methods struggle with complex, large, and domain-specific datasets. To address these issues, this study leverages Large Language Models (LLMs) to automate key stages of taxonomy generation, focusing on scientific concepts. Our approach employs LLMs at several stages of the taxonomy generation process, including extracting candidate concepts and organizing keywords into taxonomies centered around chosen scientific concepts. By incorporating LLMs, we aim to enhance depth, accuracy, and coherence of generated taxonomies. Comparative analyses show that the proposed LLM-based taxonomy generation method outperforms state-of-the-art taxonomy generation methods on several metrics, such as concept coherence and coverage. Using a hybrid evaluation framework that combines automatic and human assessments, we demonstrate that our LLM-based solution is scalable, adaptable, and capable of generating high-quality taxonomies tailored to specific scientific concepts.
AB - Traditional data-driven automatic taxonomy generation methods struggle with complex, large, and domain-specific datasets. To address these issues, this study leverages Large Language Models (LLMs) to automate key stages of taxonomy generation, focusing on scientific concepts. Our approach employs LLMs at several stages of the taxonomy generation process, including extracting candidate concepts and organizing keywords into taxonomies centered around chosen scientific concepts. By incorporating LLMs, we aim to enhance depth, accuracy, and coherence of generated taxonomies. Comparative analyses show that the proposed LLM-based taxonomy generation method outperforms state-of-the-art taxonomy generation methods on several metrics, such as concept coherence and coverage. Using a hybrid evaluation framework that combines automatic and human assessments, we demonstrate that our LLM-based solution is scalable, adaptable, and capable of generating high-quality taxonomies tailored to specific scientific concepts.
KW - Automatic Taxonomy Construction
KW - LLMs for Taxonomy Construction
KW - Scientific Document Processing
UR - https://www.scopus.com/pages/publications/105023464214
U2 - 10.1007/978-3-032-04354-2_6
DO - 10.1007/978-3-032-04354-2_6
M3 - Contribución a la conferencia
AN - SCOPUS:105023464214
SN - 9783032043535
T3 - Lecture Notes in Computer Science
SP - 74
EP - 86
BT - Experimental IR Meets Multilinguality, Multimodality, and Interaction - 16th International Conference of the CLEF Association, CLEF 2025, Proceedings
A2 - Carrillo-de-Albornoz, Jorge
A2 - García Seco de Herrera, Alba
A2 - Gonzalo, Julio
A2 - Plaza, Laura
A2 - Mothe, Josiane
A2 - Piroi, Florina
A2 - Rosso, Paolo
A2 - Spina, Damiano
A2 - Faggioli, Guglielmo
A2 - Ferro, Nicola
PB - Springer Science and Business Media Deutschland GmbH
T2 - 16th International Conference of the Cross-Language Evaluation Forum for European Languages, CLEF 2025
Y2 - 9 September 2025 through 12 September 2025
ER -