Ontology matching with Large Language Models and prioritized depth-first search

Taboada Iglesias, María Jesús; Martínez Hernández, Diego; Arideh, Mohammed; Mosquera Losada, María Rosa

doi:10.1016/j.inffus.2025.103254

Ontology matching with Large Language Models and prioritized depth-first search

Files

2025_inffus_taboada_ontology.pdf (2.8 MB)

Identifiers

URI: https://hdl.handle.net/10347/43542

ISSN: 1566-2535

E-ISSN: 1872-6305

DOI: 10.1016/j.inffus.2025.103254

Publication date

2025-05-07

Authors

Taboada Iglesias, María Jesús

Martínez Hernández, Diego

Arideh, Mohammed

Mosquera Losada, María Rosa

Publisher

Elsevier

Metrics

Export

Abstract

Ontology matching (OM) plays a key role in enabling data interoperability and knowledge sharing. Recently, methods based on Large Language Model (LLMs) have shown great promise in OM, particularly through the use of a retrieve-then-prompt pipeline. In this approach, relevant target entities are first retrieved and then used to prompt the LLM to predict the final matches. Despite their potential, these systems still present limited performance and high computational overhead. To address these issues, we introduce MILA, a novel approach that embeds a retrieve-identify-prompt pipeline within a prioritized depth-first search (PDFS) strategy. This approach efficiently identifies a large number of semantic correspondences with high accuracy, limiting LLM requests to only the most borderline cases. We evaluated MILA using three challenges from the 2024 edition of the Ontology Alignment Evaluation Initiative. Our method achieved the highest F-Measure in five of seven unsupervised tasks, outperforming state-of-the-art OM systems by up to 17%. It also performed better than or comparable to the leading supervised OM systems. MILA further exhibited task-agnostic performance, remaining stable across all tasks and settings, while significantly reducing runtime. These findings highlight that high-performance LLM-based OM can be achieved through a combination of programmed (PDFS), learned (embedding vectors), and prompting-based heuristics, without the need of domain-specific heuristics or fine-tuning.

Keywords

Ontology matching| Retrieval augmented generation| Greedy search| Large Language Models| Zero-shot setting

Bibliographic citation

Taboada, M., Martinez, D., Arideh, M., & Mosquera, R. (2025). Ontology matching with Large Language Models and prioritized depth-first search. Information Fusion, 123, 103254. 10.1016/j.inffus.2025.103254

Publisher version

https://doi.org/10.1016/j.inffus.2025.103254

Rights

Collections

Electrónica e Computación
Bioloxía Funcional
Física Aplicada

Full item page

Ontology matching with Large Language Models and prioritized depth-first search

Files

Identifiers

Publication date

Authors

Advisors

Tutors

Editors

Journal Title

Journal ISSN

Volume Title

Publisher

Metrics

Export

Research Projects

Organizational Units

Journal Issue

Abstract

Description

Keywords

Bibliographic citation

Relation

Has part

Has version

Is based on

Is part of

Is referenced by

Is version of

Requires

Publisher version

Sponsors

Rights

Collections