Bicocca Open Archive

Ensuring the quality of knowledge graphs (KGs) is crucial for the success of the intelligent applications they support. Recent advances in large language models (LLMs) have demonstrated human-level performance across various tasks, raising the question of their potential for KG validation. In this work, we explore the role of LLMs in human-centric KG validation workflows, examining different collaboration strategies between LLMs and domain experts. We propose and evaluate nine distinct approaches, ranging from fully automated validation to hybrid methods that combine expert oversight with AI assistance. These workflows are tested within a real-world KG construction pipeline used to generate the Computer Science Knowledge Graph (CS-KG), a large-scale resource designed to support scientometric tasks such as trend forecasting and hypothesis generation. CS-KG comprises 41 million statements represented as 350 million triples within the Computer Science domain. Our findings show that integrating LLMs into the CS-KG verification process enhances precision by 12%, improving alignment with expert-level validation. However, this comes at the cost of recall, resulting in a 5% decrease in the overall F1 score. In contrast, a hybrid approach which involves both human-in-the-loop and LLM modules, yields the best overall results, improving F1 score by 5% with minimal human involvement.

Tsaneva, S., Dessì, D., Osborne, F., Sabou, M. (2025). Knowledge graph validation by integrating LLMs and human-in-the-loop. INFORMATION PROCESSING & MANAGEMENT, 62(5) [10.1016/j.ipm.2025.104145].

Knowledge graph validation by integrating LLMs and human-in-the-loop

Tsaneva S.;Dessì D.;Osborne F.;Sabou M.

2025

Abstract

Ensuring the quality of knowledge graphs (KGs) is crucial for the success of the intelligent applications they support. Recent advances in large language models (LLMs) have demonstrated human-level performance across various tasks, raising the question of their potential for KG validation. In this work, we explore the role of LLMs in human-centric KG validation workflows, examining different collaboration strategies between LLMs and domain experts. We propose and evaluate nine distinct approaches, ranging from fully automated validation to hybrid methods that combine expert oversight with AI assistance. These workflows are tested within a real-world KG construction pipeline used to generate the Computer Science Knowledge Graph (CS-KG), a large-scale resource designed to support scientometric tasks such as trend forecasting and hypothesis generation. CS-KG comprises 41 million statements represented as 350 million triples within the Computer Science domain. Our findings show that integrating LLMs into the CS-KG verification process enhances precision by 12%, improving alignment with expert-level validation. However, this comes at the cost of recall, resulting in a 5% decrease in the overall F1 score. In contrast, a hybrid approach which involves both human-in-the-loop and LLM modules, yields the best overall results, improving F1 score by 5% with minimal human involvement.

Scheda breve

Scheda completa

Scheda completa (DC)

	Sottotipologia
	
				Articolo in rivista - Articolo scientifico
			
	Parole chiave
	
				Hybrid human-AI workflows; Knowledge graph validation; Large language models;
			
	Lingua del contenuto
	
				English
			
	Data ahead of print o Data prima pubblicazione Online
	
				9-apr-2025
			
	Data di pubblicazione
	
				2025
			
	Rivista
	
				INFORMATION PROCESSING & MANAGEMENT
			
	Numero del volume
	
				62
			
	Fascicolo
	
				5
			
	Article number
	
				104145
			
	DOI dell'articolo
	
				https://dx.doi.org/10.1016/j.ipm.2025.104145
			
	Fulltext
	
				open
			
	Citazione
	
				Tsaneva, S., Dessì, D., Osborne, F., Sabou, M. (2025). Knowledge graph validation by integrating LLMs and human-in-the-loop. INFORMATION PROCESSING & MANAGEMENT, 62(5) [10.1016/j.ipm.2025.104145].
			
	Appare nelle tipologie:
	
				01 - Articolo su rivista

File in questo prodotto:

File	Dimensione	Formato
Tsaneva-2025-Information Processing and Management-VoR.pdf accesso aperto Descrizione: Knowledge graph validation by integrating LLMs and human-in-the-loop Tipologia di allegato: Publisher’s Version (Version of Record, VoR) Licenza: Creative Commons Dimensione 1.87 MB Formato Adobe PDF Visualizza/Apri	1.87 MB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/553720

Citazioni

15

6

Social impact