Bicocca Open Archive

This paper introduces a system which integrates large language models (LLMs) into clinical trials retrieval, improving patient-trial matching while preserving data privacy and expert oversight. We evaluate six LLMs for query generation, focusing on open-source and small models requiring minimal computational resources. Our findings show that these models achieve retrieval effectiveness comparable to or exceeding expert-created queries and consistently outperform standard baselines and literature approaches. The best-performing LLMs exhibit fast response times (1.7-8 seconds) and generate a manageable number of query terms (15-63). Our results suggest that small, open-source LLMs can effectively balance performance, computational efficiency, and real-world applicability in clinical trial retrieval.

Peikos, G., Kasela, P., Pasi, G. (2024). Leveraging Large Language Models for Medical Information Extraction and Query Generation. In Proceedings - 2024 IEEE/WIC International Conference on Web Intelligence and Intelligent Agent Technology, WI-IAT 2024 (pp.367-372). Institute of Electrical and Electronics Engineers Inc. [10.1109/WI-IAT62293.2024.00058].

Leveraging Large Language Models for Medical Information Extraction and Query Generation

Georgios Peikos;Pranav Kasela;Gabriella Pasi

2024

Abstract

This paper introduces a system which integrates large language models (LLMs) into clinical trials retrieval, improving patient-trial matching while preserving data privacy and expert oversight. We evaluate six LLMs for query generation, focusing on open-source and small models requiring minimal computational resources. Our findings show that these models achieve retrieval effectiveness comparable to or exceeding expert-created queries and consistently outperform standard baselines and literature approaches. The best-performing LLMs exhibit fast response times (1.7-8 seconds) and generate a manageable number of query terms (15-63). Our results suggest that small, open-source LLMs can effectively balance performance, computational efficiency, and real-world applicability in clinical trial retrieval.

Scheda breve

Scheda completa

Scheda completa (DC)

	Tipo di intervento
	
				paper
			
	Parole chiave
	
				clinical trial retrieval; information retrieval; large language models; natural language processing; text generation;
			
	Lingua del contenuto
	
				English
			
	Nome del convegno
	
				The 23rd IEEE/WIC International Conference on Web Intelligence and Intelligent Agent Technology - December 9-12, 2024
			
	Anno del convegno
	
				2024
			
	Titolo degli atti
	
				Proceedings - 2024 IEEE/WIC International Conference on Web Intelligence and Intelligent Agent Technology, WI-IAT 2024
			
	ISBN del volume degli atti
	
				9798331504946
			
	Data di pubblicazione
	
				2024
			
	Pagina iniziale
	
				367
			
	Pagina finale
	
				372
			
	DOI dell'intervento
	
				https://dx.doi.org/10.1109/WI-IAT62293.2024.00058
			
	Fulltext
	
				open
			
	Citazione
	
				Peikos, G., Kasela, P., Pasi, G. (2024). Leveraging Large Language Models for Medical Information Extraction and Query Generation. In Proceedings - 2024 IEEE/WIC International Conference on Web Intelligence and Intelligent Agent Technology, WI-IAT 2024 (pp.367-372). Institute of Electrical and Electronics Engineers Inc. [10.1109/WI-IAT62293.2024.00058].
			
	Appare nelle tipologie:
	
				02 - Intervento a convegno

File in questo prodotto:

File	Dimensione	Formato
Peikos-2024-23 IEEE/WIC Int Conf-AAM.pdf accesso aperto Tipologia di allegato: Author’s Accepted Manuscript, AAM (Post-print) Licenza: Creative Commons Dimensione 384.39 kB Formato Adobe PDF Visualizza/Apri	384.39 kB	Adobe PDF	Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/548721

Citazioni

2

1

Social impact