This paper introduces a system which integrates large language models (LLMs) into clinical trials retrieval, improving patient-trial matching while preserving data privacy and expert oversight. We evaluate six LLMs for query generation, focusing on open-source and small models requiring minimal computational resources. Our findings show that these models achieve retrieval effectiveness comparable to or exceeding expert-created queries and consistently outperform standard baselines and literature approaches. The best-performing LLMs exhibit fast response times (1.7-8 seconds) and generate a manageable number of query terms (15-63). Our results suggest that small, open-source LLMs can effectively balance performance, computational efficiency, and real-world applicability in clinical trial retrieval.

Peikos, G., Kasela, P., Pasi, G. (2024). Leveraging Large Language Models for Medical Information Extraction and Query Generation. In Proceedings - 2024 IEEE/WIC International Conference on Web Intelligence and Intelligent Agent Technology, WI-IAT 2024 (pp.367-372). Institute of Electrical and Electronics Engineers Inc. [10.1109/WI-IAT62293.2024.00058].

Leveraging Large Language Models for Medical Information Extraction and Query Generation

Georgios Peikos;Pranav Kasela;Gabriella Pasi
2024

Abstract

This paper introduces a system which integrates large language models (LLMs) into clinical trials retrieval, improving patient-trial matching while preserving data privacy and expert oversight. We evaluate six LLMs for query generation, focusing on open-source and small models requiring minimal computational resources. Our findings show that these models achieve retrieval effectiveness comparable to or exceeding expert-created queries and consistently outperform standard baselines and literature approaches. The best-performing LLMs exhibit fast response times (1.7-8 seconds) and generate a manageable number of query terms (15-63). Our results suggest that small, open-source LLMs can effectively balance performance, computational efficiency, and real-world applicability in clinical trial retrieval.
paper
clinical trial retrieval; information retrieval; large language models; natural language processing; text generation;
English
The 23rd IEEE/WIC International Conference on Web Intelligence and Intelligent Agent Technology - December 9-12, 2024
2024
Proceedings - 2024 IEEE/WIC International Conference on Web Intelligence and Intelligent Agent Technology, WI-IAT 2024
9798331504946
2024
367
372
open
Peikos, G., Kasela, P., Pasi, G. (2024). Leveraging Large Language Models for Medical Information Extraction and Query Generation. In Proceedings - 2024 IEEE/WIC International Conference on Web Intelligence and Intelligent Agent Technology, WI-IAT 2024 (pp.367-372). Institute of Electrical and Electronics Engineers Inc. [10.1109/WI-IAT62293.2024.00058].
File in questo prodotto:
File Dimensione Formato  
Peikos-2024-23 IEEE/WIC Int Conf-AAM.pdf

accesso aperto

Tipologia di allegato: Author’s Accepted Manuscript, AAM (Post-print)
Licenza: Creative Commons
Dimensione 384.39 kB
Formato Adobe PDF
384.39 kB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/548721
Citazioni
  • Scopus 2
  • ???jsp.display-item.citation.isi??? 1
Social impact