This paper introduces a system which integrates large language models (LLMs) into clinical trials retrieval, improving patient-trial matching while preserving data privacy and expert oversight. We evaluate six LLMs for query generation, focusing on open-source and small models requiring minimal computational resources. Our findings show that these models achieve retrieval effectiveness comparable to or exceeding expert-created queries and consistently outperform standard baselines and literature approaches. The best-performing LLMs exhibit fast response times (1.7-8 seconds) and generate a manageable number of query terms (15-63). Our results suggest that small, open-source LLMs can effectively balance performance, computational efficiency, and real-world applicability in clinical trial retrieval.
Peikos, G., Kasela, P., Pasi, G. (2024). Leveraging Large Language Models for Medical Information Extraction and Query Generation. In Proceedings - 2024 IEEE/WIC International Conference on Web Intelligence and Intelligent Agent Technology, WI-IAT 2024 (pp.367-372). Institute of Electrical and Electronics Engineers Inc. [10.1109/WI-IAT62293.2024.00058].
Leveraging Large Language Models for Medical Information Extraction and Query Generation
Georgios Peikos;Pranav Kasela;Gabriella Pasi
2024
Abstract
This paper introduces a system which integrates large language models (LLMs) into clinical trials retrieval, improving patient-trial matching while preserving data privacy and expert oversight. We evaluate six LLMs for query generation, focusing on open-source and small models requiring minimal computational resources. Our findings show that these models achieve retrieval effectiveness comparable to or exceeding expert-created queries and consistently outperform standard baselines and literature approaches. The best-performing LLMs exhibit fast response times (1.7-8 seconds) and generate a manageable number of query terms (15-63). Our results suggest that small, open-source LLMs can effectively balance performance, computational efficiency, and real-world applicability in clinical trial retrieval.| File | Dimensione | Formato | |
|---|---|---|---|
|
Peikos-2024-23 IEEE/WIC Int Conf-AAM.pdf
accesso aperto
Tipologia di allegato:
Author’s Accepted Manuscript, AAM (Post-print)
Licenza:
Creative Commons
Dimensione
384.39 kB
Formato
Adobe PDF
|
384.39 kB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.


