We present a thorough empirical analysis of the compatibility between international classification of diseases (ICD) codes and human phenotype ontology (HPO) terms based on the unified medical language system (UMLS) Metathesaurus. ICD is used to annotate clinical diagnoses in EHR data, while HPO is used to annotate phenotypes in research databases. Bridging between the 2 artifacts is essential for health data integration and analysis. UMLS is a widely used source of cross-ontology mappings, and so it is important to quantitatively assess the extent to which ICD is mapped to HPO in the UMLS. The primary results from the paper include that a mere 2.2% of ICD codes in UMLS are directly linked to HPO. Furthermore, an analysis of our EHR dataset shows that less than half of the commonly used ICD codes can be mapped to HPO terms. Notably, commonly used ICD codes in EHR data tend to have corresponding mappings to HPO. In contrast, ICD codes representing rarer medical conditions are infrequently associated with HPO terms.

Tan, A., Goncalves, R., Yuan, W., Brat, G., Gentleman, R., Kohane, I., et al. (2024). Implications of mappings between International Classification of Diseases clinical diagnosis codes and Human Phenotype Ontology terms. JAMIA OPEN, 7(4) [10.1093/jamiaopen/ooae118].

Implications of mappings between International Classification of Diseases clinical diagnosis codes and Human Phenotype Ontology terms

Zambelli, A
Membro del Collaboration Group
2024

Abstract

We present a thorough empirical analysis of the compatibility between international classification of diseases (ICD) codes and human phenotype ontology (HPO) terms based on the unified medical language system (UMLS) Metathesaurus. ICD is used to annotate clinical diagnoses in EHR data, while HPO is used to annotate phenotypes in research databases. Bridging between the 2 artifacts is essential for health data integration and analysis. UMLS is a widely used source of cross-ontology mappings, and so it is important to quantitatively assess the extent to which ICD is mapped to HPO in the UMLS. The primary results from the paper include that a mere 2.2% of ICD codes in UMLS are directly linked to HPO. Furthermore, an analysis of our EHR dataset shows that less than half of the commonly used ICD codes can be mapped to HPO terms. Notably, commonly used ICD codes in EHR data tend to have corresponding mappings to HPO. In contrast, ICD codes representing rarer medical conditions are infrequently associated with HPO terms.
Articolo in rivista - Articolo scientifico
data interoperability; ontology; ontology interoperability;
English
18-nov-2024
2024
7
4
ooae118
open
Tan, A., Goncalves, R., Yuan, W., Brat, G., Gentleman, R., Kohane, I., et al. (2024). Implications of mappings between International Classification of Diseases clinical diagnosis codes and Human Phenotype Ontology terms. JAMIA OPEN, 7(4) [10.1093/jamiaopen/ooae118].
File in questo prodotto:
File Dimensione Formato  
Tan-2024-JAMIA Open-VoR.pdf

accesso aperto

Descrizione: This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/)
Tipologia di allegato: Publisher’s Version (Version of Record, VoR)
Licenza: Creative Commons
Dimensione 1.02 MB
Formato Adobe PDF
1.02 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/10281/560265
Citazioni
  • Scopus 2
  • ???jsp.display-item.citation.isi??? 2
Social impact