Tan, A., Goncalves, R., Yuan, W., Brat, G., Gentleman, R., Kohane, I., et al. (2024). Implications of mappings between International Classification of Diseases clinical diagnosis codes and Human Phenotype Ontology terms. JAMIA Open, 7(4) [10.1093/jamiaopen/ooae118].
Implications of mappings between International Classification of Diseases clinical diagnosis codes and Human Phenotype Ontology terms
Zambelli, A. (Member of the Collaboration Group)
2024
Abstract
We present a thorough empirical analysis of the compatibility between International Classification of Diseases (ICD) codes and Human Phenotype Ontology (HPO) terms based on the Unified Medical Language System (UMLS) Metathesaurus. ICD is used to annotate clinical diagnoses in electronic health record (EHR) data, while HPO is used to annotate phenotypes in research databases. Bridging the two artifacts is essential for health data integration and analysis. UMLS is a widely used source of cross-ontology mappings, so it is important to quantitatively assess the extent to which ICD is mapped to HPO in the UMLS. Our primary results include the finding that a mere 2.2% of ICD codes in UMLS are directly linked to HPO. Furthermore, an analysis of our EHR dataset shows that less than half of the commonly used ICD codes can be mapped to HPO terms. Notably, commonly used ICD codes in EHR data tend to have corresponding mappings to HPO. In contrast, ICD codes representing rarer medical conditions are infrequently associated with HPO terms.

| File | Size | Format |
|---|---|---|
| Tan-2024-JAMIA Open-VoR.pdf (open access). Description: This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/). Attachment type: Publisher’s Version (Version of Record, VoR). License: Creative Commons | 1.02 MB | Adobe PDF |


