Identification, semantic annotation and comparison of combinations of functional elements in multiple biological conditions

Pelizzola M
Co-last author
2022

Abstract

Motivation: Approaches such as chromatin immunoprecipitation followed by sequencing (ChIP-seq) represent the standard for the identification of binding sites of DNA-associated proteins, including transcription factors and histone marks. Public repositories of omics data contain a huge number of experimental ChIP-seq data, but their reuse and integrative analysis across multiple conditions remain a daunting task.

Results: We present the Combinatorial and Semantic Analysis of Functional Elements (CombSAFE), an efficient computational method able to integrate and take advantage of the valuable and numerous, but heterogeneous, ChIP-seq data publicly available in big data repositories. Leveraging natural language processing techniques, it integrates omics data samples with semantic annotations from selected biomedical ontologies; then, using hidden Markov models, it identifies combinations of static and dynamic functional elements throughout the genome for the corresponding samples. CombSAFE allows analyzing the whole genome, by clustering patterns of regions with similar functional elements and through enrichment analyses to discover ontological terms significantly associated with them. Moreover, it allows comparing functional states of a specific genomic region to analyze their different behavior throughout the various semantic annotations. Such findings can provide novel insights by identifying unexpected combinations of functional elements in different biological conditions.
Journal article - Scientific article
Chromatin Immunoprecipitation Sequencing; Genome; Genomics; Semantics; Sequence Analysis, DNA
English
2 Dec 2021
Year: 2022
Volume: 38
Issue: 5
First page: 1183
Last page: 1190
reserved
Leone, M., Galeota, E., Masseroli, M., Pelizzola, M. (2022). Identification, semantic annotation and comparison of combinations of functional elements in multiple biological conditions. BIOINFORMATICS, 38(5), 1183-1190 [10.1093/bioinformatics/btab815].
Files in this record:
File: Leone-2022-Bioinformatics-VoR.pdf
Access: Archive managers only
Description: Article
Attachment type: Publisher’s Version (Version of Record, VoR)
License: All rights reserved
Size: 4.75 MB
Format: Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/10281/446759
Citations
  • Scopus 1
  • Web of Science (ISI) 1