PuSH - Publikationsserver des Helmholtz Zentrums München

Salvatore, M.* ; Horlacher, M. ; Marsico, A. ; Winther, O.* ; Andersson, R.*

Transfer learning identifies sequence determinants of cell-type specific regulatory element accessibility.

NAR Gen. Bioinfo. 5:lqad026 (2023)
Verlagsversion DOI PMC
Open Access Gold
Creative Commons Lizenzvertrag
Dysfunction of regulatory elements through genetic variants is a central mechanism in the pathogenesis of disease. To better understand disease etiology, there is consequently a need to understand how DNA encodes regulatory activity. Deep learning methods show great promise for modeling of biomolecular data from DNA sequence but are limited to large input data for training. Here, we develop ChromTransfer, a transfer learning method that uses a pre-trained, cell-type agnostic model of open chromatin regions as a basis for fine-tuning on regulatory sequences. We demonstrate superior performances with ChromTransfer for learning cell-type specific chromatin accessibility from sequence compared to models not informed by a pre-trained model. Importantly, ChromTransfer enables fine-tuning on small input data with minimal decrease in accuracy. We show that ChromTransfer uses sequence features matching binding site sequences of key transcription factors for prediction. Together, these results demonstrate ChromTransfer as a promising tool for learning the regulatory code.
Impact Factor
Scopus SNIP
Altmetric
4.600
0.000
Tags
Anmerkungen
Besondere Publikation
Auf Hompepage verbergern

Zusatzinfos bearbeiten
Eigene Tags bearbeiten
Privat
Eigene Anmerkung bearbeiten
Privat
Auf Publikationslisten für
Homepage nicht anzeigen
Als besondere Publikation
markieren
Publikationstyp Artikel: Journalartikel
Dokumenttyp Wissenschaftlicher Artikel
Schlagwörter Transcription Factors; Gene-expression; Enhancers; Binding; Genome
Sprache englisch
Veröffentlichungsjahr 2023
HGF-Berichtsjahr 2023
ISSN (print) / ISBN 2631-9268
e-ISSN 2631-9268
Quellenangaben Band: 5, Heft: 2, Seiten: , Artikelnummer: lqad026 Supplement: ,
Verlag Oxford University Press
Verlagsort Great Clarendon St, Oxford Ox2 6dp, England
Begutachtungsstatus Peer reviewed
POF Topic(s) 30205 - Bioengineering and Digital Health
Forschungsfeld(er) Enabling and Novel Technologies
PSP-Element(e) G-503800-004
G-503800-001
Scopus ID 85159828077
PubMed ID 37007588
Erfassungsdatum 2023-10-06