PuSH - Publikationsserver des Helmholtz Zentrums München

Fischer, F. ; Fischer, D.S. ; Mukhin, R.* ; Isaev, A.* ; Biederstedt, E.* ; Villani, A.C.* ; Theis, F.J.

scTab: Scaling cross-tissue single-cell annotation models.

Nat. Commun. 15:6611 (2024)
Verlagsversion DOI PMC
Open Access Gold
Creative Commons Lizenzvertrag
Identifying cellular identities is a key use case in single-cell transcriptomics. While machine learning has been leveraged to automate cell annotation predictions for some time, there has been little progress in scaling neural networks to large data sets and in constructing models that generalize well across diverse tissues. Here, we propose scTab, an automated cell type prediction model specific to tabular data, and train it using a novel data augmentation scheme across a large corpus of single-cell RNA-seq observations (22.2 million cells). In this context, we show that cross-tissue annotation requires nonlinear models and that the performance of scTab scales both in terms of training dataset size and model size. Additionally, we show that the proposed data augmentation schema improves model generalization. In summary, we introduce a de novo cell type prediction model for single-cell RNA-seq data that can be trained across a large-scale collection of curated datasets and demonstrate the benefits of using deep learning methods in this paradigm.
Impact Factor
Scopus SNIP
Altmetric
14.700
0.000
Tags
Anmerkungen
Besondere Publikation
Auf Hompepage verbergern

Zusatzinfos bearbeiten
Eigene Tags bearbeiten
Privat
Eigene Anmerkung bearbeiten
Privat
Auf Publikationslisten für
Homepage nicht anzeigen
Als besondere Publikation
markieren
Publikationstyp Artikel: Journalartikel
Dokumenttyp Wissenschaftlicher Artikel
Sprache englisch
Veröffentlichungsjahr 2024
HGF-Berichtsjahr 2024
ISSN (print) / ISBN 2041-1723
e-ISSN 2041-1723
Zeitschrift Nature Communications
Quellenangaben Band: 15, Heft: 1, Seiten: , Artikelnummer: 6611 Supplement: ,
Verlag Nature Publishing Group
Verlagsort London
Begutachtungsstatus Peer reviewed
POF Topic(s) 30205 - Bioengineering and Digital Health
Forschungsfeld(er) Enabling and Novel Technologies
PSP-Element(e) G-503800-001
PubMed ID 39098889
Erfassungsdatum 2024-10-01