Zappia, L. ; Richter, S. ; Ramirez Suastegui, C. ; Kfuri-Rubens, R. ; Vornholz, L. ; Wang, W. ; Dietrich, O. ; Frishberg, A. ; Luecken, M. ; Theis, F.J.
Feature selection methods affect the performance of scRNA-seq data integration and querying.
Nat. Methods 22, 834-844 (2025)
The availability of single-cell transcriptomics has allowed the construction of reference cell atlases, but their usefulness depends on the quality of dataset integration and the ability to map new samples. Previous benchmarks have compared integration methods and suggest that feature selection improves performance but have not explored how best to select features. Here, we benchmark feature selection methods for single-cell RNA sequencing integration using metrics beyond batch correction and preservation of biological variation to assess query mapping, label transfer and the detection of unseen populations. We reinforce common practice by showing that highly variable feature selection is effective for producing high-quality integrations and provide further guidance on the effect of the number of features selected, batch-aware feature selection, lineage-specific feature selection and integration and the interaction between feature selection and integration models. These results are informative for analysts working on large-scale tissue atlases, using atlases or integrating their own data to tackle specific biological questions.
Impact Factor
Scopus SNIP
Web of Science
Times Cited
Scopus
Cited By
Altmetric
Publication type
Article: Journal article
Document type
Scientific Article
Thesis type
Editors
Keywords
Gene Selection; Single; Biomart; Atlas
Keywords plus
Language
english
Publication Year
2025
Prepublished in Year
0
HGF-reported in Year
2025
ISSN (print) / ISBN
1548-7091
e-ISSN
1548-7105
ISBN
Book Volume Title
Conference Title
Conference Date
Conference Location
Proceedings Title
Quellenangaben
Volume: 22,
Issue: 4,
Pages: 834-844
Article Number: ,
Supplement: ,
Series
Publisher
Nature Publishing Group
Publishing Place
New York, NY
Day of Oral Examination
0000-00-00
Advisor
Referee
Examiner
Topic
University
University place
Faculty
Publication date
0000-00-00
Application date
0000-00-00
Patent owner
Further owners
Application country
Patent priority
Reviewing status
Peer reviewed
POF-Topic(s)
30205 - Bioengineering and Digital Health
80000 - German Center for Lung Research
Research field(s)
Enabling and Novel Technologies
Lung Research
PSP Element(s)
G-503800-001
G-501800-833
Grants
Helmholtz Information and Data Science Academy to enable a short-term research stay at Helmholtz Munich
Helmholtz Association under the joint research school Munich School for Data Science
Chan Zuckerberg Initiative
Arts in the framework of the Bavarian Research Association 'ForInter' (Interaction of human brain cells)
Bavarian Ministry of Science
Copyright
Erfassungsdatum
2025-05-08