PuSH - Publikationsserver des Helmholtz Zentrums München

Schouten, J.P.E. ; Matek, C. ; Jacobs, L.F.P.* ; Buck, M.C.* ; Bošnački, D.* ; Marr, C.

Tens of images can suffice to train neural networks for malignant leukocyte detection.

Sci. Rep. 11:7995 (2021)
Verlagsversion DOI PMC
Open Access Gold
Creative Commons Lizenzvertrag
Convolutional neural networks (CNNs) excel as powerful tools for biomedical image classification. It is commonly assumed that training CNNs requires large amounts of annotated data. This is a bottleneck in many medical applications where annotation relies on expert knowledge. Here, we analyze the binary classification performance of a CNN on two independent cytomorphology datasets as a function of training set size. Specifically, we train a sequential model to discriminate non-malignant leukocytes from blast cells, whose appearance in the peripheral blood is a hallmark of leukemia. We systematically vary training set size, finding that tens of training images suffice for a binary classification with an ROC-AUC over 90%. Saliency maps and layer-wise relevance propagation visualizations suggest that the network learns to increasingly focus on nuclear structures of leukocytes as the number of training images is increased. A low dimensional tSNE representation reveals that while the two classes are separated already for a few training images, the distinction between the classes becomes clearer when more training images are used. To evaluate the performance in a multi-class problem, we annotated single-cell images from a acute lymphoblastic leukemia dataset into six different hematopoietic classes. Multi-class prediction suggests that also here few single-cell images suffice if differences between morphological classes are large enough. The incorporation of deep learning algorithms into clinical practice has the potential to reduce variability and cost, democratize usage of expertise, and allow for early detection of disease onset and relapse. Our approach evaluates the performance of a deep learning based cytology classifier with respect to size and complexity of the training data and the classification task.
Impact Factor
Scopus SNIP
Web of Science
Times Cited
Scopus
Cited By
Altmetric
4.379
1.377
2
8
Tags
Anmerkungen
Besondere Publikation
Auf Hompepage verbergern

Zusatzinfos bearbeiten
Eigene Tags bearbeiten
Privat
Eigene Anmerkung bearbeiten
Privat
Auf Publikationslisten für
Homepage nicht anzeigen
Als besondere Publikation
markieren
Publikationstyp Artikel: Journalartikel
Dokumenttyp Wissenschaftlicher Artikel
Schlagwörter Classification; Leukemia; Cancer
Sprache englisch
Veröffentlichungsjahr 2021
HGF-Berichtsjahr 2021
ISSN (print) / ISBN 2045-2322
e-ISSN 2045-2322
Zeitschrift Scientific Reports
Quellenangaben Band: 11, Heft: 1, Seiten: , Artikelnummer: 7995 Supplement: ,
Verlag Nature Publishing Group
Verlagsort London
Begutachtungsstatus Peer reviewed
POF Topic(s) 30205 - Bioengineering and Digital Health
Forschungsfeld(er) Enabling and Novel Technologies
PSP-Element(e) G-503800-001
Förderungen Deutsche Forschungsgemeinschaft
Scopus ID 85104271125
PubMed ID 33846442
Erfassungsdatum 2021-06-07