PuSH - Publikationsserver des Helmholtz Zentrums München

Albert, T.* ; Eskofier, B.M. ; Zanca, D.*

From patches to objects: Exploiting spatial reasoning for better visual representations.

SN appl. sci. 6:232 (2024)
Verlagsversion DOI
Open Access Gold
Creative Commons Lizenzvertrag
As the field of deep learning steadily transitions from the realm of academic research to practical application, the significance of self-supervised pretraining methods has become increasingly prominent. These methods, particularly in the image domain, offer a compelling strategy to effectively utilize the abundance of unlabeled image data, thereby enhancing downstream tasks’ performance. In this paper, we propose Spatial Reasoning, a novel auxiliary pretraining method that takes advantage of a more flexible formulation of contrastive learning by introducing spatial reasoning as an auxiliary task for discriminative self-supervised methods. Spatial Reasoning works by having the network predict the relative distances between sampled non-overlapping patches. We argue that this forces the network to learn more detailed and intricate internal representations of the objects and the relationships between their constituting parts. Our experiments demonstrate substantial improvement in downstream performance in linear evaluation compared to similar work and provide directions for further research into spatial reasoning.
Altmetric
Weitere Metriken?
Zusatzinfos bearbeiten [➜Einloggen]
Publikationstyp Artikel: Journalartikel
Dokumenttyp Wissenschaftlicher Artikel
Korrespondenzautor
Schlagwörter Contrastive Learning ; Linear Evaluation ; Relational Reasoning ; Self-supervised Learning ; Spatial Reasoning
ISSN (print) / ISBN 2523-3963
e-ISSN 2523-3971
Zeitschrift SN applied sciences
Quellenangaben Band: 6, Heft: 5, Seiten: , Artikelnummer: 232 Supplement: ,
Verlag Springer
Verlagsort [Cham]
Nichtpatentliteratur Publikationen
Begutachtungsstatus Peer reviewed
Institut(e) Institute of AI for Health (AIH)
Förderungen Projekt DEAL