PuSH - Publikationsserver des Helmholtz Zentrums München

Wu, S. ; Thalmann, M. ; Dayan, P.* ; Akata, Z. ; Schulz, E.

Building, reusing, and generalizing abstract representations from concrete sequences.

In: (13th International Conference on Learning Representations Iclr 2025, 24 - 28 April 2025, Singapur). 2025. 95534-95557 (13th International Conference on Learning Representations Iclr 2025)
Humans excel at learning abstract patterns across different sequences, filtering out irrelevant details, and transferring these generalized concepts to new sequences. In contrast, many sequence learning models lack the ability to abstract, which leads to memory inefficiency and poor transfer. We introduce a non-parametric hierarchical variable learning model (HVM) that learns chunks from sequences and abstracts contextually similar chunks as variables. HVM efficiently organizes memory while uncovering abstractions, leading to compact sequence representations. When learning on language datasets such as babyLM, HVM learns a more efficient dictionary than standard compression algorithms such as Lempel-Ziv. In a sequence recall task requiring the acquisition and transfer of variables embedded in sequences, we demonstrate HVM's sequence likelihood correlates with human recall times. In contrast, large language models (LLMs) struggle to transfer abstract variables as effectively as humans. From HVM's adjustable layer of abstraction, we demonstrate that the model realizes a precise trade-off between compression and generalization. Our work offers a cognitive model that captures the learning and transfer of abstract representations in human cognition and differentiates itself from LLMs.
Tags
Anmerkungen
Besondere Publikation
Auf Hompepage verbergern

Zusatzinfos bearbeiten
Eigene Tags bearbeiten
Privat
Eigene Anmerkung bearbeiten
Privat
Auf Publikationslisten für
Homepage nicht anzeigen
Als besondere Publikation
markieren
Publikationstyp Artikel: Konferenzbeitrag
Sprache englisch
Veröffentlichungsjahr 2025
HGF-Berichtsjahr 2025
ISSN (print) / ISBN [9798331320850]
Konferenztitel 13th International Conference on Learning Representations Iclr 2025
Konferzenzdatum 24 - 28 April 2025
Konferenzort Singapur
Quellenangaben Band: , Heft: , Seiten: 95534-95557 Artikelnummer: , Supplement: ,
POF Topic(s) 30205 - Bioengineering and Digital Health
Forschungsfeld(er) Enabling and Novel Technologies
PSP-Element(e) G-540011-001
G-530008-001
Scopus ID 105010243227
Erfassungsdatum 2025-07-17