PuSH - Publikationsserver des Helmholtz Zentrums München

Łazęcka, M.* ; Szczurek, E.

Factor Analysis with Correlated Topic Model for Multi-Modal Data.

In: (28th International Conference on Artificial Intelligence and Statistics, AISTATS 2025, 3-5 May 2025, Mai Khao). 2025. 1801-1809 (Proceedings of Machine Learning Research ; 258)
Postprint
Integrating various data modalities brings valuable insights into underlying phenomena. Multimodal factor analysis (FA) uncovers shared axes of variation underlying different simple data modalities, where each sample is represented by a vector of features. However, FA is not suited for structured data modalities, such as text or single cell sequencing data, where multiple data points are measured per each sample and exhibit a clustering structure. To overcome this challenge, we introduce FACTM, a novel, multi-view and multi-structure Bayesian model that combines FA with correlated topic modeling and is optimized using variational inference. Additionally, we introduce a method for rotating latent factors to enhance interpretability with respect to binary features. On text and video benchmarks as well as real-world music and COVID-19 datasets, we demonstrate that FACTM outperforms other methods in identifying clusters in structured data, and integrating them with simple modalities via the inference of shared, interpretable factors.
Weitere Metriken?
Zusatzinfos bearbeiten [➜Einloggen]
Publikationstyp Artikel: Konferenzbeitrag
Konferenztitel 28th International Conference on Artificial Intelligence and Statistics, AISTATS 2025
Konferzenzdatum 3-5 May 2025
Konferenzort Mai Khao
Quellenangaben Band: 258, Heft: , Seiten: 1801-1809 Artikelnummer: , Supplement: ,