PuSH - Publication Server of Helmholtz Zentrum München

Usynin, D.* ; Rueckert, D.* ; Kaissis, G.

Incentivising the federation: Gradient-based metrics for data selection and valuation in private decentralised training.

In: (EICC '24: Proceedings of the 2024 European Interdisciplinary Cybersecurity Conference). ACM, 2024. 179-185 (ACM International Conference Proceeding Series)
DOI
Obtaining high-quality data for collaborative training of machine learning models can be a challenging task due to A) regulatory concerns and B) a lack of data owner incentives to participate. The first issue can be addressed through the combination of distributed machine learning techniques (e.g. federated learning) and privacy enhancing technologies (PET), such as the differentially private (DP) model training. The second challenge can be addressed by rewarding the participants for giving access to data which is beneficial to the training model, which is of particular importance in federated settings, where the data is unevenly distributed. However, DP noise can adversely affect the underrepresented and the atypical (yet often informative) data samples, making it difficult to assess their usefulness. In this work, we investigate how to leverage gradient information to permit the participants of private training settings to select the data most beneficial for the jointly trained model. We assess two such methods, namely variance of gradients (VoG) and the privacy loss-input susceptibility score (PLIS). We show that these techniques can provide the federated clients with tools for principled data selection even in stricter privacy settings.
Altmetric
Tags
Annotations
Special Publikation
Hide on homepage

Edit extra information
Edit own tags
Private
Edit own annotation
Private
Hide on publication lists
on hompage
Mark as special
publikation
Publication type Article: Conference contribution
Keywords Data Valuation ; Differential Privacy ; Federated Learning
Language english
Publication Year 2024
HGF-reported in Year 2024
Conference Title EICC '24: Proceedings of the 2024 European Interdisciplinary Cybersecurity Conference
Quellenangaben Volume: , Issue: , Pages: 179-185 Article Number: , Supplement: ,
Publisher ACM
Institute(s) Institute for Machine Learning in Biomed Imaging (IML)
POF-Topic(s) 30205 - Bioengineering and Digital Health
Research field(s) Enabling and Novel Technologies
PSP Element(s) G-507100-001
Scopus ID 85196160222
Erfassungsdatum 2024-06-25