PuSH - Publikationsserver des Helmholtz Zentrums München

Wahl, S. ; Fenske, N.* ; Zeilinger, S. ; Suhre, K. ; Gieger, C. ; Waldenberger, M. ; Grallert, H. ; Schmid, M.*

On the potential of models for location and scale for genome-wide DNA methylation data.

BMC Bioinformatics 15:232 (2014)
Verlagsversion DOI PMC
Open Access Gold
Creative Commons Lizenzvertrag
BACKGROUND: With the help of epigenome-wide association studies (EWAS), increasing knowledge on the role of epigenetic mechanisms such as DNA methylation in disease processes is obtained. In addition, EWAS aid the understanding of behavioral and environmental effects on DNA methylation. In terms of statistical analysis, specific challenges arise from the characteristics of methylation data. First, methylation β-values represent proportions with skewed and heteroscedastic distributions. Thus, traditional modeling strategies assuming a normally distributed response might not be appropriate. Second, recent evidence suggests that not only mean differences but also variability in site-specific DNA methylation associates with diseases, including cancer. The purpose of this study was to compare different modeling strategies for methylation data in terms of model performance and performance of downstream hypothesis tests. Specifically, we used the generalized additive models for location, scale and shape (GAMLSS) framework to compare beta regression with Gaussian regression on raw, binary logit and arcsine square root transformed methylation data, with and without modeling a covariate effect on the scale parameter. RESULTS: Using simulated and real data from a large population-based study and an independent sample of cancer patients and healthy controls, we show that beta regression does not outperform competing strategies in terms of model performance. In addition, Gaussian models for location and scale showed an improved performance as compared to models for location only. The best performance was observed for the Gaussian model on binary logit transformed β-values, referred to as M-values. Our results further suggest that models for location and scale are specifically sensitive towards violations of the distribution assumption and towards outliers in the methylation data. Therefore, a resampling procedure is proposed as a mode of inference and shown to diminish type I error rate in practically relevant settings. We apply the proposed method in an EWAS of BMI and age and reveal strong associations of age with methylation variability that are validated in an independent sample. CONCLUSIONS: Models for location and scale are promising tools for EWAS that may help to understand the influence of environmental factors and disease-related phenotypes on methylation variability and its role during disease development.
Impact Factor
Scopus SNIP
Web of Science
Times Cited
Scopus
Cited By
Altmetric
2.672
1.264
13
12
Tags
Anmerkungen
Besondere Publikation
Auf Hompepage verbergern

Zusatzinfos bearbeiten
Eigene Tags bearbeiten
Privat
Eigene Anmerkung bearbeiten
Privat
Auf Publikationslisten für
Homepage nicht anzeigen
Als besondere Publikation
markieren
Publikationstyp Artikel: Journalartikel
Dokumenttyp Wissenschaftlicher Artikel
Schlagwörter Dna Methylation ; Beta Regression ; Gamlss ; Infinium Humanmethylation450k Beadchip ; Ewas ; Modeling Variability ; Resampling ; Model Performance ; Model Comparison ; Models For Location And Scale; Regression-analysis; Beta Regression; Association; Proportions; Disease; Variability; Population; Selection; Leukemia; Platform
Sprache englisch
Veröffentlichungsjahr 2014
HGF-Berichtsjahr 2014
ISSN (print) / ISBN 1471-2105
e-ISSN 1471-2105
Zeitschrift BMC Bioinformatics
Quellenangaben Band: 15, Heft: , Seiten: , Artikelnummer: 232 Supplement: ,
Verlag BioMed Central
Verlagsort London
Begutachtungsstatus Peer reviewed
POF Topic(s) 30202 - Environmental Health
30505 - New Technologies for Biomedical Discoveries
30501 - Systemic Analysis of Genetic and Environmental Factors that Impact Health
Forschungsfeld(er) Genetics and Epidemiology
Enabling and Novel Technologies
PSP-Element(e) G-504091-002
G-504091-001
G-503700-001
G-504100-001
G-504090-001
PubMed ID 24994026
Scopus ID 84907714293
Erfassungsdatum 2014-07-30