Weinhold, L.* ; Wahl, S. ; Pechlivanis, S.* ; Hoffmann, P.* ; Schmid, M.*
A statistical model for the analysis of beta values in DNA methylation studies.
BMC Bioinformatics 17:480 (2016)
Background: The analysis of DNA methylation is a key component in the development of personalized treatment approaches. A common way to measure DNA methylation is the calculation of beta values, which are bounded variables of the form M/(M+U) that are generated by Illumina's 450k BeadChip array. The statistical analysis of beta values is considered to be challenging, as traditional methods for the analysis of bounded variables, such as M-value regression and beta regression, are based on regularity assumptions that are often too strong to adequately describe the distribution of beta values. Results: We develop a statistical model for the analysis of beta values that is derived from a bivariate gamma distribution for the signal intensities M and U. By allowing for possible correlations between M and U, the proposed model explicitly takes into account the data-generating process underlying the calculation of beta values. Using simulated data and a real sample of DNA methylation data from the Heinz Nixdorf Recall cohort study, we demonstrate that the proposed model fits our data significantly better than beta regression and M-value regression. Conclusion: The proposed model contributes to an improved identification of associations between beta values and covariates such as clinical variables and lifestyle factors in epigenome-wide association studies. It is as easy to apply to a sample of beta values as beta regression and M-value regression.
Impact Factor
Scopus SNIP
Web of Science
Times Cited
Scopus
Cited By
Altmetric
Publikationstyp
Artikel: Journalartikel
Dokumenttyp
Wissenschaftlicher Artikel
Typ der Hochschulschrift
Herausgeber
Schlagwörter
Bounded Response Variables ; Dna Methylation ; Gamma Regression ; Gradient Boosting ; Human Methylation450k Bead Chip; Cpg Sites; Wide Analysis; Microarray; Disease; Prediction; Regression; Location; Pipeline; Package; Cancer
Keywords plus
Sprache
englisch
Veröffentlichungsjahr
2016
Prepublished im Jahr
HGF-Berichtsjahr
2016
ISSN (print) / ISBN
1471-2105
e-ISSN
1471-2105
ISBN
Bandtitel
Konferenztitel
Konferzenzdatum
Konferenzort
Konferenzband
Quellenangaben
Band: 17,
Heft: ,
Seiten: ,
Artikelnummer: 480
Supplement: ,
Reihe
Verlag
Biomed Central Ltd
Verlagsort
London
Tag d. mündl. Prüfung
0000-00-00
Betreuer
Gutachter
Prüfer
Topic
Hochschule
Hochschulort
Fakultät
Veröffentlichungsdatum
0000-00-00
Anmeldedatum
0000-00-00
Anmelder/Inhaber
weitere Inhaber
Anmeldeland
Priorität
Begutachtungsstatus
Peer reviewed
Institut(e)
Institute of Epidemiology (EPI)
POF Topic(s)
30202 - Environmental Health
Forschungsfeld(er)
Genetics and Epidemiology
PSP-Element(e)
G-504091-001
Förderungen
Copyright
Erfassungsdatum
2016-12-31