Molnos, S. ; Baumbach, C. ; Wahl, S. ; Müller-Nurasyid, M. ; Strauch, K. ; Wang-Sattler, R. ; Waldenberger, M. ; Meitinger, T. ; Adamski, J. ; Kastenmüller, G. ; Suhre, K. ; Peters, A. ; Grallert, H. ; Theis, F.J. ; Gieger, C.
pulver: An R package for parallel ultra-rapid p-value computation for linear regression interaction terms.
BMC Bioinformatics 18:429 (2017)
Background Genome-wide association studies allow us to understand the genetics of complex diseases. Human metabolism provides information about the disease-causing mechanisms, so it is usual to investigate the associations between genetic variants and metabolite levels. However, only considering genetic variants and their effects on one trait ignores the possible interplay between different “omics” layers. Existing tools only consider single-nucleotide polymorphism (SNP)–SNP interactions, and no practical tool is available for large-scale investigations of the interactions between pairs of arbitrary quantitative variables. Results We developed an R package called pulver to compute p-values for the interaction term in a very large number of linear regression models. Comparisons based on simulated data showed that pulver is much faster than the existing tools. This is achieved by using the correlation coefficient to test the null-hypothesis, which avoids the costly computation of inversions. Additional tricks are a rearrangement of the order, when iterating through the different “omics” layers, and implementing this algorithm in the fast programming language C++. Furthermore, we applied our algorithm to data from the German KORA study to investigate a real-world problem involving the interplay among DNA methylation, genetic variants, and metabolite levels. Conclusions The pulver package is a convenient and rapid tool for screening huge numbers of linear regression models for significant interaction terms in arbitrary pairs of quantitative variables. pulver is written in R and C++, and can be downloaded freely from CRAN at https://cran.r-project.org/web/packages/pulver/.
Impact Factor
Scopus SNIP
Web of Science
Times Cited
Scopus
Cited By
Altmetric
Publication type
Article: Journal article
Document type
Scientific Article
Thesis type
Editors
Keywords
Algorithm Linear regression interaction term SNP–CpG interaction Software; Genome-wide Association; Human Metabolism; Metabolomics; Genetics; Loci
Keywords plus
Language
english
Publication Year
2017
Prepublished in Year
HGF-reported in Year
2017
ISSN (print) / ISBN
1471-2105
e-ISSN
1471-2105
ISBN
Book Volume Title
Conference Title
Conference Date
Conference Location
Proceedings Title
Quellenangaben
Volume: 18,
Issue: 1,
Pages: ,
Article Number: 429
Supplement: ,
Series
Publisher
Biomed Central Ltd
Publishing Place
London
Day of Oral Examination
0000-00-00
Advisor
Referee
Examiner
Topic
University
University place
Faculty
Publication date
0000-00-00
Application date
0000-00-00
Patent owner
Further owners
Application country
Patent priority
Reviewing status
Peer reviewed
POF-Topic(s)
30202 - Environmental Health
30501 - Systemic Analysis of Genetic and Environmental Factors that Impact Health
30201 - Metabolic Health
30505 - New Technologies for Biomedical Discoveries
30205 - Bioengineering and Digital Health
Research field(s)
Genetics and Epidemiology
Enabling and Novel Technologies
PSP Element(s)
G-504091-004
G-504100-001
G-504091-003
G-504091-001
G-504091-002
G-504000-001
G-500700-001
G-505600-003
G-503700-001
G-503800-001
G-504090-001
Grants
Copyright
Erfassungsdatum
2017-10-11