Maura presents our new paper on “Efficient permutation-based genome-wide association studies for normal and skewed phenotypic distributions” at the 21st European Conference on Computational Biology (ECCB), the largest bioinformatics conference in Europe.

 

 

Abstract

Motivation Genome-wide association studies (GWAS) are an integral tool for studying the architecture of complex genotype and phenotype relationships. Linear mixed models (LMMs) are commonly used to detect associations between genetic markers and a trait of interest, while at the same time allowing to account for population structure and cryptic relatedness. Assumptions of LMMs include a normal distribution of the residuals and that the genetic markers are independent and identically distributed—both assumptions are often violated in real data. Permutation-based methods can help to overcome some of these limitations and provide more realistic thresholds for the discovery of true associations. Still, in practice, they are rarely implemented due to the high computational complexity.
Results We propose permGWAS, an efficient LMM reformulation based on 4D tensors that can provide permutation-based significance thresholds. We show that our method outperforms current state-of-the-art LMMs with respect to runtime and that permutation-based thresholds have lower false discovery rates for skewed phenotypes compared to the commonly used Bonferroni threshold. Furthermore, using permGWAS we re-analyzed more than 500 Arabidopsis thaliana phenotypes with 100 permutations each in less than 8 days on a single GPU. Our re-analyses suggest that applying a permutation-based threshold can improve and refine the interpretation of GWAS results.
 
Availability and implementation permGWAS is open-source and publicly available on GitHub for download: https://github.com/grimmlab/permGWAS.
 
Supplementary information Supplementary data are available at Bioinformatics online.

 

Original Publication (Open Access)
Maura John, Markus J Ankenbrand, Carolin Artmann, Jan A Freudenthal, Arthur Korte, Dominik G Grimm
Efficient permutation-based genome-wide association studies for normal and skewed phenotypic distributions, Bioinformatics, Volume 38, September 2022, Pages ii5–ii12, https://doi.org/10.1093/bioinformatics/btac455