Document Type


Publication Date



common disease, epilepsy, epistasis, genome-wide association study, GWAS, Phase III trials, ras-pathway, u-statistics


Background: Genome Wide Association Studies (GWAS) have had limited success when applied to complex diseases. Analyzing SNPs individually requires several large studies to integrate the often divergent results. In the presence of epistasis, multivariate approaches based on the linear model (including stepwise logistic regression) often have low sensitivity and generate an abundance of artifacts. Methods: Recent advances in distributed and parallel processing spurred methodological advances in non-parametric statistics. U-statistics for multivariate data (μStat) are not confounded by unrealistic assumptions (linearity, independence). Results: By incorporating knowledge about relationships between SNPs, μGWAS (GWAS based on μStat) can identify clusters of genes around biologically relevant pathways and pinpoint functionally relevant regions within these genes. Conclusion: With this computational biostatistics approach increasing power and guarding against artifacts, personalized medicine and comparative effectiveness will advance while subgroup analyses of Phase III trials can now suggest risk factors for adverse events and novel directions for drug development.


Future Medicine

RU Laboratory

Krueger Laboratory, Hospital Biostatistics

Permanent URL

Included in

Life Sciences Commons