Association_tests

Summary Table

NAME	CATEGORY	CITATION
CC-GWAS	Case-case GWAS	Peyrot, W. J., & Price, A. L. (2021). Identifying loci with different allele frequencies among cases of eight psychiatric disorders using CC-GWAS. Nature genetics, 53(4), 445-454.
TrajGWAS	GWAS of longitudinal trajectories	Ko, S., German, C. A., Jensen, A., Shen, J., Wang, A., Mehrotra, D. V., ... & Zhou, J. J. (2022). GWAS of longitudinal trajectories at biobank scale. The American Journal of Human Genetics, 109(3), 433-445.
GWAX	GWAS using family history	Liu, J. Z., Erlich, Y., & Pickrell, J. K. (2017). Case–control association mapping by proxy using family history of disease. Nature genetics, 49(3), 325-331.
LT-FH	GWAS using family history	Liu, J. Z., Erlich, Y., & Pickrell, J. K. (2017). Case–control association mapping by proxy using family history of disease. Nature genetics, 49(3), 325-331.
SiblingGWAS	GWAS using family history	Howe, L. J., Nivard, M. G., Morris, T. T., Hansen, A. F., Rasheed, H., Cho, Y., ... & Davies, N. M. (2022). Within-sibship genome-wide association analyses decrease bias in estimates of direct genetic effects. Nature genetics, 54(5), 581-592.
snipar	GWAS using family history	Young, A. I., Nehzati, S. M., Benonisdottir, S., Okbay, A., Jayashankar, H., Lee, C., ... & Kong, A. (2022). Mendelian imputation of parental genotypes improves estimates of direct genetic effects. Nature genetics, 54(6), 897-905. Guan, J., Nehzati, S. M., Benjamin, D. J., & Young, A. I. (2022). Novel estimators for family-based genome-wide association studies increase power and robustness. bioRxiv, 2022-10.
REGENIE	Gene-based analysis (rare variant)	Mbatchou, Joelle, et al. "Computationally efficient whole-genome regression for quantitative and binary traits." Nature genetics 53.7 (2021): 1097-1103.
SAIGE-GENE+	Gene-based analysis (rare variant)	Zhou, Wei, et al. "SAIGE-GENE+ improves the efficiency and accuracy of set-based rare variant association tests." Nature Genetics (2022): 1-4.
SAIGE-GENE	Gene-based analysis (rare variant)	Zhou, Wei, et al. "Scalable generalized linear mixed model for region-based association tests in large biobanks and cohorts." Nature genetics 52.6 (2020): 634-639.
SKAT-O	Gene-based analysis (rare variant)	Lee, Seunggeun, Michael C. Wu, and Xihong Lin. "Optimal tests for rare variant effects in sequencing association studies." Biostatistics 13.4 (2012): 762-775.
SKAT	Gene-based analysis (rare variant)	Wu, Michael C., et al. "Rare-variant association testing for sequencing data with the sequence kernel association test." The American Journal of Human Genetics 89.1 (2011): 82-93.
STAAR	Gene-based analysis (rare variant)	Li, Xihao, et al. "Dynamic incorporation of multiple in silico functional annotations empowers rare variant association analysis of large whole-genome sequencing studies at scale." Nature genetics 52.9 (2020): 969-983.
PGS-adjusted GWAS	PGS-adjusted GWAS	Campos, A. I., Namba, S., Lin, S. C., Nam, K., Sidorenko, J., Wang, H., ... & Yengo, L. (2023). Boosting the power of genome-wide association studies within and across ancestries by using polygenic scores. Nature Genetics, 1-8.
PGS-adjusted RVATs	PGS-adjusted GWAS	Jurgens, S. J., Pirruccello, J. P., Choi, S. H., Morrill, V. N., Chaffin, M., Lubitz, S. A., ... & Ellinor, P. T. (2023). Adjusting for common variant polygenic scores improves yield in rare variant association analyses. Nature Genetics, 55(4), 544-548.
Review-Povysil	Review	Povysil, Gundula, et al. "Rare-variant collapsing analyses for complex traits: guidelines and applications." Nature Reviews Genetics 20.12 (2019): 747-759.
BOLT-lMM	Single variant association tests	Loh, Po-Ru, et al. "Efficient Bayesian mixed-model analysis increases association power in large cohorts." Nature genetics 47.3 (2015): 284-290.
EMMAX	Single variant association tests
GEMMA	Single variant association tests	Zhou, Xiang, and Matthew Stephens. "Genome-wide efficient mixed-model analysis for association studies." Nature genetics 44.7 (2012): 821-824.
PLINK2	Single variant association tests	Chang, Christopher C., et al. "Second-generation PLINK: rising to the challenge of larger and richer datasets." Gigascience 4.1 (2015): s13742-015.
PLINK	Single variant association tests	Purcell, Shaun, et al. "PLINK: a tool set for whole-genome association and population-based linkage analyses." The American journal of human genetics 81.3 (2007): 559-575.
POLMM	Single variant association tests	Bi, W., Zhou, W., Dey, R., Mukherjee, B., Sampson, J. N., & Lee, S. (2021). Efficient mixed model approach for large-scale genome-wide association studies of ordinal categorical phenotypes. The American Journal of Human Genetics, 108(5), 825-839.
REGENIE	Single variant association tests	Mbatchou, Joelle, et al. "Computationally efficient whole-genome regression for quantitative and binary traits." Nature genetics 53.7 (2021): 1097-1103.
SAIGE	Single variant association tests	Zhou, Wei, et al. "Efficiently controlling for case-control imbalance and sample relatedness in large-scale genetic association studies." Nature genetics 50.9 (2018): 1335-1341.
fastGWA-GLMM	Single variant association tests	Jiang, Longda, et al. "A generalized linear mixed model association tool for biobank-scale data." Nature genetics 53.11 (2021): 1616-1621.
fastGWA	Single variant association tests	Jiang, Longda, et al. "A resource-efficient tool for mixed model association analysis of large-scale data." Nature genetics 51.12 (2019): 1749-1755.

Case-case GWAS

CC-GWAS

NAME : CC-GWAS
FULL NAME : case–case genome-wide association study
SHORT NAME : CC-GWAS
DESCRIPTION : The CCGWAS R package provides a tool for case-case association testing of two different disorders based on their respective case-control GWAS results
URL : https://github.com/wouterpeyrot/CCGWAS
CITATION : Peyrot, W. J., & Price, A. L. (2021). Identifying loci with different allele frequencies among cases of eight psychiatric disorders using CC-GWAS. Nature genetics, 53(4), 445-454.
CATEGORY : Case-case GWAS

GWAS of longitudinal trajectories

TrajGWAS

NAME : TrajGWAS
FULL NAME : GWAS of longitudinal trajectories
SHORT NAME : TrajGWAS
DESCRIPTION : TrajGWAS.jl is a Julia package for performing genome-wide association studies (GWAS) for continuous longitudinal phenotypes using a modified linear mixed effects model. It builds upon the within-subject variance estimation by robust regression (WiSER) method and can be used to identify variants associated with changes in the mean and within-subject variability of the longitduinal trait.
URL : https://github.com/OpenMendel/TrajGWAS.jl
CITATION : Ko, S., German, C. A., Jensen, A., Shen, J., Wang, A., Mehrotra, D. V., ... & Zhou, J. J. (2022). GWAS of longitudinal trajectories at biobank scale. The American Journal of Human Genetics, 109(3), 433-445.
CATEGORY : GWAS of longitudinal trajectories
KEYWORDS : biomarker trajectories, mean, within-subject (WS) variability, linear mixed effect model, within-subject variance estimation by robust regression (WiSER) method
YEAR : 2022

GWAS using family history

GWAX

NAME : GWAX
FULL NAME : genome-wide association by proxy
SHORT NAME : GWAX
DESCRIPTION : In randomly ascertained cohorts, replacing cases with their first-degree relatives enables studies of diseases that are absent (or nearly absent) in the cohort.
CITATION : Liu, J. Z., Erlich, Y., & Pickrell, J. K. (2017). Case–control association mapping by proxy using family history of disease. Nature genetics, 49(3), 325-331.
CATEGORY : GWAS using family history

LT-FH

NAME : LT-FH
FULL NAME : liability threshold model, conditional on case–control status and family history
SHORT NAME : LT-FH
DESCRIPTION : an association method based on posterior mean genetic liabilities under a liability threshold model, conditional on case-control status and family history (LT-FH)
URL : https://alkesgroup.broadinstitute.org/UKBB/LTFH/
CITATION : Liu, J. Z., Erlich, Y., & Pickrell, J. K. (2017). Case–control association mapping by proxy using family history of disease. Nature genetics, 49(3), 325-331.
CATEGORY : GWAS using family history

SiblingGWAS

NAME : SiblingGWAS
FULL NAME : Within-sibship genome-wide association analyses
SHORT NAME : SiblingGWAS
DESCRIPTION : Scripts for running GWAS using siblings to estimate Within-Family (WF) and Between-Family (BF) effects of genetic variants on continuous traits. Allows the inclusion of more than two siblings from one family.
URL : https://github.com/LaurenceHowe/SiblingGWAS
CITATION : Howe, L. J., Nivard, M. G., Morris, T. T., Hansen, A. F., Rasheed, H., Cho, Y., ... & Davies, N. M. (2022). Within-sibship genome-wide association analyses decrease bias in estimates of direct genetic effects. Nature genetics, 54(5), 581-592.
CATEGORY : GWAS using family history
YEAR : 2022

snipar

NAME : snipar
FULL NAME : single nucleotide imputation of parents
SHORT NAME : snipar
DESCRIPTION : snipar (single nucleotide imputation of parents) is a Python package for inferring identity-by-descent (IBD) segments shared between siblings, imputing missing parental genotypes, and for performing family based genome-wide association and polygenic score analyses using observed and/or imputed parental genotypes.
URL : https://github.com/AlexTISYoung/snipar
CITATION : Young, A. I., Nehzati, S. M., Benonisdottir, S., Okbay, A., Jayashankar, H., Lee, C., ... & Kong, A. (2022). Mendelian imputation of parental genotypes improves estimates of direct genetic effects. Nature genetics, 54(6), 897-905. Guan, J., Nehzati, S. M., Benjamin, D. J., & Young, A. I. (2022). Novel estimators for family-based genome-wide association studies increase power and robustness. bioRxiv, 2022-10.
CATEGORY : GWAS using family history
YEAR : 2022

Gene-based analysis (rare variant)

REGENIE

NAME : REGENIE
FULL NAME : REGENIE
SHORT NAME : REGENIE
DESCRIPTION : regenie is a C++ program for whole genome regression modelling of large genome-wide association studies. It is developed and supported by a team of scientists at the Regeneron Genetics Center.
URL : https://github.com/rgcgithub/regenie
CITATION : Mbatchou, Joelle, et al. "Computationally efficient whole-genome regression for quantitative and binary traits." Nature genetics 53.7 (2021): 1097-1103.
CATEGORY : Gene-based analysis (rare variant)
KEYWORDS : whole genome regression

SAIGE-GENE

NAME : SAIGE-GENE
FULL NAME : SAIGE-GENE
SHORT NAME : SAIGE-GENE
URL : https://github.com/weizhouUMICH/SAIGE
CITATION : Zhou, Wei, et al. "Scalable generalized linear mixed model for region-based association tests in large biobanks and cohorts." Nature genetics 52.6 (2020): 634-639.
CATEGORY : Gene-based analysis (rare variant)

SAIGE-GENE+

NAME : SAIGE-GENE+
FULL NAME : SAIGE-GENE+
SHORT NAME : SAIGE-GENE+
URL : https://github.com/weizhouUMICH/SAIGE
CITATION : Zhou, Wei, et al. "SAIGE-GENE+ improves the efficiency and accuracy of set-based rare variant association tests." Nature Genetics (2022): 1-4.
CATEGORY : Gene-based analysis (rare variant)

SKAT

NAME : SKAT
FULL NAME : sequence kernel association test
SHORT NAME : SKAT
DESCRIPTION : SKAT is a SNP-set (e.g., a gene or a region) level test for association between a set of rare (or common) variants and dichotomous or quantitative phenotypes, SKAT aggregates individual score test statistics of SNPs in a SNP set and efficiently computes SNP-set level p-values, e.g. a gene or a region level p-value, while adjusting for covariates, such as principal components to account for population stratification. SKAT also allows for power/sample size calculations for designing for sequence association studies.
URL : https://www.hsph.harvard.edu/skat/
CITATION : Wu, Michael C., et al. "Rare-variant association testing for sequencing data with the sequence kernel association test." The American Journal of Human Genetics 89.1 (2011): 82-93.
CATEGORY : Gene-based analysis (rare variant)
KEY WORDS :

SKAT-O

NAME : SKAT-O
FULL NAME : sequence kernel association test - optimal test
SHORT NAME : SKAT-O
DESCRIPTION : estimating the correlation parameter in the kernel matrix to maximize the power, which corresponds to the estimated weight in the linear combination of the burden test and SKAT test statistics that maximizes power.
URL : https://www.hsph.harvard.edu/skat/
CITATION : Lee, Seunggeun, Michael C. Wu, and Xihong Lin. "Optimal tests for rare variant effects in sequencing association studies." Biostatistics 13.4 (2012): 762-775.
CATEGORY : Gene-based analysis (rare variant)

STAAR

NAME : STAAR
FULL NAME : variant-set test for association using annotation information
SHORT NAME : STAAR
DESCRIPTION : STAAR is an R package for performing variant-Set Test for Association using Annotation infoRmation (STAAR) procedure in whole-genome sequencing (WGS) studies. STAAR is a general framework that incorporates both qualitative functional categories and quantitative complementary functional annotations using an omnibus multi-dimensional weighting scheme. STAAR accounts for population structure and relatedness, and is scalable for analyzing large WGS studies of continuous and dichotomous traits.
URL : https://github.com/xihaoli/STAAR
CITATION : Li, Xihao, et al. "Dynamic incorporation of multiple in silico functional annotations empowers rare variant association analysis of large whole-genome sequencing studies at scale." Nature genetics 52.9 (2020): 969-983.
CATEGORY : Gene-based analysis (rare variant)
KEYWORDS : functional annotations

PGS-adjusted GWAS

NAME : PGS-adjusted GWAS
FULL NAME : PGS-adjusted GWAS
SHORT NAME : PGS-adjusted GWAS
DESCRIPTION : adjustment of GWAS analyses for polygenic scores (PGSs) increases the statistical power for discovery across all ancestries
CITATION : Campos, A. I., Namba, S., Lin, S. C., Nam, K., Sidorenko, J., Wang, H., ... & Yengo, L. (2023). Boosting the power of genome-wide association studies within and across ancestries by using polygenic scores. Nature Genetics, 1-8.
CATEGORY : PGS-adjusted GWAS
KEYWORDS : LOCO-PGSs, two-stage meta-analysis strategy
YEAR : 2023

PGS-adjusted RVATs

NAME : PGS-adjusted RVATs
FULL NAME : PGS-adjusted rare variant association tests
SHORT NAME : PGS-adjusted RVATs
DESCRIPTION : adjusting for common variant polygenic scores improves yield in gene-based rare variant association tests
CITATION : Jurgens, S. J., Pirruccello, J. P., Choi, S. H., Morrill, V. N., Chaffin, M., Lubitz, S. A., ... & Ellinor, P. T. (2023). Adjusting for common variant polygenic scores improves yield in rare variant association analyses. Nature Genetics, 55(4), 544-548.
CATEGORY : PGS-adjusted GWAS
KEYWORDS : PGS, Rare variants
YEAR : 2023

Review

Review-Povysil

NAME : Review-Povysil
CITATION : Povysil, Gundula, et al. "Rare-variant collapsing analyses for complex traits: guidelines and applications." Nature Reviews Genetics 20.12 (2019): 747-759.
CATEGORY : Review

Single variant association tests

BOLT-lMM

NAME : BOLT-lMM
FULL NAME : BOLT-lMM
SHORT NAME : BOLT-lMM
DESCRIPTION : The BOLT-LMM software package currently consists of two main algorithms, the BOLT-LMM algorithm for mixed model association testing, and the BOLT-REML algorithm for variance components analysis (i.e., partitioning of SNP-heritability and estimation of genetic correlations).
URL : https://alkesgroup.broadinstitute.org/BOLT-LMM/BOLT-LMM_manual.html
CITATION : Loh, Po-Ru, et al. "Efficient Bayesian mixed-model analysis increases association power in large cohorts." Nature genetics 47.3 (2015): 284-290.
CATEGORY : Single variant association tests
KEYWORDS : non-infinitesimal model, mixture of two Gaussian distributions

EMMAX

NAME : EMMAX
FULL NAME : efficient mixed-model association eXpedited
SHORT NAME : EMMAX
DESCRIPTION : EMMAX is a statistical test for large scale human or model organism association mapping accounting for the sample structure. In addition to the computational efficiency obtained by EMMA algorithm, EMMAX takes advantage of the fact that each loci explains only a small fraction of complex traits, which allows us to avoid repetitive variance component estimation procedure, resulting in a significant amount of increase in computational time of association mapping using mixed model.
URL : https://genome.sph.umich.edu/wiki/EMMAX
CITATION :
CATEGORY : Single variant association tests

GEMMA

NAME : GEMMA
FULL NAME : genome-wide efficient mixed-model association
SHORT NAME : GEMMA
DESCRIPTION : GEMMA is the software implementing the Genome-wide Efficient Mixed Model Association algorithm for a standard linear mixed model and some of its close relatives for genome-wide association studies (GWAS). It fits a standard linear mixed model (LMM) to account for population stratification and sample structure for single marker association tests. It fits a Bayesian sparse linear mixed model (BSLMM) using Markov chain Monte Carlo (MCMC) for estimating the proportion of variance in phenotypes explained (PVE) by typed genotypes (i.e. chip heritability), predicting phenotypes, and identifying associated markers by jointly modeling all markers while controlling for population structure. It is computationally efficient for large scale GWAS and uses freely available open-source numerical libraries.
URL : http://stephenslab.uchicago.edu/software.html#gemma
CITATION : Zhou, Xiang, and Matthew Stephens. "Genome-wide efficient mixed-model analysis for association studies." Nature genetics 44.7 (2012): 821-824.
CATEGORY : Single variant association tests

PLINK

NAME : PLINK
FULL NAME : PLINK
SHORT NAME : PLINK
DESCRIPTION : A Tool Set for Whole-Genome Association and Population-Based Linkage Analyses. PLINK is a free, open-source whole genome association analysis toolset, designed to perform a range of basic, large-scale analyses in a computationally efficient manner. The focus of PLINK is purely on analysis of genotype/phenotype data, so there is no support for steps prior to this (e.g. study design and planning, generating genotype or CNV calls from raw data). Through integration with gPLINK and Haploview, there is some support for the subsequent visualization, annotation and storage of results.
URL : https://www.cog-genomics.org/plink/
CITATION : Purcell, Shaun, et al. "PLINK: a tool set for whole-genome association and population-based linkage analyses." The American journal of human genetics 81.3 (2007): 559-575.
CATEGORY : Single variant association tests

PLINK2

NAME : PLINK2
FULL NAME : PLINK2
SHORT NAME : PLINK2
URL : https://www.cog-genomics.org/plink/2.0/
CITATION : Chang, Christopher C., et al. "Second-generation PLINK: rising to the challenge of larger and richer datasets." Gigascience 4.1 (2015): s13742-015.
CATEGORY : Single variant association tests

POLMM

NAME : POLMM
FULL NAME : proportional odds logistic mixed model (POLMM)
SHORT NAME : POLMM
DESCRIPTION : Proportional Odds Logistic Mixed Model (POLMM) for ordinal categorical data analysis
URL : https://github.com/WenjianBI/POLMM
CITATION : Bi, W., Zhou, W., Dey, R., Mukherjee, B., Sampson, J. N., & Lee, S. (2021). Efficient mixed model approach for large-scale genome-wide association studies of ordinal categorical phenotypes. The American Journal of Human Genetics, 108(5), 825-839.
CATEGORY : Single variant association tests
KEY WORDS : ordinal categorical phenotypes

REGENIE

NAME : REGENIE
FULL NAME : REGENIE
SHORT NAME : REGENIE
DESCRIPTION : regenie is a C++ program for whole genome regression modelling of large genome-wide association studies. It is developed and supported by a team of scientists at the Regeneron Genetics Center.
URL : https://github.com/rgcgithub/regenie
CITATION : Mbatchou, Joelle, et al. "Computationally efficient whole-genome regression for quantitative and binary traits." Nature genetics 53.7 (2021): 1097-1103.
CATEGORY : Single variant association tests
KEY WORDS : whole genome regression

SAIGE

NAME : SAIGE
FULL NAME : Scalable and Accurate Implementation of GEneralized mixed model
SHORT NAME : SAIGE
DESCRIPTION : SAIGE is an R package with Scalable and Accurate Implementation of Generalized mixed model (Chen, H. et al. 2016). It accounts for sample relatedness and is feasible for genetic association tests in large cohorts and biobanks (N > 400,000). SAIGE performs single-variant association tests for binary traits and quantitative taits. For binary traits, SAIGE uses the saddlepoint approximation (SPA)(mhof, J. P. , 1961; Kuonen, D. 1999; Dey, R. et.al 2017) to account for case-control imbalance.
URL : https://github.com/weizhouUMICH/SAIGE
CITATION : Zhou, Wei, et al. "Efficiently controlling for case-control imbalance and sample relatedness in large-scale genetic association studies." Nature genetics 50.9 (2018): 1335-1341.
CATEGORY : Single variant association tests
KEYWORDS : case-control imbalance, saddlepoint approximation (SPA)

fastGWA

NAME : fastGWA
FULL NAME : fastGWA
SHORT NAME : fastGWA
URL : https://yanglab.westlake.edu.cn/software/gcta/#fastGWA
CITATION : Jiang, Longda, et al. "A resource-efficient tool for mixed model association analysis of large-scale data." Nature genetics 51.12 (2019): 1749-1755.
CATEGORY : Single variant association tests
KEY WORDS : grid-search-based REML algorithm

fastGWA-GLMM

NAME : fastGWA-GLMM
FULL NAME : fastGWA-GLMM
SHORT NAME : fastGWA-GLMM
URL : https://yanglab.westlake.edu.cn/software/gcta/#fastGWA
CITATION : Jiang, Longda, et al. "A generalized linear mixed model association tool for biobank-scale data." Nature genetics 53.11 (2021): 1616-1621.
CATEGORY : Single variant association tests