Abstract
Population structure, including population stratification and cryptic relatedness, can cause spurious associations in genome-wide association studies (GWAS). Such effects are usually assessed with the scaled median or mean association test statistic calculated from many single-nucleotide polymorphisms (SNPs) across the genome, and ‘genomic control’ can then be applied to adjust the test statistics at individual loci by a genomic inflation factor. Published GWAS have clearly shown that many loci underlie genetic variation for a wide range of complex diseases and traits, implying that a substantial proportion of the genome should show inflation of the test statistic. Here, we show by theory, simulation and analysis of data that substantial genomic inflation is expected under polygenic inheritance, even in the absence of population structure and other technical artefacts. Its magnitude depends on sample size, heritability, linkage disequilibrium structure and the number of causal variants. Our predictions are consistent with empirical observations on height in independent samples of ∼4000 and ∼133 000 individuals.
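As a rough illustration of the scaled median statistic and the genomic control adjustment described above, the following Python sketch computes an inflation factor from a list of 1-degree-of-freedom chi-square association statistics and then rescales them. It is a minimal sketch under stated assumptions, not the procedure used in the article: the function names and the `simulate_chi2` helper are hypothetical, the statistics are simulated rather than real GWAS output, the inflation is imposed as a uniform scaling rather than arising from polygenic effects, and flooring the factor at 1 before adjustment is a choice made for this sketch.

```python
import random
from statistics import NormalDist, median

# Median of a chi-square distribution with 1 df, obtained as the square of
# the 75th percentile of a standard normal (about 0.455).
CHI2_1DF_MEDIAN = NormalDist().inv_cdf(0.75) ** 2


def genomic_inflation_factor(chi2_stats):
    """Scaled median: observed median statistic over its expected null median."""
    return median(chi2_stats) / CHI2_1DF_MEDIAN


def genomic_control(chi2_stats):
    """Divide every statistic by the inflation factor.

    Assumption of this sketch: the factor is floored at 1 so that
    statistics are never adjusted upwards.
    """
    lam = max(1.0, genomic_inflation_factor(chi2_stats))
    return [x / lam for x in chi2_stats]


def simulate_chi2(n_snps, scale=1.0, seed=0):
    """Hypothetical helper: squared standard normals multiplied by `scale`."""
    rng = random.Random(seed)
    return [scale * rng.gauss(0.0, 1.0) ** 2 for _ in range(n_snps)]


null_stats = simulate_chi2(100_000)                 # no inflation
inflated_stats = simulate_chi2(100_000, scale=1.2)  # uniform 20% inflation

print("lambda (null):          ", round(genomic_inflation_factor(null_stats), 3))
print("lambda (inflated):      ", round(genomic_inflation_factor(inflated_stats), 3))
print("lambda (after control): ",
      round(genomic_inflation_factor(genomic_control(inflated_stats)), 3))
```

Because the adjustment divides every locus by the same factor, it cannot distinguish inflation caused by population structure from inflation caused by many true polygenic signals, which is the situation the abstract addresses.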
Acknowledgements
We thank all three reviewers for helpful comments. We acknowledge funding from the Australian National Health and Medical Research Council (NHMRC Grants 389891, 389892, 613672 and 613601), the Australian Research Council (ARC Grants DP0770096 and DP1093900) and the US National Institutes of Health (NIH Grants AA13320, AA13321 and DA12854).
Ethics declarations
Competing interests
The authors declare no conflict of interest.
About this article
Cite this article
Yang, J., Weedon, M., Purcell, S. et al. Genomic inflation factors under polygenic inheritance. Eur J Hum Genet 19, 807–812 (2011). https://doi.org/10.1038/ejhg.2011.39
DOI: https://doi.org/10.1038/ejhg.2011.39