Physiology Helps GWAS Take a Step Closer to Mechanism
Genome-wide association studies (GWAS) have been extremely successful at identifying replicable associations between common genetic variants and type 2 diabetes risk. The latest studies, including 35,000 European (1), 7,000 East Asian (2), 5,500 South Asian (3), and most recently 3,800 Latin American (4) and 6,000 Japanese (5) type 2 diabetes cases, bring the total number of associated variants to more than 70. There is strong evidence that many of the associated genetic variants lie in or close to genes important in type 2 diabetes etiology (e.g., the regions of the genome identified by GWAS are enriched for monogenic diabetes genes, such as HNF1A, HNF1B, and PPARG, and small noncoding regions of the genome [enhancers] critical for islet-specific gene expression ). Nevertheless, the field has not moved from genetic associations to improved understanding of biology as quickly or as often as hoped.
In this issue, Dimas et al. (7) present data that move the field a step closer to mechanisms. They tested the hypothesis that a systematic analysis of insulin secretion and insulin resistance measures in nondiabetic individuals would improve the understanding of the intermediate mechanisms by which genetic variants predispose to type 2 diabetes. They performed the most extensive analysis yet to group variants into categories based on their likely intermediate mechanism. The authors combined data from thousands of individuals with fasting-, oral-, and intravenous-based measures of insulin secretion and resistance. This approach has been used before for a smaller number of loci and individuals (8), but here the authors added a statistical clustering approach to provide the most robust categorization of the type 2 diabetes variants. This clustering analysis successfully binned 16 type 2 diabetes risk variants into four broad groups. Four variants fitted a clear insulin resistance pattern, two reduced insulin secretion with fasting hyperglycemia, nine reduced insulin secretion with normal fasting glycemia, and one altered insulin processing. A further 20 variants did not fit a clear physiological category, probably because of their relatively weaker effects on type 2 diabetes risk. These findings move the field forward by providing functional biologists with more information about where to start their experiments.
It could be argued that there are few surprises among the findings. For example, the variants categorized as insulin resistance, those near PPARG, IRS1, GCKR, and KLF11, either lie near genes with clear roles in insulin resistance (PPARG, IRS1) or are associated with insulin resistance–related measures (GCKR  and KLF11 ). Likewise, studies have established that diabetes risk alleles in or near TCF7L2 and SLC30A8 are associated with reduced insulin secretory capacity in response to a glucose challenge (8). Nevertheless, the data provide the strongest evidence yet that those genetic variants in or near (but not necessarily functioning through) the PROX1, TMEM, CDKAL1, CDKN2A/B, THADA, HHEX/IDE, and ADCY5 genes operate primarily through an insulin secretory defect. Dimas et al. also highlight the intriguing pattern of associations observed with the variant near ARAP1. Previously noted in a genome-wide study of proinsulin levels (11), Dimas et al. categorize this variant as “insulin processing.” Most type 2 diabetes risk alleles are associated with raised proinsulin levels, and this is in keeping with the epidemiologic associations (12). In contrast, the type 2 diabetes risk allele in the ARAP1 locus is associated with reduced proinsulin levels relative to insulin levels in the fasting state. The underlying explanation of this paradoxical finding is still not known.
The study reemphasizes some old questions and raises some new questions. Notably, why is it so difficult to assign an intermediate physiological mechanism to alleles clearly associated with type 2 diabetes? Despite using up to 58,000 individuals with fasting measures, 11,000 with oral glucose tolerance tests (OGTTs), and 4,600 with intravenous-based measures including 2,600 with euglycemic-hyperinsulinemic clamps, Dimas et al. were only able to group 16 of the 37 strongest type 2 diabetes risk variants into recognizable categories. Intuitively, a genetic risk variant should be associated with the trait that leads to type 2 diabetes more strongly than diabetes, but this rarely seems to be the case. Sample size is unlikely to be to blame—most of the diabetes risk alleles studied were discovered using less than 12,000 cases. One of the most likely explanations is measurement error. Insulin and, to a lesser extent, glucose vary within individuals much more than type 2 diabetes status. Imprecision in measuring intermediate physiology will reduce statistical power to detect effects (13), and it is noticeable that 12 of the 16 classifiable variants are among the 22 with the strongest odds ratios for type 2 diabetes. It may be that studies of 12,000 individuals at the ∼5% extreme end of the population with the poorest β-cell function and highest insulin resistance (those with diabetes) may be much more powerful than studies of 12,000 individuals from the remaining 95% of the distribution.
The explanation of the unclassified associations may not be just lack of statistical power. It is also an intriguing possibility that some variants may not play a role in altering physiology in normal individuals (at least as assessed by the submaximal tests performed in these studies), but may be important in influencing initial β-cell mass or the rate of deterioration of β-cell function once diabetes develops. These possible alternatives may explain why some of the variants lying close to the known monogenic β-cell genes, including HNF1A, HNF1B, WFS1, and KCNJ11, were not grouped into clear physiological categories.
Also worth highlighting are the common genetic variants noticeable by their absence. Variants in or near the G6PC2 and MADD (11) genes are among those with the strongest effects on fasting glycemia and proinsulin levels, respectively, and yet are not associated (even nominally) with type 2 diabetes. Hence, these variants were not included in Dimas et al., but further study of these variants and genes could improve knowledge of β-cell function. Since the GWAS finding, the G6PC2 gene (also called islet-specific glucose-6-phosphatase–related protein [IGRP]) has been the subject of renewed interest (14).
These large studies of subtle physiological effects also allow examination of the rather crude tools available to examine intermediate traits in large numbers of individuals. It is clear that in normoglycemic individuals the vast majority of variation in derived homeostasis model assessment of β-cell function is explained by fasting glucose, and this model adds little. Interestingly, there is not a clear increase in precision when intravenous glucose tolerance tests are used rather than OGTT-derived indices. It would be interesting to examine if the more sophisticated modeling of OGTT data using deconvolution of C-peptide could give new insights (15).
Perhaps the most important general message emerging from Dimas et al. (7) is that the type 2 diabetes GWAS field needs scientists from other areas—those with expertise in physiology, cell biology, and functional biology—to carefully inspect the 70 loci, the associated phenotypes, and the nearby genes. By way of example, follow-up of the variants and genes in the loci labeled as PROX1, TMEM, CDKAL1, CDKN2A/B, THADA, HHEX/IDE, and ADCY5 should target insulin secretion mechanisms.
Duality of Interest. No potential conflicts of interest relevant to this article were reported.
See accompanying article, p. 2158.
- © 2014 by the American Diabetes Association.
Readers may use this article as long as the work is properly cited, the use is educational and not for profit, and the work is not altered. See http://creativecommons.org/licenses/by-nc-nd/3.0/ for details.