Quantitative Trait Analysis of Type 2 Diabetes Susceptibility Loci Identified From Whole Genome Association Studies in the Insulin Resistance Atherosclerosis Family Study
- Nicholette D. Palmer1,2,
- Mark O. Goodarzi3,
- Carl D. Langefeld4,
- Julie Ziegler4,
- Jill M. Norris5,
- Steven M. Haffner6,
- Michael Bryer-Ash7,8,
- Richard N. Bergman9,
- Lynne E. Wagenknecht4,
- Kent D. Taylor3,
- Jerome I. Rotter3 and
- Donald W. Bowden1,2,10
- 1Department of Biochemistry, Wake Forest University School of Medicine, Winston-Salem, North Carolina
- 2Center for Human Genomics, Wake Forest University School of Medicine, Winston-Salem, North Carolina
- 3Medical Genetics Institute, Cedars-Sinai Medical Center, Los Angeles, California
- 4Department of Public Health Sciences, Wake Forest University School of Medicine, Winston-Salem, North Carolina
- 5Department of Preventive Medicine and Biometrics, University of Colorado Health Sciences Center, Denver, Colorado
- 6Department of Medicine, University of Texas, Health Sciences Center at San Antonio, San Antonio, Texas
- 7Gonda (Goldschmied) Diabetes Center, David Geffen School of Medicine at UCLA, Los Angeles, California
- 8Division of Endocrinology, Diabetes, and Hypertension, David Geffen School of Medicine at UCLA, Los Angeles, California
- 9Department of Physiology and Biophysics, Keck School of Medicine, University of Southern California, Los Angeles, California
- 10Department of Internal Medicine, Wake Forest University School of Medicine, Winston-Salem, North Carolina
- Address correspondence reprint requests to Donald W. Bowden, PhD, Department of Biochemistry, Wake Forest University School of Medicine, Winston-Salem, NC 27157. E-mail:
OBJECTIVE—Evaluate type 2 diabetes susceptibility variants identified from genome-wide association studies in Hispanic Americans and African Americans from the Insulin Resistance Atherosclerosis Family Study (IRAS-FS) for association with quantitative measures of glucose homeostasis and determine their biological role in vivo.
RESEARCH DESIGN AND METHODS—Seventeen type 2 diabetes–associated single nucleotide polymorphisms (SNPs) were genotyped in 1,268 Hispanic- and 581 African-American participants from the IRAS-FS. SNPs were tested for association with quantitative measures of glucose homeostasis, including insulin sensitivity index (SI), acute insulin response (AIR), and disposition index.
RESULTS—Previously identified risk variants in cyclin-dependent kinase 5 regulatory subunit associated protein 1-like 1 (CDKAL1) were associated with reduced AIR (P < 0.0046) in Hispanic Americans. Additionally in Hispanic Americans, the variant in a hypothetical gene (chromosome 11; LOC387761) was significantly associated with AIR (P = 0.0046) with the risk allele showing protective effects, i.e., increased AIR. In both Hispanic- and African-American populations, risk variants at the solute carrier family 30, member 8 (SLC30A8) locus were nominally associated with decreased disposition index (P < 0.078). Risk variants in the insulin-like growth factor 2 mRNA-binding protein 2 (IGF2BP2) locus were associated with a decreased disposition index (P = 0.011) exclusively in Hispanic Americans.
CONCLUSIONS—These data indicate a distinct, limited number of diabetes-related genes, more specifically the SNPs in the genes identified in European-derived populations, with modest evidence for association with glucose homeostasis traits in Hispanic Americans and African Americans. We observe evidence that diabetes risk for CDKAL1, SLC30A8, IGF2BP2, and LOC387761 is specifically mediated through defects in insulin secretion. The mechanisms of other predisposing genes remain to be elucidated.
- AIR, acute insulin response
- ALX4, aristaless-like 4
- CDKAL1, cyclin-dependent kinase 5 regulatory subunit associated protein 1-like 1
- CDKN2B/CDKN2A, cyclin-dependent kinase inhibitor 2A/B
- EXT2, exostosin 2
- FSIGT, frequently sampled intravenous glucose tolerance test
- FTO, fat mass- and obesity-associated
- GWA, genome-wide association
- HHEX, hematopoietically expressed homeobox
- IDE, insulin-degrading enzyme
- IGF2BP2, insulin-like growth factor 2 mRNA-binding protein 2
- IRAS-FS, Insulin Resistance Atherosclerosis Family Study
- KIF11, kinesin family member 11
- MAF, minor allele frequency
- PKN2, protein kinase N2
- SI, insulin sensitivity index
- SLC30A8, solute carrier family 30, member 8
- SNP, single nucleotide polymorphism
Type 2 diabetes is a complex disease whose pathophysiology can be characterized by peripheral insulin resistance and reduced insulin secretion. Type 2 diabetes is a heritable disease (1), with multiple variants conferring modest risk to its polygenic inheritance (2). Prior investigations of the genetic determinants of type 2 diabetes have identified loci, each with relatively modest impacts on disease risk and whose impact has been difficult to replicate across studies (2).
Recent technical advances have facilitated genome-wide association (GWA) studies which can systematically and more comprehensively search the genome for disease susceptibility loci. Using this technique, novel etiological pathways of type 2 diabetes risk have been elucidated. Recently, four type 2 diabetes GWA studies have been reported (3–6). Taken together, these studies have identified 11 novel loci for involvement in type 2 diabetes susceptibility in European-derived populations. Of these 11 loci, 8 have been replicated across studies.
The purpose of this study was to evaluate variants within the 11 novel type 2 diabetes susceptibility loci identified from GWA studies in a large cohort of Hispanic Americans and African Americans from the Insulin Resistance Atherosclerosis Family Study (IRAS-FS). Quantitative trait analysis was performed to assess the impact of type 2 diabetes susceptibility variants identified in European-derived populations in these two ethnic minority populations. This analysis would enable the assessment of the metabolic pathway (i.e., insulin sensitivity or insulin secretion) through which these susceptibility genes act.
RESEARCH DESIGN AND METHODS
Study design, recruitment, and phenotyping for IRAS-FS have been described previously in detail (7). Briefly, the IRAS-FS is a multicenter study designed to identify the genetic determinants of quantitative measures of glucose homeostasis. Members of large families of self-reported Hispanic ancestry (n = 1,268 individuals in 92 pedigrees from San Antonio, Texas, and San Luis Valley, Colorado) and African Americans (n = 581 individuals in 42 pedigrees from Los Angeles, California) were recruited. A clinical examination was performed that included an interview, a frequently sampled intravenous glucose tolerance test (FSIGT), anthropometric measurements, and blood collection. Specific to this report, measures of glucose homeostasis included those from the FSIGT using the reduced sampling protocol (8–10) calculated by mathematical modeling methods (MINMOD) (11): insulin sensitivity index (SI), acute insulin response (AIR), and disposition index. Distributions of the primary phenotypes are listed in Table 1.
Seventeen single nucleotide polymorphisms (SNPs) from 11 unique loci identified from type 2 diabetes GWA studies (3–6) were selected for analysis. Genotyping was performed on the Sequenom MassArray Genotyping System. Seventy blind duplicates were included to evaluate genotyping accuracy.
Initially, each SNP was examined for Mendelian inconsistencies using PedCheck (12). Genotypes inconsistent with Mendelian inheritance were converted to missing. Maximum likelihood estimates of allele frequencies were computed using the largest set of unrelated Hispanic- and African-American individuals (n = 229 and 58, respectively), and then genotypes were tested for departures from Hardy-Weinberg proportions.
To test for association between individual SNPs and each quantitative phenotype, variance component analysis was performed as implemented in SOLAR (13). When necessary, quantitative traits were transformed to best approximate the distributional assumptions of the test and minimize heterogeneity of the variance. For each phenotype, the 2 degrees of freedom test of genotypic association was performed. In addition, three individual contrasts defined by a priori genetic models (dominant, additive, and recessive) were computed (i.e., dominant model contrasts those with the polymorphism versus those without, additive model tests for a dose effect in the number of alleles, and recessive model contrasts individuals homozygous for the polymorphisms versus not). If the overall genotypic association was significant, the a priori contrasts were examined directly. If the overall genotypic association was not significant, the a priori contrasts were examined after adjusting for the three comparisons using a Bonferroni adjustment. This approach is consistent with the Fisher's protected least significant difference multiple comparisons procedure. Tests reported here were computed adjusting for age, sex, recruitment center, and BMI. Adjustments for multiple comparison tests were not performed because of selection of SNPs based on a priori hypotheses.
To examine the joint effect of these polymorphisms and their explanatory power for continuous traits, the model R2 was computed. The R2 statistic was calculated over just the covariates (i.e., age, sex, recruitment center, and BMI) and then with the inclusion of individual SNPs. In addition, stepwise model building was computed (i.e., forward selection with backward elimination) but did not provide additional explanatory information for these traits and SNPs (data not shown). Subjects with type 2 diabetes were excluded from the analysis of glucose homeostasis traits because overt diabetes and its treatment cause secondary changes in glycemic traits that obscure their underlying genetic determinants. SNP alleles were defined as “risk” or “protective” based on previous association studies of type 2 diabetes in European-derived populations (3–6).
This study evaluated 1,849 IRAS-FS participants, 1,268 Hispanic Americans and 581 African Americans. Table 1 summarizes descriptive statistics by ethnicity. On average, the Hispanic- and African-American participants had a similar proportion of women and comparable age and BMI values. Compared with African Americans, Hispanic Americans were more insulin sensitive (SI 2.15 vs. 1.63 × 10−5 min−1/[pmol/l]; P = 0.013), had reduced insulin secretion (AIR 760 vs. 1,006 pmol/l; P < 0.001), and had a reduced disposition index (1,317 vs. 1,426 × 10−5 min−1; P = 0.004). Marker genotyping success rates were 93.3–95.4% for the 17 SNPs examined, and blind duplicates were concordant. PedCheck analysis resulted in the exclusion of 11 of 34,527 genotypes. All SNPs were consistent with Hardy-Weinberg proportions in the Hispanic- and African-American populations.
The results of the quantitative trait analyses in Hispanic Americans are summarized in Table 2 and compared with the results of previous GWA studies from European-derived populations (3–6) in Table 3. The strongest evidence for association was observed with two SNPs (rs7754840, P = 0.0043, and rs10946398, P = 0.0046) in the intronic region of the cyclin-dependent kinase 5 regulatory subunit associated protein 1-like 1 (CDKAL1) gene with AIR. These SNPs showed the strongest evidence of association in the dominant model (Supplemental Table 1A, which is detailed in the online appendix [available at http://dx.doi.org/10.2337/db07-1169]; P = 0.0010 and 0.0011, respectively) with an 18.1% average decrease corresponding to 151 pmol/l insulin in the genotypic mean for AIR associated with the presence of the “risk” alleles C. The next strongest association was also observed with AIR for rs7480010 (P = 0.0046) in a hypothetical gene (chromosome 11; LOC387761). This SNP showed the strongest evidence of association in the additive model (Supplemental Table 1A; P = 0.0011) with an increase of 100 pmol/l (14.2%; genotype A/G) and 307 pmol/l (43.7%; genotype G/G) in the genotypic mean for AIR associated with the increasing copy number of the G allele. Notably, the G allele was previously denoted the “risk” allele in type 2 diabetes studies of European-derived populations because of increased prevalence of the allele in type 2 diabetes cases versus controls (5). Association at this locus was also seen with disposition index (P = 0.036) following an additive model (Supplemental Table 1A; P = 0.011) with an increase of 50 × 10−5 min−1 (3.9%; genotype A/G) and 353 × 10−5 min−1 (27.5%; genotype G/G) in the genotypic mean for disposition index associated with the increasing number of G alleles. In addition, SNP rs4402960 in the intronic region of insulin-like growth factor 2 mRNA-binding protein 2 (IGF2BP2) was associated with disposition index (P = 0.011). This SNP showed the strongest association in the additive model (Supplemental Table 1A; P = 0.0031) with a decrease of 221 × 10−5 min−1 (15.3%; genotype G/T) and 383 × 10−5 min−1 (26.6%; genotype T/T) in the genotypic mean for disposition index associated with number of the “risk” allele T. The SNPs evaluated explained, on average, <1% of the variance (R2) for the three quantitative traits examined (R2; Table 2). Analysis of SNPs in the other eight loci, protein kinase N2 (PKN2), a hypothetical gene (FLJ39370), solute carrier family 30, member 8 (SLC30A8), cyclin-dependent kinase inhibitor 2A/B (CDKN2B/CDKN2A), the insulin-degrading enzyme (IDE)/kinesin family member 11 (KIF11)/hematopoietically expressed homeobox (HHEX) gene cluster, an intragenic region on chromosome 11, the exostosin 2 (EXT2)/aristaless-like 4 (ALX4) gene region, and fat mass- and obesity-associated (FTO), did not show any evidence of association in the Hispanic-American subjects.
In African Americans (Tables 4 and 5, study comparisons), the strongest evidence for association was observed between two SNPs (rs7754840, P = 0.049, and rs10946398, P = 0.063) in the CDKAL1 gene and SI. These SNPs showed the strongest evidence of association in the additive model (Supplemental Table 1B; P = 0.016 and 0.027, respectively) with an average decrease of 0.24 × 10−5 min−1/[pmol/l] (12.9%; genotype G/C) and 0.33 × 10−5 min−1/[pmol/l] (17.5%; genotype C/C) in the genotypic mean for SI associated with the number of “risk” alleles C. Two additional loci showed evidence for association with disposition index. A nonsynonymous SNP, rs13266634 (P = 0.050), in the SLC30A8 gene was associated with disposition index following an additive model (Supplemental Table 1B; P = 0.021). The “risk” allele C was associated with a 1,011 × 10−5 min−1 (38.9%; genotype T/C) and 1,236 × 10−5 min−1 (47.5%; genotype C/C) decrease in disposition index associated with the number of “risk” alleles. SNP rs7923837, downstream of the IDE/KIF11/HHEX gene cluster, was modestly associated with disposition index (P = 0.045) following an additive model (Supplemental Table 1B; P = 0.024). The “risk” allele G was associated with a 505 × 10−5 min−1 (64.4%; genotype A/G) and 664 × 10−5 min−1 (84.7%; genotype G/G) increase in disposition index associated with the number of risk alleles. The SNPs evaluated explained, on average, <1% of the variance (R2) for the three quantitative traits examined (R2; Table 4). Analysis of SNPs in the other eight genes, PKN2, IGF2BP2, FLJ39370, CDKN2B/CDKN2A, LOC387761, intragenic region on chromosome 11, EXT2/ALX4, and FTO, did not show any evidence of association in the African-American subjects.
Quantitative trait analysis results of glucose homeostasis phenotypes differed dramatically between the two populations examined (Tables 2 and 4). The most striking associations observed in the Hispanic American population were at the CDKAL1 locus. Two highly correlated SNPs (rs7754840 and rs10946398; r2 = 1.0) were associated significantly with β-cell function as measured by AIR (P < 0.0046). Genotypic means for AIR were consistent with the “risk” alleles C having a reduced AIR following a dominant model (P < 0.0011). In African Americans, there was a dramatic difference in minor allele frequency (MAF) for these SNPs (C allele; 0.63 vs. 0.34 in Hispanic Americans), and associations at this locus were limited to nominal association with SI (P < 0.063). Genotypic means associated with the “risk” allele C had a decreased SI following an additive model (P < 0.027), which is consistent with previous reports (3,14). This difference in trait association may reflect the significant biological differences observed between the African- and Hispanic-American subjects with regard to SI and AIR, as seen in Table 1. Results of this association could also reflect pleiotropy, however, the genetic correlation between SI and AIR in the African American subjects is −0.09 ± 0.23, which is inconsistent with this hypothesis.
Similar to the results of association analysis with CDKAL1, a variant in a hypothetical locus (LOC387761) was associated with different phenotypes in the two populations examined. In Hispanic Americans, the previously identified “risk” allele G of rs7480010 was significantly associated with an increased AIR (P = 0.0046) and modestly associated with an increased disposition index (P = 0.036). These traits are mathematically related (disposition index = SI × AIR) and have a genetic correlation in these Hispanic-American subjects of 0.68 ± 0.07. In African Americans, there is a trend toward association at this locus with decreased SI (P = 0.068) corresponding to the “risk” allele. The confounding associations observed at the CDKAL1 and LOC387761 loci could be attributed partially to the dramatic difference in MAF between Hispanic Americans and African Americans which raises the possibility that the identified susceptibility variant is not causal but exhibits effects via linkage disequilibrium, patterns which are different between populations. These are the only two loci that were associated with quantitative measures of glucose homeostasis in both populations in this study, although they have contrasting evidence of phenotypic association across populations. In addition, it is worth noting that the Hispanic- and African-American cohorts examined are phenotypically diverse in terms of glucose homeostasis parameters with African Americans having a significantly lower SI (P = 0.013) and higher AIR (P < 0.001). Therefore, the lack of a compensatory increase in AIR observed in the African-American cohort in the presence of a significantly decreased SI could be attributed to an already increased baseline AIR.
At the SLC30A8 locus, a nonsynonymous variant (R325W; rs13266634) was associated with variation in the disposition index in the African-American cohorts (P = 0.050) and more modestly in the Hispanic-American (P = 0.078) cohorts. The “risk” allele C, identified and replicated across all four GWA studies (3–6), was associated with a reduced disposition index (P = 0.05) following an additive genetic model (P = 0.021). Of the loci examined, variation at the SLC30A8 locus represents the only evidence of consistent association with the GWA reports (3–6) as to the direction of “risk” and consistent findings in the two non–European-origin populations examined herein.
Association observed at the IGF2BP2 locus was limited to the Hispanic-American cohort. SNP rs4402960 was associated with alteration of the disposition index with the “risk” allele T, as determined from the GWA reports (3–6), at a comparable frequency compared with estimates from the European-derived populations and associated with reduced disposition index (P = 0.011) following an additive model (P = 0.0031). Lack of association with glucose homeostasis phenotypes in the African-American population could be attributed to a substantially increased diabetes “risk” allele frequency (MAF = 0.50) and linkage disequilibrium block boundaries, which differ between the European-American and African populations as suggested from HapMap data.
In addition, evidence for association of variants located downstream of the IDE/KIF11/HHEX gene cluster was limited to a single SNP (rs7923837) associated modestly with disposition index (P = 0.045) exclusively in the African American population. Proposed susceptibility variants in PKN2, FLJ39370, CDKN2A/CDKN2B, an intragenic region on chromosome 11, EXT2/ALX4, and FTO failed to show evidence of association with the measures of glucose homeostasis evaluated in either ethnic group.
Although the IRAS-FS was designed to study quantitative traits related to glucose homeostasis (7), there were additional subjects with type 2 diabetes in these families, which allowed us to perform association analysis of the 17 GWA SNPs with type 2 diabetes as a qualitative trait. Likely reflecting the relatively modest numbers of those affected by type 2 diabetes in IRAS-FS (181 Hispanic Americans and 71 African Americans), results were inconsistent and largely nonsignificant (Supplemental Tables 2A and 2B). In Hispanic Americans, a single SNP (rs9300039) in an intragenic region on chromosome 11 was found to be associated with type 2 diabetes (P = 0.039; Supplemental Table 2A). The A allele of SNP rs9300039 had an odds ratio (OR) of 0.49 (95% CI 0.25–0.96) and therefore was found to be associated with protection from type 2 diabetes. Scott et al. (4) found association of the C allele with type 2 diabetes “risk”, 1.48 (1.28–1.71), but this result failed to replicate in the companion publications (3,6). In the African American population, a single SNP (rs4402960) in IGF2BP2 was significantly associated with type 2 diabetes as a qualitative trait (P = 0.021; Supplemental Table 2B). The T allele of SNP rs4402960 was associated with protection from type 2 diabetes, 0.59 (0.38–0.92). This finding is inconsistent with three GWA publications (3,4,6), which found this allele to be associated with type 2 diabetes “risk” (meta analysis OR 1.14). The difference in directionality of association could be due to a marked difference in MAF between African-American (MAF = 0.50) and European-derived (MAF = 0.30) populations, which is consistent with HapMap estimates of allele frequency and linkage disequilibrium structural differences. As noted above, there is modest power given the sample size (Supplemental Table 3), nominal P values, and limited evidence that these SNPs contribute to differential type 2 diabetes risk in an independent African-American type 2 diabetes case/control sample (J. Lewis, personal communication). These results suggest these type 2 diabetes results should be viewed as preliminary findings in these populations and any conclusions on the genetic basis of clinical diabetes are not warranted from these data alone.
The IRAS-FS was designed to determine the underlying genetic and environmental contributors to insulin resistance and more broadly glucose homeostasis through quantitative trait analysis. The availability of high-quality metabolic testing in the IRAS-FS, which few studies have, facilitates interrogation of metabolic pathways through which loci implicated in type 2 diabetes susceptibility may influence glucose metabolism. This, taken together with the recruitment of multigenerational pedigrees, with attendant significant increase in power over a sibpair study design, enhance the ability of IRAS-FS to detect and comprehensively evaluate genes related to glucose homeostasis and, in turn, type 2 diabetes. Taken together, the results of the association analyses reported here suggest that a small number of type 2 diabetes susceptibility loci, CDKAL1, LOC387761, SLC30A8, and IGF2BP2, identified from studies in European-derived type 2 diabetes populations, contribute modestly to variation in glucose homeostasis in Hispanic Americans and African Americans. The balance of the associations with measures of glucose homeostasis suggest that the CDKAL1, LOC387761, SLC30A8, and IGF2BP2 loci are contributing to diabetes susceptibility primarily through effects on insulin secretion as measured by AIR or through homeostatic regulation of the balance of insulin secretion and insulin sensitivity. Any strong evidence for association with insulin sensitivity, a primary component of diabetes susceptibility, is strikingly absent. Therefore, further research for genes effecting insulin sensitivity is in order.
This research was supported in part by NIH grants HL060894, HL060931, HL060944, HL061019, and HL061210.
Published ahead of print at http://diabetes.diabetesjournals.org on 5 February 2008. DOI: 10.2337/db07-1169.
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked “advertisement” in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
- Received August 17, 2007.
- Accepted January 15, 2008.