TCF7L2 Is Not a Major Susceptibility Gene for Type 2 Diabetes in Pima Indians
Analysis of 3,501 Individuals
Analysis of 3,501 Individuals
Abstract
OBJECTIVE— The transcription factor 7-like 2 (TCF7L2) gene was initially reported to be associated with type 2 diabetes in Icelandic, Danish, and U.S. populations. We investigated whether TCF7L2 also has a role in type 2 diabetes susceptibility in Pima Indians.
RESEARCH DESIGN AND METHODS— The six variants reported to be associated with type 2 diabetes in the Icelandic study were genotyped in a population-based sample of 3,501 Pima Indians (1,561 subjects had type 2 diabetes, and 1,940 did not have diabetes). In addition, the coding and promoter regions of TCF7L2 were sequenced in 24 Pima subjects. The one variant identified by sequencing, 35 additional database variants positioned in introns, and the six variants reported in the Icelandic study were genotyped in Pima families to determine the haplotype structure of TCF7L2 among Pima Indians. Fourteen representative variants were selected and genotyped in 3,501 Pima Indians.
RESULTS— The six variants initially reported to be associated with type 2 diabetes were less common in Pima Indians compared with samples of European origin, and none were associated with type 2 diabetes. One representative variant, rs1225404, was nominally associated with type 2 diabetes in a general model (additive P = 0.03, dominant P = 0.005) but not in a within-family analysis (additive P = 0.2, dominant P = 0.07). However, several variants were associated with BMI; in particular, rs12255372 was associated in both general and within-family analyses (both P = 0.0007). Modest associations were also found with traits predictive for type 2 diabetes.
CONCLUSIONS— Variation within TCF7L2 does not confer major risk for type 2 diabetes among the Pima Indian population.
A microsatellite marker (DG10S478) within intron 3 of the transcription factor 7-like 2 (TCF7L2) gene and five intronic single nucleotide polymorphisms (SNPs) have been reported to be highly associated with type 2 diabetes in subjects from Iceland, Denmark, and the U.S. (1). Associations with these specific variants and type 2 diabetes have subsequently been replicated consistently and robustly in multiple studies involving subjects of European origin (2–11), Asian Indians (12), and Japanese subjects (13,14). To investigate whether variation in TCF7L2 also has a major role in type 2 diabetes susceptibility in Pima Indians, a population with an extraordinarily high prevalence of type 2 diabetes, variants from the initial report (1), as well as 14 additional representative variants, were genotyped in a population-based sample of full-heritage Pima Indians for association analyses.
RESEARCH DESIGN AND METHODS
All subjects are Pima Indians who are participants in our ongoing longitudinal study of type 2 diabetes among members of the Gila River Indian Community (15). Initially, a family-based sample was genotyped to determine the haplotype structure in this population, and representative SNPs were subsequently genotyped in a population-based sample for association analyses. The family-based sample consisted of 1,037 subjects (578 with type 2 diabetes and 459 without diabetes) from 332 nuclear families in 112 pedigrees. The population-based sample consisted of 3,501 full-heritage Pima Indians for whom there was DNA and information on diabetes status and BMI. This sample consisted of 1,561 subjects with type 2 diabetes (580 of whom were male and 981 female, with mean ± SD BMI 38.5 ± 8.4 kg/m2 and age of onset 37.2 ± 12.1 years) and 1,940 nondiabetic subjects (902 of whom were male and 1,038 female, with BMI 35.7 ± 8.2 kg/m2, aged 31.1 ± 14.5 years), as defined by a 2-h oral glucose tolerance test (OGTT) (16). A total of 896 subjects overlapped between the family- and population-based samples. Among the nondiabetic subjects, a subset (n = 372) had additionally undergone metabolic phenotyping as inpatients in our clinical research center. Glucose tolerance was determined by a 75-g OGTT with measures of fasting plasma glucose; 30-, 60-, 120-, and 180-min plasma glucose; and insulin concentrations. The acute insulin response was measured by collecting blood samples before a 25-g glucose bolus infusion and 3, 4, 5, 6, 8, and 10 min afterward. Acute insulin response was calculated as the mean increment in plasma insulin concentrations from 3 to 5 min (17). Insulin sensitivity was assessed using a two-step hyperinsulinemic-euglycemic clamp (17). Body composition was estimated by underwater weighing or dual-energy X-ray absorptiometry (DPX-1; Lunar Radiation) (18). All of the studies were approved by the Gila River Indian Community Council and the institutional review board of the National Institute of Diabetes and Digestive and Kidney Diseases.
SNP identification and genotyping.
Sixteen exons that encode the 10 transcripts of TCF7L2, all exon-intron boundaries extending >100 bp into each intron, the 5′ and 3′ untranslated regions, and 2 kb of the upstream (putative promoter) region were sequenced in DNA samples from 24 non–first-degree–related Pima Indians (of whom 12 had age of type 2 diabetes onset <25 years and 12 were confirmed to be without diabetes at age ≥45 years), using Big Dye terminator (Applied Biosystems) on an automated DNA capillary sequencer (model 3730; Applied Biosystems). SNPs identified by sequencing and database SNPs positioned within unsequenced intronic regions were genotyped by the method of SNPlex (Applied Biosystems). Microsatellite marker DG10S478 was amplified and analyzed by capillary electrophoresis on an ABI 3730 DNA sequencer (Applied Biosystems).
Statistical analysis.
The relationship between genotype and continuous variables was assessed by linear regression with adjustment for appropriate covariates. The association of SNPs with type 2 diabetes was assessed by logistic regression with adjustment for covariates. Both linear and logistic models were fit with generalized estimating equations to account for familial relationships (i.e., sibships). For regression modeling in the additive model, homozygotes for the major allele (1/1), heterozygotes (1/2), and homozygotes for the minor allele (2/2) were coded to a continuous numeric variable for genotype (0, 1, and 2). The dominant model was defined as 1/1 + 1/2 vs. 2/2 and the recessive model as 1/1 vs. 1/2 + 2/2. In addition to these general association tests, within-family tests of association were conducted by a modification of the method of Abecasis et al. (19) to control for potential population stratification. A sliding window analysis using a four-SNP window was used for haplotype analysis. This approach provides 16 different windows from the 19 SNPs. The MLINK program (20) was used to assign a probability of carriage of a given haplotype as previously described (21). An exhaustive analysis was done, testing all common (minor allele frequency >1%) haplotypes for all possible combinations of one, two, three, and four SNPs within each window. The haplotype frequencies were estimated with ILINK to account for the familial relationships (20).
RESULTS
Association with type 2 diabetes.
The microsatellite marker DG10S478 and five SNPs (rs7901695, rs7903146, rs7895340, rs11196205, and rs12255372) that were highly associated with type 2 diabetes in the Icelandic study (1) were genotyped in a population-based sample of Pima Indians (n = 3,501). DG10S478 was monomorphic for the protective allele (designated as allele 0 in ref. 1) in full-heritage Pima Indians. The minor alleles of the five SNPs, which were the diabetes risk alleles in other populations, were less common in Pima Indians (frequencies from 0.01 to 0.1 in Pima Indians compared with 0.2 to 0.5 in Caucasians), and none were associated with type 2 diabetes under either a general or within-family model in these Native American samples (Table 1). In a recent meta-analysis containing >17,000 case and >29,000 control subjects from various populations, the odds ratio (OR) for diabetes per copy of the rs7903146 T-allele was 1.46 (95% CI 1.42–1.51) (22). In the present study, the 95% CI for the OR excludes an effect of this magnitude (OR per copy of the T-allele 1.04 [95% CI 0.82–1.32]), and the Pima OR is significantly different from the global estimate (Q = 7.62; P = 0.006); thus, the lack of association with rs7903146 in the present study does not reflect inadequate power. On the other hand, rs12255372 is so rare in Pima Indians that the power to detect an association is limited even with the present sample size. Although the OR suggests no association with rs12255372, the CIs are consistent with a fairly large effect (OR per copy of the T-allele 0.87 [0.41–1.87]).
The association of rs7903146 with type 2 diabetes in Pima Indians was not substantially changed when BMI was included as a covariate. The OR for the T-allele was 1.10 per copy (95% CI 0.85–1.41), P = 0.46, when controlled for BMI. If the analysis for type 2 diabetes was stratified by sextiles of BMI, the T-allele tended to be associated with lower type 2 diabetes prevalence among those in the lowest sextile (BMI <29.5 kg/m2; n = 462, 125 with diabetes; OR 0.67 per copy of the T-allele) (supplemental Table 1 [available at http://dx.doi.org/10.2337/db07-0621]). Conversely, among those in the highest sextile of BMI (>44.8 kg/m2; n = 458, 262 with diabetes), the T-allele tended to be associated with higher diabetes prevalence (OR 1.87). The P value for the rs7903146 genotype–BMI interaction is 0.002; however, given the absence of an overall association between genotype and diabetes in this population, it is unclear how to interpret a P value of this magnitude.
To examine additional variation across this locus in Pima Indians, TCF7L2 was sequenced in 24 subjects, and one rare SNP predicting a Pro500Thr was identified (frequency of Thr-allele 0.03). The Pro500Thr SNP, the 5 Icelandic SNPs, and the 35 additional SNPs (all noncoding), detected either by sequencing or selected from public databases within intronic regions that were not sequenced, were genotyped in a family-based group of 1,037 Pima Indians to determine the haplotype structure across TCF7L2 (23). Genotypes from these 41 SNPs (supplemental Fig. 1) were used to select 14 representative SNPs. The Tagger algorithm (24) as implemented in Haploview was used to select representative SNPs for genotyping in the full population from among the 20 SNPs with minor allele frequency >0.2. In this analysis, r2 > 0.8 was considered indicative of redundance. As had been done with the five Icelandic SNPs, the representative SNPs were genotyped in the population-based Pima Indian sample (n = 3,501). One SNP, rs1225404, was nominally associated with type 2 diabetes using a general analysis (additive P = 0.03, dominant P = 0.005) but not using a less powerful within-family analysis (additive P = 0.2, dominant P = 0.07) (Table 1).
Haplotypes were constructed from the 19 SNPs in Table 1. A sliding window approach using four SNPs per window was used for the haplotype association analysis (Fig. 1). Modest associations were observed with windows that included the three SNPs rs7085532, rs10787475, and rs1225404, where the G- (rs7085532), T- (rs10787475), and T- (rs1225404) alleles were more common among the nondiabetic subjects compared with subjects with type 2 diabetes in the Pima population sample (frequency 0.08 vs. 0.06; additive P = 0.005; OR 0.68 [95% CI 0.52–0.89]). It is notable that rs1225404, which is part of this haplotype, was the only SNP nominally associated with type 2 diabetes in the single-marker analysis, where the T-allele was the protective allele (Table 1). Modest associations were also observed with windows including the C-, C-, and A-alleles for rs10787475, Pro500Thr, and rs911770, respectively, which were more common among nondiabetic subjects compared with type 2 diabetic subjects (frequency 0.22 vs. 0.20; additive P = 0.006; OR 0.79 [0.67–0.93]). However, none of the SNPs in these haplotypes overlapped with the most significant SNPs described by Grant et al. (1), and these modest haplotype associations could be attributed to multiple variant testing.
Association with BMI.
The 19 SNPs (Icelandic and additional representative SNPs) and their haplotypes were also analyzed for associations with BMI in the Pima population sample (Fig. 1 and Table 2). Many of these subjects have been studied longitudinally and had multiple measurements of BMI; therefore, maximum BMI (without regard to diabetes status) was selected for analysis. The rare SNP rs12255372 had the strongest association with BMI, where individuals with the G-allele (frequency 0.99) had an ∼4.5 kg/m2 higher BMI than those without (P = 0.0007). The C-allele of rs7903146 (frequency 0.92) was also associated with increased BMI (P = 0.001), with a difference in BMI of ∼1.2 kg/m2 per copy of the C-allele. Although this C-allele is the low-risk allele for diabetes in most populations, it has previously been reported to be associated with higher BMI (11,25). In Pima Indians, the within-family association of higher BMI with the rs12255372 G-allele was also significant (P = 0.007), but this was not the case for the rs7903146 C-allele (P = 0.35).
Helgason et al. (25) identified a haplotype, termed HapA, that was strongly associated with BMI. This haplotype consisted of the rs10885406 A-allele and the rs7903146 C-allele. rs10885406 was not typed in the present study; however, it is in virtually complete concordance with rs7895340 in Asian populations; thus, in the current study, HapA was constructed using the rs7903146 C-allele and rs7895340 G-allele. By this definition, HapA was associated with higher BMI in Pima Indians (∼1.1 kg/m2 per copy of HapA; P = 0.002). However, a different haplotype, consisting of the rs7903146 C-allele and rs12255372 G-allele, provided the strongest association with BMI in the Pima Indian study (1.2 kg/m2 per copy; P = 0.0007). The individual SNPs of this second haplotype, rs7903146 and rs12255372, were not highly concordant (r2 = 0.07), and each remained significantly associated with BMI after controlling for the association of the other, although the statistical significance was attenuated. After controlling for the genotype at rs7903146, the rs12255372 G-allele remained associated with higher BMI (P = 0.01); likewise, after controlling for the rs12255372 genotype, the rs7903146 C-allele remained associated with higher BMI (P = 0.01). The fully parameterized haplotype model did not provide a significantly better fit than the model containing the additive effects of the two genotypes, suggesting that the information for the BMI association is in the genotypes from the two individual SNPs. In Pima Indians, the C-allele of rs7903146 was highly concordant with HapA (r2 = 0.91), making it difficult to differentiate between the effects of HapA and the rs7903146 C-allele on BMI. Likewise, it is difficult to determine whether the associations of rs7903146 and rs12255372 with BMI represent the effects of a single or two distinct functional variants.
Association with pre-diabetic metabolic traits.
Among the 3,501 Pima Indians in the population sample, 372 had been metabolically studied as inpatients in our clinical research center when they did not have diabetes. SNPs were analyzed for associations with quantitative traits that predict type 2 diabetes and obesity in these subjects (Table 3). The five Icelandic SNPs were not associated with any trait, with the exception of the 2-h plasma glucose level following a 75-g OGTT, which showed a nominal association (P = 0.08–0.03; data shown for rs7903146 and rs7895340 in Table 3), where the risk allele for type 2 diabetes in other populations had a higher 2-h plasma glucose level. Among the additional representative SNPs, the C-allele of rs7895307, which was nominally associated with higher BMI in the population sample (P = 0.04) (Table 2), was also nominally associated with higher percentage of body fat and BMI (P = 0.03 and 0.009, respectively) (Table 3) among the metabolically characterized subgroup. This C-allele was also nominally associated with a lower acute insulin response to an intravenous glucose tolerance test (P = 0.02). However, these associations are very modest and would not be significant if adjusted for multiple testing (19 SNPs analyzed for eight diabetes-related quantitative traits).
DISCUSSION
Although variants in TCF7L2, particularly rs7903146, have been reproducibly associated with type 2 diabetes in numerous populations (1–14), the present study shows that these variants are not strongly associated with type 2 diabetes among Pima Indians. However, HapA and the C-allele at rs7903146, which is the low-risk allele for type 2 diabetes among most populations, were associated with higher BMI in the Pima Indians, and similar associations with BMI have been reported in a few other studies (11,25). The Pima Indians have a high prevalence of obesity and type 2 diabetes, and it is possible that the higher BMI associated with the C-allele at rs7903146 overwhelms its protective effect for diabetes in this population. This seems an unlikely explanation for the lack of association between this SNP and type 2 diabetes in the Pima Indians, since the OR was largely unmodified by adjustment for BMI and since the C-allele tended to be associated with higher prevalence of type 2 diabetes among the leanest individuals in the present study, which is opposite the effect seen in most populations. It is possible that variants in TCF7L2 interact with other unidentified genetic or environmental risk factors that are highly prevalent in Pima Indians and that this results in no overall association in this population. Alternatively, since the functional consequences of alleles at rs7903146 are largely unknown, it remains possible that its association with type 2 diabetes in most populations reflects linkage disequilibrium with more distant functional alleles that have a different linkage disequilibrium pattern or are invariant in the Pima Indians. In the present study, additional representative SNPs across TCF7L2 were genotyped, and none were strongly associated with diabetes. Although it is possible that important functional variants in the unresequenced regions of the gene were not detected, the present data do not support a major role for variants in TCF7L2 in the development of type 2 diabetes in this population.
Single SNP and sliding window haplotype analysis for association of TCF7L2 with type 2 diabetes and BMI. An exhaustive analysis was done, testing all common (minor allele frequency >1%) haplotypes for all possible combinations of one, two, three, and four SNPs within each window. The best P value within each window is plotted at the window midpoint (dashed line), along with the P value for the single SNP analysis (points along solid line). The box on the x-axis in A indicates the region of the Icelandic SNPs that are highly associated with type 2 diabetes in other populations. Each of the 19 SNPs listed in Tables 1 and 2 is shown as a square for the single SNP analysis.
Association between SNPs in TCF7L2 and type 2 diabetes in a Pima Indian population–based study
Association between SNPs in TCF7L2 and BMI in a Pima Indian population–based study
Association between SNPs in TCF7L2 and obesity and diabetes-related phenotypes among nondiabetic Pima Indians
Acknowledgments
This study was supported by the intramural research program of the National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health. M.T., Y.L.M., and L.M. are supported by grants from the American Diabetes Association.
Footnotes
Published ahead of print at http://diabetes.diabetesjournals.org on 1 October 2007. DOI: 10.2337/db07-0621.
Additional information for this article can be found in an online appendix at http://dx.doi.org/10.2337/db07-0621.
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked “advertisement” in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
- Accepted September 18, 2007.
- Received May 7, 2007.
- DIABETES