Strong Parent-of-Origin Effects in the Association of KCNQ1 Variants With Type 2 Diabetes in American Indians
- Robert L. Hanson⇑,
- Tingwei Guo,
- Yunhua L. Muller,
- Jamie Fleming,
- William C. Knowler,
- Sayuko Kobes,
- Clifton Bogardus and
- Leslie J. Baier
- Phoenix Epidemiology and Clinical Research Branch, National Institute of Diabetes and Digestive and Kidney Diseases, Phoenix, Arizona
- Corresponding author: Robert L. Hanson, .
Parent-of-origin effects were observed in an Icelandic population for several genetic variants associated with type 2 diabetes, including those in KLF14 (rs4731702), MOB2 (rs2334499), and KCNQ1 (rs2237892, rs231362). We analyzed parent-of-origin effects for these variants, along with two others in KCNQ1 identified in previous genome-wide association studies (rs2237895, rs2299620), in 7,351 Pima Indians from 4,549 nuclear families; 34% of participants had diabetes. In a subset of 287 normoglycemic individuals, acute insulin secretion was measured by an intravenous glucose tolerance test. Statistically significant (P < 0.05) parent-of-origin effects were seen for association with type 2 diabetes for all variants. The strongest effect was seen at rs2299620 in KCNQ1; the C allele was associated with increased diabetes when maternally derived (odds ratio [OR], 1.92; P = 4.1 × 10−12), but not when paternally derived (OR, 0.93; P = 0.47; P = 9.9 × 10−6 for difference in maternal and paternal effects). A maternally derived C allele also was associated with a 28% decrease in insulin secretion (P = 0.002). This study confirms parent-of-origin effects in the association with type 2 diabetes for variants in KLF14, MOB2, and KCNQ1. In Pima Indians, the effect of maternally derived KCNQ1 variants appears to be mediated through decreased insulin secretion and is particularly strong, accounting for 4% of the variance in liability to diabetes.
Several single nucleotide polymorphisms (SNPs) reproducibly associated with type 2 diabetes recently have been identified (1–4). Many of these are in regions of the genome that are imprinted, and studies of an Icelandic population suggest that there are parent-of-origin effects at four of these variants (5); in other words, the extent of association with the risk allele depends on whether it is inherited from the mother or from the father. The SNPs for which parent-of-origin effects have been observed include one in KLF14 (rs4731702), one near MOB2 (rs2334499), and two independent SNPs in KCNQ1 (rs231362 and rs2237892) (5). The presence of parent-of-origin effects at these SNPs is consistent with imprinting and may have important implications for the mechanisms by which variants in or near these genes confer susceptibility to type 2 diabetes. However, for some of the SNPs, the current statistical evidence for parent-of-origin effects is modest. Furthermore, to our knowledge, these effects have not been replicated in other ethnic groups, nor have parent-of-origin effects been analyzed for metabolic traits that underlie the risk of type 2 diabetes, perhaps because few large studies have family data. In the current study, we have analyzed parent-of-origin effects at these SNPs in Pima Indians, an American Indian population in which the prevalence of type 2 diabetes is extraordinarily high (6) and in which family and detailed metabolic data were obtained.
RESEARCH DESIGN AND METHODS
The participants were derived from a longitudinal, population-based, epidemiologic study conducted in the Gila River Indian Community in central Arizona, where most of the residents are Pima Indians (6). In this study, all community members older than 5 years of age were invited to attend an examination every 2 years. Information on family relationships was collected. The examinations included a 75-g oral glucose tolerance test. Diabetes was diagnosed according to the 1997 American Diabetes Association criteria (7) (a fasting plasma glucose ≥7.0 mmol/L or a 2-h postload plasma glucose ≥11.1 mmol/L or a diagnosis made during the course of routine clinical care). The current study was conducted in 7,351 individuals whose self-reported heritage was at least half American Indian, who had DNA available, and who had data regarding presence of type 2 diabetes. This included 3,604 individuals (2,061 women and 1,543 men) whose heritage was full Pima or Tohono O'odham (a closely related tribe); the mean (±SD) age of these individuals was 40.0 (±16.7) years and 46% had diabetes. The remaining 3,747 individuals (2,052 women and 1,695 men; mean age, 28.4 ± 14.1 years; 22% with diabetes) were largely of “mixed” heritage; on average, the self-reported heritage of this group was 56% Pima Indian and 83% American Indian (which includes other tribes). The full-heritage Pima Indians and the mixed-heritage individuals were initially analyzed separately, and similar association results were obtained. Thus, results are presented for the combined analysis.
Exposure to a diabetic intrauterine environment is a strong risk factor for development of diabetes in Pima Indians independent of genetic background (8), and such intrauterine effects may confound the analysis of parent-of-origin effects (9). To account for this, individuals were classified into three categories based on their likely exposure to a diabetic intrauterine environment. Individuals whose mothers had diabetes diagnosed before the child’s birth were considered to have “definite” exposure; those whose mothers had a nondiabetic examination during the longitudinal study ≥1 year after the child’s birth were considered to have “unlikely” exposure to a diabetic intrauterine environment; and the exposure of all others was considered “indeterminate.” Covariates representing these categories were included in the analyses of association with diabetes, and separate analyses were conducted for each of these groups.
A subset of the population participated in detailed metabolic studies to measure the following traits related to development of diabetes: insulin sensitivity; insulin secretion; and body composition. Percentage body fat was calculated from measurements made by hydrodensitometry or by dual X-ray absorptiometry with use of a conversion equation to make measurements comparable, as previously described (10). These measurements were made in 400 full-heritage Pima Indians (171 women and 229 men; mean age, 26.7 ± 6.1 years). In these same individuals, insulin sensitivity was measured by the hyperinsulinemic-euglycemic clamp technique. As described previously (11), insulin was infused at a rate to approximate physiologic levels (∼130 µmol/L) and glucose was infused at a varying rate titrated to maintain normoglycemia. The rate of glucose uptake, estimated using tracer amounts of [3-3H]glucose and normalized to effective metabolic body size (defined as fat-free mass plus 17.7 kg) (12), was taken as a measure of insulin sensitivity (mg glucose/kg estimated metabolic body size ⋅ min). Insulin secretion was measured by a 25-g intravenous glucose tolerance test as the acute insulin response (μU/mL) 3–5 min after the glucose bolus (11) in 287 individuals (104 women and 183 men; mean age, 26.7 ± 6.2 years) with normal glucose tolerance.
DNA was extracted from nuclear pellets derived from buffy coat using a high-salt precipitation method. Genotyping was conducted using SNPplex (Applied Biosystems, Carlsbad, CA), BeadXpress (Illumina, San Diego, CA), or Assays on Demand (Applied Biosystems) methods according to the manufacturer’s instructions. Each of the four SNPs reported as showing a parent-of-origin effect in the Icelandic study was genotyped (rs4731702, rs2334499, rs231362, rs2237892). Two additional KCNQ1 SNPs, rs2237895 and rs2237897, had associations of genome-wide significance with type 2 diabetes in a Japanese population, and had modest to moderate concordance with rs2237892 (r2 = 0.30 in Japanese) (3). Thus, rs2237895 and rs2299620 (r2 = 0.97 with rs2237897 in Pima Indians) also were genotyped in the current study. Analyses of “blind” duplicates and of Hardy-Weinberg equilibrium were performed to ensure genotyping quality; the concordance rate for duplicate genotypes for all SNPs was >97.3% and all had P > 0.001 for Hardy-Weinberg equilibrium. In addition, 45 SNPs with large differences in allele frequencies between American Indians and European populations (13) were genotyped to obtain estimates of individual European admixture for use as a covariate in association analyses. Individual admixture estimates were derived from these markers by the method of Hanis et al. (14). Relationships among nuclear family members were confirmed with the PREST program (15) by analysis of up to 1,178 SNPs typed in ongoing genetic studies. The 7,351 individuals constituted 4,549 sibships.
The association between genotype and type 2 diabetes was analyzed with control for covariates by a logistic regression model that was fit by the generalized estimating equation procedure to account for dependence among siblings. To assess general association in the absence of parent-of-origin effects, the odds ratio (OR) associated with each copy of the risk allele was calculated from a model in which the number of risk alleles was coded as a numeric variable (0, 1, or 2). The likely parental origin of alleles was assigned by analysis of genotypes observed in an individual, their parents, and their siblings (19% of individuals had both parents available for genotyping, whereas 42% had one parent available and 39% had neither parent available). Two variables, GM and GF, representing the presence of a risk allele inherited from the mother (M) or the father (F), respectively, were included in the logistic regression model. For homozygous individuals, these variables can be assigned unambiguously, i.e., GM = 1 and GF = 1 for those homozygous for the risk allele and GM = 0 and GF = 0 for those homozygous for the low-risk allele. For heterozygous individuals with at least one homozygous parent, the variables also can be assigned unambiguously, e.g., GM = 1 and GF = 0 for a heterozygous individual whose mother is homozygous for the risk allele. For heterozygous individuals for whom the assignment was uncertain, GM and GF were assigned their expected values. This expected value was calculated from the population allele frequencies and the genotypes of all family members using the MLINK program (16). Supplementary Table 1 shows the assignment of GM and GF for various combinations of genotypes for parent and child.
The regression coefficients for the effects of GM and GF (βM and βF) were used to calculate the ORs associated with the presence of a maternally inherited risk allele (ORM) and a paternally inherited risk allele (ORF). A test of parent-of-origin effects was conducted by testing the null hypothesis of equality of these ORs. This was performed by comparison of the difference of βM − βF with its SE, which was derived from the SEs of βM, βF, and their covariance as follows
Pdif is the P value associated with this test.
Simulations conducted with the present set of families suggest that this method has levels of type I error that are close to nominal values under the null hypothesis ORM = ORF and produces estimates that generally closely approximate the true parent-specific ORs (Supplementary Figs. 1 and 2). In the Icelandic study, the OR for the risk allele inherited from the presumptively expressed parent is ∼1.3 (5). The simulations suggest that power to detect an OR of this magnitude at P < 0.05 in the present families was 84% for a risk allele with frequency of 0.1 and 99% for a risk allele with frequency of 0.5; powers to detect a parent-of-origin effect at Pdif < 0.05 were 50 and 70%, respectively.
Analyses of association with continuous variables were conducted in a manner analogous to those for diabetes, with use of a linear mixed model to account for familial effects. The logarithm of each of the traits was analyzed as the dependent variable in these analyses, and the regression coefficient (β) was exponentiated to obtain the average effect on trait values associated with the risk allele, expressed as a multiplier (e.g., exp[β] = 1.10 represents an increase of 10% in the trait value for each copy of the risk allele, whereas exp[β] = 0.90 represents a decrease of 10%). To examine the potential differential association of diabetes with alleles at multiple SNPs in linkage disequilibrium, the methods were extended to assess the parental origin of haplotypes inferred from the genotypes at two or three SNPs. This allows for analysis of parent-specific association for haplotypes composed of the risk allele at one variant and the low-risk allele at one or two of the others.
Results for association of specific parentally derived alleles with type 2 diabetes were combined across the Pima Indian and Icelandic studies by the inverse variance method (17). The published ORs and P values from the Icelandic study were used in these analyses (5), and these P values were used to infer test-based SEs. Cochran's Q statistic was used to assess heterogeneity between the ORs for the two populations. P values for the parent-of-origin effect were combined across populations by an unweighted Z method (18). The proportion of the variance in liability to diabetes accounted for by an association was estimated by iterative application of formulae that relate this value to the OR, allele frequency, and disease prevalence (19), with modification for parent-of-origin effects.
Combined analysis of Pima Indian and Icelandic populations.
Table 1 shows the results of parent-specific association analyses combined across the Pima Indian and Icelandic populations for the four SNPs typed in common. There was a statistically significant parent-of-origin effect (Pdif < 0.05) for each of the SNPs, and the directions of the parent-specific associations were consistent with those expected from the Icelandic study. In general, the evidence for parent-of-origin effects was strongest when both populations were combined. For the KCNQ1 SNP rs2237892, the formal evidence for a parent-of-origin effect was marginal in the Icelandic study, in which the high frequency of the risk allele (0.93) resulted in limited power to detect these effects. In the Pima Indians, the risk allele has a frequency near 0.50, and this results in high power to discriminate parental origin; when the Pima and Icelandic populations are combined, the P value for the parent-of-origin effect at rs2237892 is stronger than in the Icelandic population alone (Pdif = 6.4 × 10−4). There was little evidence for heterogeneity between the populations, except at the KCNQ1 SNP rs231362, where there was suggestion of a protective effect of the paternally derived C allele in Pima Indians, but not in the Icelandic population.
Association with type 2 diabetes in Pima Indians.
Results for the associations of alleles at each of the six SNPs with type 2 diabetes according to parental origin in Pima Indians are shown in Fig. 1. The additional two KCNQ1 SNPs derived from the Japanese studies (rs2237895 and rs2299620) also showed significant parent-of-origin effects (Pdif < 0.05) and strong associations with type 2 diabetes. The strongest associations were seen with the KCNQ1 SNPs rs2237892, rs2237895, and rs2299620.
Table 2 shows the association in individuals classified according to their likelihood of exposure to a diabetic intrauterine environment. In general, the results were consistent across categories of exposure to diabetes in utero. Although there is some variability for the small group of individuals with definite exposure, there were no statistically significant interactions between exposure categories and parent-specific alleles. Furthermore, the parent-of-origin effects for all SNPs remained statistically significant (Pdif < 0.05) among those with unlikely exposure to diabetes in utero, with the exception of the KCNQ1 SNP rs231362, for which Pdif = 0.07. These analyses indicate that confounding of the parent-of-origin effects by exposure to diabetes in utero is unlikely.
Association with metabolic traits.
To determine potential physiologic mechanisms associated with diabetes risk for these SNPs, their association with quantitative metabolic traits was analyzed (Table 3). There was an association between the diabetes risk allele (C) of the KLF14 SNP rs4731702 and lower insulin sensitivity, but the parent-of-origin effect was not statistically significant. The three KCNQ1 SNPs, rs2237892, rs2237895, and rs2299620, were associated with insulin secretion, such that the diabetes risk allele was associated with lower insulin secretion. These associations were statistically significant when the risk allele was of maternal origin, but not when it was of paternal origin. At rs2299620, for example, a maternally inherited C allele was associated with an acute insulin secretion that was 28% lower than for a maternally inherited T allele (P = 0.002)
Haplotype analysis of KCNQ1 variants.
The KCNQ1 SNPs, rs2237892, rs2237895, and rs2299620, are in stronger linkage disequilibrium in Pima Indians than in other populations. The r2 between rs2237892 and rs2237895 is 0.64 (D′= 0.86), whereas between rs2237892 and rs2299620 it is 0.84 (D′= 0.98) and between rs2237895 and rs229620 it is 0.73 (D′= 0.99). To determine potential independent associations with diabetes, parent-specific haplotypic associations with diabetes were analyzed for pairs of these SNPs (Fig. 2). SNPs rs2237892 and rs2237895 contributed independently to the association with diabetes because maternally derived haplotypes with the risk allele at one marker, but not the other, had increased odds of diabetes compared with those with the low-risk allele at both markers. However, when haplotypes of rs2237892 or rs2237895 were considered in conjunction with rs2299620, maternally derived haplotypes with the risk allele (C) at rs2299620 had a significantly greater risk of diabetes than those with both low-risk alleles, although there were few individuals with haplotypes that carried the low-risk allele at rs2299620 and the risk allele at one of the other SNPs. Furthermore, in the analysis of haplotypes composed of all three SNPs, all haplotypes containing the C allele at rs2299620 are associated with increased risk of diabetes, regardless of alleles at the other loci, which exist in all possibilities (Fig. 2). Therefore, rs2299620 is the primary association signal in Pima Indians.
In recent years a number of genetic variants reproducibly associated with type 2 diabetes have been identified (1–4), and four of these variants across three genes in genomic regions known to be imprinted have been identified as having parent-of-origin effects (5). However, the initial studies demonstrating parent-of-origin effects, which were performed in Iceland, have not been replicated in other populations. Robust demonstration of parent-of-origin effects requires family data for large numbers of individuals with and without diabetes, and few studies have such data. In the present analysis, we replicated the parent-of-origin effects for all four of the variants identified in the Icelandic study in 7,351 Pima Indians, a high-risk population for which family data are available. Our findings suggest that functional variants underlying the observed effects at these loci are subject to imprinting and are important determinants of type 2 diabetes in diverse populations, including non-European populations at high risk for diabetes.
When a child’s mother has diabetes during the pregnancy, this exposure to a diabetic intrauterine environment increases the child’s subsequent risk for type 2 diabetes; such exposure to a diabetic intrauterine environment is an important risk factor for diabetes in Pima Indians (8). In some situations, such intrauterine effects can confound the assessment of parent-of-origin effects (9). However, this does not appear to be the case in the current analyses because the parent-of-origin effects were seen even among those who were not likely exposed to a diabetic intrauterine environment (in that the mother had an examination after the child’s birth that did not indicate diabetes).
The consistency of the findings between Pima Indian and Icelandic populations (5) and the consistency with established patterns of imprinting in humans is further evidence that these associations are mediated by imprinted genes. According the Geneimprint website (www.geneimprint.com), KLF14 and KCNQ1 are maternally expressed in humans, which is consistent with our observation that the maternally derived risk alleles at variants in these genes are associated with a statistically significant increased risk of diabetes. The SNP rs2334499 is near MOB2 and the KRTAP5 cluster, neither of which is known to be imprinted but both are located among the large cluster of imprinted genes on 11p15 (www.geneimprint.com). In this region, the parental alleles appear to act with opposite effect. In the Icelandic study, a strong increased risk of diabetes was noted with inheritance of the paternal risk allele T, whereas the same allele was protective when maternally derived (i.e., C is the risk allele when maternally derived). A similar pattern with respect to maternally derived and paternally derived risk alleles was observed in the current study of Pima Indians. It is possible that this pattern reflects more than one functional variant with different parent-specific expression profiles, but functional studies and more detailed mapping of this region are required for confirmation. In general, there was little evidence for heterogeneity in the ORs between the Icelandic and Pima Indian populations. The one exception involved the KCNQ1 SNP rs231362, for which a protective effect of a paternally derived C allele was suggested in Pima Indians in addition to the increased risk associated with a maternally derived C allele seen in both populations. However, the risk allele was at high frequency in Pima Indians (0.90), so this finding involves a relatively small number of individuals.
The strongest associations with diabetes in the Pima Indians are with the cluster of KCNQ1 SNPs rs2237892/rs2273895/rs2299620. These associations, which achieved genome-wide statistical significance in the current study, were “missed” in the initial genome-wide association studies in European populations, because of the high frequency of the risk alleles, and were identified in East Asian populations (3,4). In East Asian populations, the three SNPs are in moderate linkage disequilibrium and may represent independent association signals. In the Pima Indians, the linkage disequilibrium is stronger and the primary association appears to be with rs2299620; the relatively strong association between diabetes and a maternally derived risk allele at rs2299620 in Pima Indians may reflect co-occurrence of the “independent” associations from other populations on a single haplotype. In the Pima Indians, it thus is likely that the functional variants, which remain unidentified, are most strongly concordant with rs2299620.
Maternally derived alleles at rs4731702 in KLF14 are associated with expression of a number of genes that are correlated with insulinemia and other characteristics of metabolic syndrome (20). The diabetes risk allele also was associated with hyperinsulinemia in a previous study, which suggests increased insulin resistance, although parent-of-origin effects were not assessed (1). In the current study, an association with increased insulin resistance was observed using a direct measure, but a parent-of-origin effect was not seen.
Several previous studies have suggested that KCNQ1 SNPs influence diabetes risk through an effect on insulin secretion (3,21–27); however, many of these studies have included individuals who may have had impaired insulin secretion secondary to impaired glucose tolerance. The current study confirms this finding in individuals with normal glucose tolerance. Furthermore, because maternally derived risk alleles were associated with lower insulin secretion but paternally derived alleles were not, the current study is the first to suggest that the effects on insulin secretion are subject to imprinting. Because KCNQ1 is imprinted in most, but not all, human tissues (28), studies of imprinting in tissues relevant to insulin secretion (pancreatic islets, central nervous system, duodenum) would be mechanistically informative.
Among ∼65 variants identified as reproducibly associated with type 2 diabetes, 4 now have been robustly demonstrated to have parent-of-origin effects. This is a high proportion relative to other disorders and is more than expected given estimates that ∼1% of the human genome is imprinted (29). Because many imprinted genes are related to fetal growth and development (29,30), these findings may reflect the importance of these processes in the development of type 2 diabetes. The findings of the current study also illustrate the importance of considering parent-of-origin effects when present, and they demonstrate the extent to which the contribution of individual loci to development of type 2 diabetes varies with ethnicity. In European populations, the TCF7L2 variant rs7903146 is the strongest known genetic contributor to diabetes risk, because it has OR of ∼1.4 and accounts for ∼1.3% of variation in liability for diabetes. In the Pima Indians, however, the TCF7L2 variants have little association with type 2 diabetes (OR, 1.04; 31), and the OR of 1.34 seen in the current study for the KCNQ1 SNP rs2299620 (without consideration of parent-of-origin effects) corresponds to ∼1.6% of the variance in liability for diabetes. When the parent-of-origin effect is taken into account, there is nearly a 15% difference in prevalence of diabetes between carriers of a maternally derived C allele at rs2299620 and carriers of a maternally derived T allele. The proportion of variance in liability for diabetes that the association explains in the Pima Indians is ∼4.0%. This represents one of the largest contributions reported for a single variant to type 2 diabetes risk in any human population.
This work was supported by the intramural research program of the National Institute of Diabetes and Digestive and Kidney Diseases.
No potential conflicts of interest relevant to this article were reported.
R.L.H. wrote the manuscript, researched data, and contributed to discussion. T.G., Y.L.M., J.F., W.C.K., S.K., C.B., and L.J.B. researched data, contributed to discussion, and reviewed and edited the manuscript. R.L.H. is the guarantor of this work and, as such, had full access to all the data in the study and takes responsibility for the integrity of the data and the accuracy of the data analysis.
Parts of this study were presented at the 72nd Scientific Sessions of the American Diabetes Association, Philadelphia, Pennsylvania, 8–12 June 2012.
The authors thank the participants who volunteered for the study and the staff of the Phoenix Epidemiology and Clinical Research Branch who provided assistance.
This article contains Supplementary Data online at http://diabetes.diabetesjournals.org/lookup/suppl/doi:10.2337/db12-1767/-/DC1.
- Received December 14, 2012.
- Accepted April 20, 2013.
- © 2013 by the American Diabetes Association.
Readers may use this article as long as the work is properly cited, the use is educational and not for profit, and the work is not altered. See http://creativecommons.org/licenses/by-nc-nd/3.0/ for details.