DOI: 10.2337/db05-1369
© 2006 by the American Diabetes Association
Polymorphisms in the Glucokinase-Associated, Dual-Specificity Phosphatase 12 (DUSP12) Gene Under Chromosome 1q21 Linkage Peak Are Associated With Type 2 Diabetes
1 Division of Endocrinology and Metabolism, Department of Medicine, University of Arkansas for Medical Sciences College of Medicine, Little Rock, Arkansas
Address correspondence and reprint requests to Steven C. Elbein, MD, Professor of Medicine, University of Arkansas for Medical Sciences, Endocrinology 111J-1/LR, John L. McClellan Memorial Veterans Hospital, 4700 W. 7th St., Little Rock, AR 72205. E-mail: elbeinstevenc@uams.edu
Abbreviations:
AIRg, acute insulin response to glucose; CNG, conserved nongenic region; FSIGT, frequently sampled intravenous glucose tolerance test; SNP, single nucleotide polymorphism; STR, simple tandem repeat
Linkage of type 2 diabetes to chromosome 1q21-q23 is well replicated across populations. In an initial 50-kb marker map (580 markers) across the linked region, one of the two strongest associations observed in Utah Caucasians was at marker rs1503814 (P < 0.00001 in pools, P < 0.004 in individuals). Based on this association, we typed additional markers and screened for sequence variation in the nearby DUSP12 gene. The strongest associations mapped to a highly conserved nongenic sequence just telomeric to rs1503814 and extended 10 kb telomeric through the DUSP12 gene and into the 5' end of the adjacent ATF6 gene. No coding variant could explain the association in the DUSP12 gene. An extended haplotype encompassing markers from –8,379 to +10,309 bp relative to the ATG start was more common in Caucasian case (0.381) than control subjects (0.285, P = 0.005) and was uniquely tagged by a 194-bp allele at either of two simple tandem repeat variants or by the T allele at marker +7,580. Markers –8,379 and +7,580 were nominally associated with type 2 diabetes in African-American subjects (P < 0.05), but with different alleles. Marker rs1503814 was strongly associated with postchallenge insulin levels among family members (P = 0.000002), but sequence variation in this region was not associated with type 2 diabetes in three other populations of European ancestry. Our data suggest that sequences in or upstream of DUSP12 may contribute to type 2 diabetes susceptibility, but the lack of replication suggests a small effect size.
A genetic etiology underlying the high prevalence of type 2 diabetes is widely accepted. Nearly 30 genome scans and countless association studies have been conducted in multiple populations (1), which in aggregate and in recent meta-analyses (2) suggest that the genetic susceptibility to type 2 diabetes will involve many susceptibility loci. 
Genes identified to date have had small to moderate effects and have been difficult to replicate, and risk alleles have generally been marked by noncoding variants (2–5). Reported associations likely account for only a small portion of the total type 2 diabetes genetic susceptibility. Linkage of type 2 diabetes to chromosome 1q21-q23 was described initially by our laboratory (6) in Northern European Caucasians and in Pima Indians (7), and it was subsequently replicated in British (8), French (9), and Amish Caucasians (1,10); in Chinese Han (1,11); and in supportive data in African Americans from Arkansas (S.C.E., unpublished observations). This well-replicated region of type 2 diabetes linkage is characterized by an extraordinary gene density, including a plethora of strong candidate genes. We recently reported that the region encompasses at least two and possibly three linkage peaks (12), and we reported previously that two other genes in this region were associated with type 2 diabetes (5,13). Both associations were observed in other populations (14–16), and they support a model of multiple susceptibility loci accounting for the chromosome 1 linkage signal. In an initial collaborative effort to fine-map type 2 diabetes in Caucasians, we typed 580 single nucleotide polymorphisms (SNPs) over a 20-Mb region (mean 1 SNP/50 kb) that encompassed both major linkage peaks, using MassArray MALDI-TOF mass spectrometry (Sequenom, San Diego, CA) (5,17,18) in pooled DNA samples of 100 case and 100 control subjects of Northern European ancestry (5). We identified two prominent associations, one in intron 2 of the calsequestrin 1 gene (5) and a second at marker rs1503814 (P < 0.00001), which was verified by individual typing of the case and control subjects (P = 0.004). 
Based on these initial data, we evaluated additional SNPs from the public database across an 800-kb region surrounding the initial observation, and we narrowed the association to a 20-kb region that included dual-specificity phosphatase gene 12 (DUSP12) and the 5' end of the endoplasmic reticulum stress factor ATF6. We report studies to identify the associated variants in this region in Caucasians and extend the analysis to African-American case and control subjects. We report that noncoding variants in and near the DUSP12 gene, which has been shown to associate with and activate glucokinase in the liver (19), are associated with type 2 diabetes in both Utah Caucasian and Arkansas African-American populations, although a subset of the variants failed to replicate in other European populations.
DUSP12 gene variation was detected in 16 unrelated Caucasian individuals with type 2 diabetes from Utah families linked to the 1q21 region and 8 nondiabetic (control) family members, and it was detected in 16 diabetic members of African-American families used in the chromosome 1 linkage studies and 8 African-American control subjects. Association studies were conducted in 191 unrelated Caucasian case subjects and 188 unrelated Caucasian control subjects, as described previously (5,20). To reduce genotyping costs, some markers for initial genotyping were typed in pooled DNA samples, as described in detail previously (20,21). African-American samples were ascertained in Arkansas and included 130 unrelated nondiabetic individuals who had no family history of type 2 diabetes and 275 type 2 diabetic subjects who had at least 1 diabetic first-degree relative. Pooled DNA was constructed from 130 control individuals, 125 case subjects with type 2 diabetes and no nephropathy, and 150 individuals with type 2 diabetes and diabetic nephropathy (20). For markers more recently tested, we first typed 48 unrelated nondiabetic Caucasian and 48 nondiabetic African-American subjects for linkage disequilibrium before selecting markers for typing in the full population. Family-based association studies were conducted on 704 members of 68 Utah families used to define the original linkage, of which 292 individuals were considered affected (6,12). We tested the physiological impact of associated markers rs3820449 and simple tandem repeat (STR) marker –8379 in 120 individuals from the Utah families who had undergone tolbutamide-modified frequently sampled intravenous glucose tolerance tests (FSIGTs) (22) and an additional 182 Caucasian individuals from Arkansas who had undergone insulin-modified (n = 78) or tolbutamide-modified (n = 104) FSIGTs. Subjects ascertained in Utah provided written informed consent under a protocol approved by the University of Utah institutional review board. 
Subjects studied in Arkansas provided written informed consent under protocols approved by the University of Arkansas for Medical Sciences human research advisory committee.
DUSP12 screening.
Genotyping of sequence variants. SNPs were typed by pyrosequencing on a PSQ-96 (Biotage, Uppsala, Sweden) (5). SNPs at a distance from the initial observation were typed first in pooled Caucasian samples, and differences between case and control frequencies ≥5% were confirmed in individual samples. Three SNPs not amenable to typing by pyrosequencing were typed in individuals by oligonucleotide ligation (13). STR and insertion deletion markers were detected on LI-COR GR4200 sequencers and scored, using Gene Imager version 3.56 software (Scanalytic, Rockville, MD). For the noncoding regions, particularly in the DUSP12 5' flanking region, which was not highly conserved, we first determined linkage disequilibrium and allele frequencies in 48 control samples. SNPs were chosen for typing in the full Caucasian and African-American populations based on a minor allele frequency over 5% and selection of tagged SNPs, using the parameter r2 > 0.95 and the programs LDSelect (23) and TAGGER (32) (supplemental Table 1S, which is detailed in the online appendix [available at http://diabetes.diabetesjournals.org]). All SNPs included in this study were in Hardy-Weinberg equilibrium. Quality control steps included reliable automated calls by pyrosequencing software and inclusion of at least 30 duplicate samples for the full set. No assays failed the duplicate checks, and all variants typed in families showed Mendelian inheritance. All members of families ascertained in Utah were typed for five DUSP12 variants: rs1503814 (–10473 bp), the –8379 bp STR (rs6143445), rs1027702 (–6735 bp), rs1063178 (+2115 bp), and rs3820449 (+7580 bp). African-American subjects were typed primarily for SNPs shown to be associated in Caucasian individuals, based on linkage disequilibrium determined in 48 unrelated African-American control samples. 
Additionally, we have typed >30 ancestrally informed SNPs as well as >70 total SNPs in candidate genes in the African-American population to detect admixture (24). These results did not support evidence for spurious association based on admixture among African-American subjects.
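The genotyping section relies on two standard quantities: a Hardy-Weinberg equilibrium check for each SNP and a pairwise r2 threshold for tag selection. The following is a minimal Python sketch of both calculations, not the procedure implemented in LDSelect or TAGGER. The function names and the example counts are hypothetical, and the r2 function assumes that phased haplotype frequencies are already available.

```python
from math import erfc, sqrt

def hwe_chi_square(n_aa, n_ab, n_bb):
    """Chi-square test (1 df) for Hardy-Weinberg equilibrium from genotype counts."""
    n = n_aa + n_ab + n_bb
    p = (2 * n_aa + n_ab) / (2 * n)          # frequency of allele A
    q = 1.0 - p
    expected = (n * p * p, 2 * n * p * q, n * q * q)
    observed = (n_aa, n_ab, n_bb)
    chi2 = sum((o - e) ** 2 / e for o, e in zip(observed, expected) if e > 0)
    # Upper tail of a chi-square with 1 df: P(X > x) = erfc(sqrt(x / 2))
    p_value = erfc(sqrt(chi2 / 2.0))
    return chi2, p_value

def r_squared(p_ab, p_a, p_b):
    """Pairwise LD r^2 from the A-B haplotype frequency and the two allele frequencies."""
    d = p_ab - p_a * p_b                      # disequilibrium coefficient D
    return d * d / (p_a * (1 - p_a) * p_b * (1 - p_b))

# Hypothetical example: genotype counts that match HWE proportions exactly
chi2, p_value = hwe_chi_square(25, 50, 25)
```

Under a tagging rule such as r2 > 0.95, two SNPs whose `r_squared` value exceeds the threshold would be treated as redundant and only one of them typed in the full sample.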
CNG.
Analysis of mRNA expression in transformed lymphocytes. Total RNA was isolated from Epstein-Barr virus–transformed lymphocytes, and allele-specific expression of SNP +2115 was quantified from 15 heterozygous individuals, as described previously (20,21). DUSP12 mRNA levels were measured from transformed lymphocyte RNA by quantitative real-time PCR (RT-PCR), using SYBR Green, and normalized to 18S RNA.
Typing of International Type 2 Diabetes Chromosome 1q Consortium samples.
Statistical methods. For analysis of International Type 2 Diabetes Chromosome 1q Consortium data, between-group differences in allele frequency were evaluated on a population-specific basis, using standard contingency table methods, and exact P values were calculated using Stata SE version 8 (Stata, College Station, TX). Single-point data from the case-control samples were combined, using the Mantel-Haenszel fixed-effects method (Stata SE version 8), and combined odds ratios were generated under dominant and recessive models. Analyses in the Amish took into account correlations among related individuals.
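As an illustration of the Mantel-Haenszel fixed-effects combination described above, the pooled odds ratio across population-specific 2 x 2 tables can be sketched in a few lines of Python. This is a hedged sketch of the textbook estimator, not the Stata routine used in the study; the function name, the table layout, and the counts in the example are assumptions made for illustration only.

```python
def mantel_haenszel_or(tables):
    """Pooled odds ratio across strata.

    Each table is (a, b, c, d):
      a = carrier cases,    b = noncarrier cases,
      c = carrier controls, d = noncarrier controls.
    """
    numerator = 0.0
    denominator = 0.0
    for a, b, c, d in tables:
        n = a + b + c + d                 # stratum size
        numerator += a * d / n
        denominator += b * c / n
    return numerator / denominator

# Hypothetical counts for two populations (not data from this study)
pooled = mantel_haenszel_or([(10, 20, 5, 40), (10, 20, 5, 40)])
```

Under a dominant model, "carrier" would mean one or two copies of the tested allele; under a recessive model, it would mean two copies.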
Analysis of DUSP12 region in Caucasians. Based on the initial observation of an association at SNP rs1503814 (location 158,440,777 bp), we selected SNPs for 5-kb spacing extending from 158,416,548 to 158,897,701 bp and identified 48 SNPs that could be typed successfully (mean interval 5.1 kb), including 7 SNPs extending 24.2 kb centromeric and 41 SNPs telomeric to SNP rs1503814 (supplemental Table 1S). The associated SNPs clustered primarily in the 48-kb region telomeric to rs1503814 (158,440,777 to 158,488,694 bp; May 2004 Build 35) and encompassing the DUSP12 gene, a widely expressed dual-specificity phosphatase with homology to the yeast gene YVH1 (38) that has been shown to regulate glucokinase (19). The associated SNPs spanned a region of high sequence conservation among all mammalian species (CNG) between 158,440,995 and 158,443,222 bp (supplemental Fig. 2S), the 3' boundary of which was 8,017 bp upstream from the DUSP12 ATG start (Fig. 1). We screened the full region from 10 kb upstream (a highly conserved, nongenic region) to 10 kb downstream of the DUSP12 ATG start site (Fig. 1). We identified 47 SNPs, including 7 SNPs previously typed (supplemental Table 1S), and, based on the observed associations in Caucasians (Table 1 and Fig. 1), extended the screening to 24 African-American individuals. Among Caucasians, 19 variants were associated with type 2 diabetes (P < 0.05), clustered primarily between –8379 and +7580 bp relative to the DUSP12 ATG start site (15/19 SNPs) and encompassing the DUSP12 gene and 5' flanking region (Fig. 1). An additional six variants showed a trend to an association with type 2 diabetes (P < 0.1), of which three again fell in this interval. Based on these results, we focused the study on the region from –16 to +16 kb. The data are summarized in Table 1 and graphically in Fig. 1, and raw genotype counts are shown in supplemental Table 2S. 
Based on the confidence interval (CI) block definition (39), the entire DUSP12 gene, the 5' promoter, part of the CNG, the 3' flanking region to +6180 bp, and most associated variants were encompassed in one block (block 3) (Fig. 1 and supplemental Fig. 1S). Most associated SNPs that fell outside of this interval were nonetheless in linkage disequilibrium with block 3 SNPs by r2 (Fig. 1). However, rs1503814 (–10473), which was the original observation, fell outside of the associated block and was not in strong linkage disequilibrium with the most significantly associated variants.
The associated region included two STR variants at –8379 and +6705. We tested the association in Caucasians using the CLUMP program (29) (supplemental Tables 3S and 4S); for both STRs, the 194 allele was significantly overrepresented in case subjects relative to control subjects. Using the T3 test, uncorrected P values were 0.0013 and 0.0019 for STRs –8379 and +6705, respectively, with P values based on Monte-Carlo simulation of 0.011 and 0.023, respectively. To facilitate further analyses, we dichotomized both STRs to 194 versus X, where X was any other size. In post hoc analyses, we tested dominant and recessive models for all SNPs and for dichotomized STRs. The associated sequence variants were most consistent with a dominant or additive model for the minor allele (Table 1 and supplemental Table 2S).
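The dichotomization of the STRs and the dominant, recessive, and additive codings can be made concrete with a short Python sketch. The helper names are hypothetical, and the 198-bp allele in the example is an illustrative size rather than a reported genotype; the sketch shows only how a genotype would be coded, not how the association tests were run.

```python
def dichotomize(genotype, risk_allele=194):
    """Relabel a pair of STR allele sizes as '194' versus 'X' (any other size)."""
    return tuple("194" if allele == risk_allele else "X" for allele in genotype)

def code_genotype(genotype, model, risk_allele=194):
    """Code a genotype under a dominant, recessive, or additive model."""
    copies = sum(1 for allele in genotype if allele == risk_allele)
    if model == "dominant":
        return copies >= 1        # one or two copies count as exposed
    if model == "recessive":
        return copies == 2        # only homozygotes count as exposed
    if model == "additive":
        return copies             # 0, 1, or 2 copies
    raise ValueError(f"unknown model: {model}")

# Hypothetical heterozygote carrying the 194-bp allele and a 198-bp allele
labels = dichotomize((194, 198))
```

A heterozygote such as the one above is counted as exposed under the dominant coding but not under the recessive coding, which is why the two models can give different case-control contrasts for the same marker.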
Haplotype analysis in Caucasians.
Association and haplotype analysis in African Americans. To test for replication and to narrow the associated region, we examined 27 SNPs (–10473 to +192519) in 48 African-American individuals (supplemental Figs. 3S and 4S). We selected 16 SNPs and the –8379 STR for minor allele frequencies >5% and based on linkage disequilibrium (r2 < 0.95) to test in the full sample (Table 3). Variants –8379 and +7580 were nominally associated with type 2 diabetes, but in both cases, the allele frequencies and associated alleles differed from Caucasians (Fig. 2 and supplemental Table 4S). We observed 12 alleles at –8379 (from 194 to 224 bp), which showed nominal association with type 2 diabetes: CLUMP analysis (29); T2 and T3 tests P = 0.049 and P = 0.011, respectively; empirical P values of 0.079 and 0.070, respectively, with 10,000 replicates. At SNP +7580 (Table 3), the minor T allele was 10-fold less common among African Americans (0.03 in type 2 diabetic case subjects) than among Caucasian case subjects; the major C allele was overrepresented in type 2 diabetes. Only 2 of 14 haplotypes for the block including DUSP12 showed a nominal association (P < 0.05) with type 2 diabetes (haplotypes 3 and 10) (supplemental Table 7).
Metabolic and family analyses of DUSP12 variants in Caucasians. We used SIMWALK 2 (40) to estimate haplotypes for markers –10473, –8379, –6375, +2115, and +7580 in 740 members of the previously described Utah families (5,6). No variant or haplotype showed excess transmission from parents to affected offspring (5,13), even when restricted to the specific families that generated the linkage signal in this region (6,12). Furthermore, the –8379 194-bp allele did not explain the linkage signal, using the GIST (Genotype-IBD Sharing Test) (41). In contrast, the –10473 SNP was significantly associated with fasting (P = 0.018), 30-min (P = 0.02), 60-min (P = 0.014), and 120-min (P = 0.001) insulin, and particularly the multivariate measure combining all five postglucose challenge insulin levels (P = 0.000002). SNPs +2115 and +7580 showed a similar albeit lesser association with the multivariate insulin measure (P < 0.01). No SNP was associated with fasting or postchallenge glucose levels, however. Transformed lymphocytes were available from most family members. Among 15 individuals heterozygous for transcribed SNP +2115, equal amounts of cDNA were observed from both alleles, and total DUSP12 message levels did not differ between 21 T/T homozygotes and 8 C/C homozygotes (DUSP12–to–18S ratios 9.84 ± 8.73 T/T vs. 9.59 ± 3.17 C/C).
Replication in other chromosome 1q–linked populations.
Chromosome 1q21-q23 is among the best-replicated regions of linkage to type 2 diabetes, with evidence for linkage in multiple Caucasian populations, Pima Indians, Chinese, and, in unpublished studies from our laboratory, African Americans. Previous studies from our laboratory strongly support the existence of multiple susceptibility genes contributing to the overall linkage signal and possibly to the replication of linkage on 1q21-q23. Our current studies have focused on the largest linkage peak, centered at 158 Mb, encompassing markers CRP and ApoA2, and including the calsequestrin 1 gene (CASQ1; 157 Mb), for which we (5) and Fu et al. (14) recently reported an association. That association seemed unlikely to account for the entire linkage signal, and in the current study we followed up on an association at SNP rs1503814, 1.4 Mb away from the CASQ1 gene. This region included only three genes. FCRLM2, which is 5' to rs1503814 and in a region without associated SNPs, is a member of the Fc receptor family involved in antibody-dependent cell cytotoxicity. The dual-specificity phosphatase 12 (DUSP12) gene is widely expressed and shows broad conservation even in yeast. Indeed, the human DUSP12 gene can complement a yeast mutant (38). The entire gene and most of a 5' CNG were included in a single haplotype block (Fig. 1), and within this block most of the common variants were associated with type 2 diabetes in Utah Caucasians (Fig. 1 and Table 1). Although we found some evidence among Caucasians for associated SNPs in the 5' end of the ATF6 gene, these were clearly outside the DUSP12 haplotype block and not in linkage disequilibrium with the DUSP12 SNPs (data not shown). ATF6 is an essential activator of endoplasmic reticulum stress and the unfolded protein response (42) and, based on recent data suggesting a role of endoplasmic reticulum stress in type 2 diabetes, insulin resistance, and impaired insulin secretion (43), a strong functional candidate. 
Work is in progress to fully evaluate the large ATF6 gene, but we focused the current study on DUSP12 as the more probable candidate based on the clustering of associated SNPs and STRs, the CNG, and evidence for association (albeit with different alleles) in African-American subjects. DUSP12 was pulled from a rat hepatic cDNA library with a yeast two-hybrid system, using glucokinase as bait, and identified as the glucokinase-associated protein (19). DUSP12 (glucokinase-associated protein) accelerated glucokinase activity in a manner suggesting functional significance. Hence, DUSP12 may modulate glycolysis in the liver and pancreatic ß-cell through dephosphorylation of glucokinase in the cytoplasm. The wide distribution and relatively high levels of expression suggest that DUSP12 may have other, as yet unidentified, roles. Because we identified no coding variants that could explain the association in DUSP12, we propose that the associated variant or variants, mainly in highly conserved regions, may be regulatory. The –8379 STR, which showed strong associations in Utah Caucasians and more modest associations in African Americans, is within a very highly conserved region of >2 kb, 8 kb upstream of DUSP12 (supplemental Fig. 2S). The 194 allele, which was associated with type 2 diabetes in Caucasians, eliminates the binding site for the transcription factor SRY (sex-determining region Y), but this factor appears unlikely to have a role in type 2 diabetes. Based on the strong conservation across mammalian species, this region may act as an enhancer for DUSP12, ATF6, or a chromosome 1 gene beyond this region of association. Given the size of the CNG, we considered that the region might be transcribed, but we were unable to detect a transcript in HepG2 cells or transformed lymphocytes (unpublished data). 
The +6705 microsatellite is not in a conserved region, but it contains multiple copies of the predicted binding site for the homeobox transcription factor CDXA (CDX1; caudal type homeobox transcription factor 1), which is widely expressed. The multiple CDXA binding sites and lack of conservation suggest that the +6705 variant may not itself be the causative variant. In contrast, SNP +7580, at the 3' end of DUSP12, is at the downstream end of a region of conservation among mammals. The common C (G) allele is conserved in dogs, chimpanzees, and humans, and it binds the basic helix-loop-helix transcription factor TH1/E47. That factor in turn interacts with key pancreatic ß-cell genes to regulate insulin gene transcription. Altered binding might impact transcription of either DUSP12 upstream or ATF6 downstream in a tissue-specific manner. However, opposite alleles are associated in Caucasians and African Americans, which appears inconsistent with a causative variant. Finally, although STR –8379 and SNP +7580 show the strongest associations with type 2 diabetes and are in the most conserved regions among mammals, the strongest quantitative trait association was with SNP –10473 farther upstream and post–glucose load insulin levels. This SNP was the location of the original observation, but it was no longer the strongest association with type 2 diabetes once additional case and control subjects were typed. Replication of association with complex disease genes has been difficult, and, consequently, concerns have been raised about the power to detect and replicate complex disease associations (44). Multiple apparently convincing associations in smaller samples have failed to replicate in very large populations (45–47). Both of our study populations are relatively small, although our previously described association at the PKLR locus has held up in the larger International Type 2 Diabetes Chromosome 1q Consortium data set (16). 
Thus, the associations found here may well be spurious. Facts favoring a role for DUSP12 in type 2 diabetes include the strong candidacy based on a previous study (19); the finding of associated DUSP12 region markers in a second population, albeit with less significance and with different alleles; and the association of SNPs in this region with postchallenge insulin levels in unaffected family members. Nonetheless, several aspects of our extensive analyses did not support a direct role of DUSP12. First, we could not find a difference in allelic expression in transformed lymphocytes for a marker (SNP +2115) that is in strong linkage disequilibrium with the risk haplotype, nor did DUSP12 expression differ among homozygotes at this SNP. This finding is inconsistent with a cis-acting regulatory variant, and may suggest involvement of a gene other than DUSP12. Alternatively, altered DUSP12 regulation may be tissue specific. Second, the only physiologic consequence of DUSP12 region variants was on the insulin response to an oral glucose load, but detailed measures of insulin sensitivity and insulin response to intravenous glucose were not altered. Other challenges, such as a graded glucose infusion or measurement of hepatic glucose function, may be required to understand the physiologic importance of DUSP12 variants. Third, the associated variants were not transmitted in excess from parents to affected offspring, nor could we explain the linkage signal using the GIST program. Such tests have also failed for PKLR alleles and CASQ1 variants (5,13). For variants with a minor allele frequency of 0.2, using all available Utah families, we have 63% power to detect a transmission rate of 0.6 from a heterozygous parent to affected offspring, which translates to a relative risk of 1.24 under a dominant and 1.37 under a recessive model. 
Finally, and perhaps most importantly, analysis of SNPs in this region, including the associated SNP +7580, showed no association with type 2 diabetes in the European-based populations from the International Type 2 Diabetes Chromosome 1q Consortium when Utah samples were not included. Notably, these studies did not include the two STR variants that show the strongest associations in the Utah population. Although typed SNPs, particularly +7580 (rs3820449), should have served as a proxy for the 194-bp allele, the possibility remains that these studies did not type the functional variants.
In summary, we have performed a detailed analysis of markers in a
This work was supported by grants from the National Institutes of Health/National Institute of Diabetes and Digestive and Kidney Diseases (NIDDK; no. DK39311) and by the Research Service of the Department of Veterans Affairs. Subject ascertainment was supported in part by grants from the American Diabetes Association, and subject ascertainment and metabolic studies were supported by General Clinical Research Center Grant M01RR14288 from the National Institutes of Health/National Center for Research Resources to the University of Arkansas for Medical Sciences. Studies of the International Type 2 Diabetes Chromosome 1q Consortium were supported primarily as a supplement to NIDDK award U01-DK58026. Additional funding sources and International Type 2 Diabetes Chromosome 1q Consortium members are listed in the supplemental data, which is detailed in the online appendix. We thank Yiwen Jia for assisting with lymphocyte mRNA measures.
S.K.D. and W.S.C. contributed equally to the work reported in the manuscript. Additional information can be found in an online appendix at http://diabetes.diabetesjournals.org. The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. Received for publication October 20, 2005 and accepted in revised form May 30, 2006