Diabetes 54:2487-2491, 2005 © 2005 by the American Diabetes Association, Inc.
A Large-Scale Association Analysis of Common Variation of the HNF1
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| ABSTRACT |
|---|
|
|
|---|
(TCF1) is a key transcription factor that is essential for pancreatic ß-cell development and function. Rare mutations of HNF1
cause maturity-onset diabetes of the young. A common variant, G319S, private to the Oji-Cree population, predisposes to type 2 diabetes, but the role of common HNF1
variation in European populations has not been comprehensively assessed. We determined the linkage disequilibrium and haplotype structure across the HNF1
gene region using 29 single nucleotide polymorphisms (SNPs). Eight tagging SNPs (tSNPs) that efficiently capture common haplotypes and the amino acid–changing variant, A98V, were genotyped in 5,307 subjects (2,010 type 2 diabetic case subjects, 1,643 control subjects, and 1,654 members of 521 families). We did not find any evidence of association between the tSNPs or haplotypes and type 2 diabetes. We could exclude odds ratios (ORs) >1.25 for all tSNPs. The rare V98 allele (
3% frequency) showed possible evidence of association with type 2 diabetes (OR 1.23 [95% CI 0.99–1.54], P = 0.07), a result that was supported by meta-analysis of this and published studies (OR 1.31 [1.08–1.59], P = 0.007). Further studies are required to investigate this association, demonstrating the difficulty of defining the role of rare (<5%) alleles in type 2 diabetes risk.
Abbreviations: LD, linkage disequilibrium; MAF, minor allele frequency; SNP, single nucleotide polymorphism; tSNP, tagging SNP
The HNF1
gene (TCF1) is an excellent candidate gene for type 2 diabetes. It codes for a key member of a transcription factor network that is essential for the development and function of the pancreatic ß-cell (1,2). Rare severe mutations of the HNF1
gene are the commonest cause of maturity-onset diabetes of the young, a young-onset monogenic subtype of diabetes (3,4). The private HNF1
variant, G319S, predisposes to type 2 diabetes in the Oji-Cree population of Canada (5), and a number of genome-wide scans have found evidence of linkage of type 2 diabetes to 12q24 (6–8), where the HNF1
gene is located.
Recent studies suggest that comprehensive analysis of monogenic diabetes genes can be a productive approach for identifying type 2 diabetes susceptibility variants: rare mutations in Kir6.2, PPAR
, and HNF4
cause monogenic diabetes, and common variants of these genes have been reproducibly associated with type 2 diabetes using large-scale studies and meta-analysis (9–13). The role of common HNF1
variation in type 2 diabetes susceptibility has not been comprehensively assessed. A few studies (14–22), each consisting of a few hundred subjects, have previously examined the role of the amino acid–changing variants I27L, S487N, and A98V but with conflicting results in type 2 diabetes, although there is good evidence that the A98V variant affects ß-cell function in nondiabetic subjects. In this study, we analyzed 5,307 samples that gave us >80% power to detect odds ratios (ORs) for common variants (minor allele frequency [MAF] >5%) of 1.2–1.4 at P < 0.01.
To identify variants and determine linkage disequilibrium (LD) structure across the HNF1
gene, we sequenced all 10 exons, 5 conserved noncoding regions, and 9 amplicons containing more than one single nucleotide polymorphism (SNP) for at least 24 U.K. Caucasian subjects. This gave 91% power to detect alleles of 5% frequency. We identified 29 SNPs, covering 69,396 bp (28 kb 5' and 17 kb 3' of HNF1
) with an MAF >5%. These included the three nonsynonymous variants: I27L, S487N, and A98V. We found strong LD and limited haplotype diversity across the region. In this subsample, five haplotypes that were tagged by three SNPs occurred at >5% frequency and accounted for 68% of all haplotypes (82% of all haplotypes when limited to intragenic SNPs). Supplementary Fig. 1 in the online appendix (available at http://diabetes.diabetesjournals.org) shows a schematic of LD across the HNF1
gene. Supplementary Table 1 provides information on all 29 variants (online appendix).
|
|
gene but across a larger region (118 kbp) and at a greater SNP density than our own study. The eight tSNPs captured a large proportion of identified variation in the region (51% of SNPs with MAF >10% had an r2 value of >0.8, with at least one of the eight tSNPs, and 76% had an r2 value of >0.5) (23). A98V was not efficiently captured by the tSNPs because of its relatively low minor allele frequency (
3%). We therefore also typed this SNP, as it is a nonsynonymous variant, and some, but not all, previous studies have suggested that it is associated with type 2 diabetes and measures of ß-cell function (14,15,17,20–22). We genotyped the eight tSNPs and A98V in a case-control study of 2,010 U.K. type 2 diabetic subjects, 1,643 U.K. control subjects, and a type 2 diabetic family association study (1,654 individuals from 521 families). Clinical details of the subjects used are given in Table 1. The results of our case-control study are shown in Table 2. Two SNPs, A98V (OR 1.30 [95% CI 1.01–1.68], P = 0.04) and rs2071190 (OR 1.12 [1.01–1.25], P = 0.04), demonstrated nominal evidence of association with type 2 diabetes. There were no significant deviations from expected transmission rates in our family-based study (Table 2). Combining the family-based results with the case-control results attenuated the A98V (OR 1.23 [0.99–1.54], P = 0.07) and rs2071190 (OR 1.09 [0.99–1.20], P = 0.06) type 2 diabetes associations. Individual genotype and allele counts are available in supplementary Table 2 (online appendix).
|
The eight tSNPs defined six haplotypes with frequency >5%, which accounted for 84% of all haplotypes across the HNF1
region in the case-control groups. No haplotype differed significantly in frequency between case and control subjects (Table 3). In particular, there was no evidence that the haplotype capturing the Leu27 allele (which is in almost perfect LD with rs3830659, r2 = 0.97) (haplotype A) predisposes to type 2 diabetes. No haplotype differed significantly from expected familial transmission ratios in the familial haplotype association study (Table 3).
|
in type 2 diabetes susceptibility in U.K. Caucasians, we have found no evidence that common HNF1
haplotypes are associated with type 2 diabetes. We were able to exclude ORs of 1.25 for all tSNPs; however, even with this large sample size, certain methodological considerations may have led us to miss real associations. We have not typed all SNPs across HNF1
and therefore may not have adequately captured disease variants across the gene. We note, however, the high level of LD in the region (supplementary Fig. 1 of the online appendix) and the fact that we have captured each of the individual SNPs that showed nominal evidence for association (P < 0.05) in the initial cohort of Winckler et al. (23), with r2 values ranging from 0.66 to 1.0. We clearly cannot exclude that multiple rare alleles (<5% MAF) are associated with type 2 diabetes. A further potential source of error is population stratification, which can cause false-positive or false-negative results. However, all subjects are of U.K. Caucasian origin and family- and case-control–based results are consistent with each other. A further source of false-negative results is the fact that our control subjects are a younger population-based resource rather than age matched to case subjects. As
5% of the control subjects will go on to develop diabetes, this would slightly reduce the power of our study compared with the use of the same number of control subjects that were age matched to the case subjects.
In conclusion, using a tSNP and comparative genomics approach, we have analyzed the HNF1
gene for association of common variation with type 2 diabetes. No variants or haplotypes were associated with type 2 diabetes in the U.K. population.
| RESEARCH DESIGN AND METHODS |
|---|
|
|
|---|
Case-control subjects.
The clinical characteristics of the subjects in our study are given in Table 1. All participants gave informed consent and were U.K. Caucasians. All type 2 diabetic subjects had diabetes defined either by World Health Organization criteria or by being treated with medication for diabetes. Known subtypes such as maturity-onset diabetes of the young or mitochondrial-inherited diabetes and deafness were excluded by clinical criteria and/or genetic testing. Subjects in the type 2 diabetic case group were recruited from three sources, as previously described (13). In brief, these were a collection of young-onset type 2 diabetic subjects (n = 277), probands from type 2 diabetic sibships from the Warren 2 repository (n = 541), and a new collection of type 2 diabetic subjects from the Warren 2 repository (n = 1,192). Patients who had GAD autoantibodies at recruitment have been excluded for the first two groups of case subjects but not the third. Population control subjects were recruited from two sources: 1) parents from a consecutive birth cohort (Exeter Family Study) with normal (<6.0 mmol/l) fasting glucose and/or normal HbA1c (A1C) levels (<6%) (n = 1,176) and 2) a nationally recruited population control sample of U.K. Caucasians obtained from the European Cell Culture Collection (n = 467). Excluding the young-onset type 2 diabetic subjects, which are younger at diagnosis and more obese, from the combined analysis did not affect the overall association results.
Family-based subjects.
The clinical characteristics of the affected probands in our family-based study are shown in Table 1. All subjects are independent of those from the case-control study. Families fitting the following criteria were included: an affected proband with both parents (404 "trios" families) or one parent and at least one unaffected sibling (117 "duos" families). The characteristics of some of these families have been previously described (13,24).
Comparative genomics to identify conserved noncoding sequences.
We downloaded the human, mouse, and rat sequences of HNF1
, including 50 kb upstream and 500 bp downstream from the Santa Cruz genome browser builds from November 2002, February 2003, and January 2003, respectively. Further sequence 3' was not analyzed due to the presence of another gene starting at 536 bp 3' of HNF1
. Pairwise alignments were performed using BLAST2. A cutoff of >100 bp showing >75% matches was used to define conserved noncoding sequences. Regions sequenced that contained conserved noncoding sequences occurred at base pairs (NCBI34) 119,816,898–119,817,223 (5'); 119,828,047–119,828,501 (promoter); 119,829–119,829,732 (intron 1); 119,830,033–119,831,037 (intron 1); and 119,850,821–119,851,107 (3' untranslated region).
Variant detection.
To ascertain all common variation in the coding and conserved noncoding sequence, we sequenced the 10 exons and conserved noncoding sequences in a minimum of 24 subjects. In addition, we sequenced a further nine amplicons, five 5' and four 3', of the HNF1
gene, selected as having more than one SNP present in the SNP database. Sequencing was performed on ABI-377 platforms using standard protocols. Where SNPs were identified in the initial samples, further samples were sequenced so that 62 subjects in total had been analyzed, with the exception of when the MAF was <5% in the initial sequencing.
Genotyping and quality control.
Genotyping was performed by Kbiosciences (Herts, U.K.) using modified TAQMAN assays (available at http://www.kbioscience.co.uk). Each 1,536-well plate contained a mixture of case, control, and negative control subjects. The presence of 10% duplicate samples revealed that the genotyping accuracy was 99.7%. The genotyping success rate was 99% for control subjects, 97% for case subjects, and 95% for families. After excluding families with obvious relationship inconsistencies (as determined by the genotyping of an additional 43 SNPs), the Mendelian inconsistency error rate was 0.0004. All SNPs were in Hardy-Weinberg equilibrium in both case and control subjects (
2 P > 0.01). The three groups that made up the case subjects and the two groups that made up the control subjects did not differ significantly in frequency (all individual SNP/study genotype and allele frequencies are presented in supplementary Table 2 of the online appendix).
Haplotype construction and assessment of LD.
Pairwise LD estimates were calculated using GOLD (25). Haplotype construction was performed using SNPHAP (version 1.2; available at http://www-gene.cimr.cam.ac.uk/clayton/software/snphap.txt) and PHASE (26).
Association study statistical analysis.
ORs and P values were determined for our case-control analyses using
2 tests. We used COCAPhase (27) to estimate haplotype frequencies and perform tests of haplotype association. PHASE (26) produced very similar haplotype frequencies. To analyze our family data, we used the FBAT program (27) (available at http://www.biostat.harvard.edu/
fbat/default.html). The TDT/sibTDT program (28) produced very similar results. To estimate the ORs for the families with parents and unaffected siblings available, we used the discordant allele test (29). To estimate the ORs for the case-control and family-based combined analysis we combined ORs from the discordant allele, TDT, and case-control studies using Mantel-Haenszel meta-analysis (13). Approximate power for allelic ORs for each of the SNPs was calculated using the power calculator at the University of California, Los Angeles website (available at http://calculators.stat.ucla.edu/powercalc/). To include the family-based subjects in the power approximations, we included probands as case subjects and parents as control subjects. The significance level was set to 0.01 for a range of allele frequencies.
A98V meta-analysis methods.
To identify all relevant published studies and abstracts, we searched Pubmed, ISI Web Of Science, and Google using combinations of the following key words: A98V, Ala98Val, 98, TCF1, HNF1, diabetes, type 2 diabetes, genetic association, maturity-onset diabetes of the young, and SNP. Because there were zero alleles in the case group from the Rissanen et al. (20) study, we used the Haldane correction to plot the 95% CIs and exact methods for pooled OR estimation and significance testing (30). Although a funnel plot suggested that publication bias was not a major problem, we cannot exclude the possibility that such bias exists. We used the Q statistic to test for heterogeneity and the Mantel-Haenszel method to combine the studies.
| ACKNOWLEDGMENTS |
|---|
We thank Wendy Winckler, David Altshuler, and colleagues, who kindly shared information on their analysis of HNF1
throughout this study.
| FOOTNOTES |
|---|
Received for publication February 7, 2005 and accepted in revised form May 17, 2005
| REFERENCES |
|---|
|
|
|---|
gene predicts susceptibility to type 2 diabetes. Diabetes 53:1141–1149, 2004
gene on chromosome 20q is associated with type 2 diabetes and appears to contribute to the evidence for linkage in an Ashkenazi Jewish population. Diabetes 53:1134–1140, 2004
P2 promoter are associated with type 2 diabetes in the U.K. population. Diabetes 53:3002–3006, 2004
gene is associated with reduced serum C-peptide and insulin responses to an oral glucose challenge. Diabetes 46:912–916, 1997[Abstract]
gene in Danish Caucasians with late onset NIDDM. Daibetologia 40:473–475, 1997
and -4
genes in Finnish and Chinese subjects with late-onset type 2 diabetes. Diabetes Care 23:1533–1538, 2000[Abstract]
genes contribute to glucose intolerance in a South Indian population. Diabetes 53:2122–2125, 2004
gene region with risk of type 2 diabetes. Diabetes 54:2336–2342, 2005
coding mutations are an uncommon contributor to early-onset type 2 diabetes in Ashkenazi Jews. Diabetes 47:967–969, 1998[Medline]This article has been cited by other articles:
![]() |
M. M. Sale, S. G. Smith, J. C. Mychaleckyj, K. L. Keene, C. D. Langefeld, T. S. Leak, P. J. Hicks, D. W. Bowden, S. S. Rich, and B. I. Freedman Variants of the Transcription Factor 7-Like 2 (TCF7L2) Gene Are Associated With Type 2 Diabetes in an African-American Population Enriched for Nephropathy Diabetes, October 1, 2007; 56(10): 2638 - 2642. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Winckler, M. N. Weedon, R. R. Graham, S. A. McCarroll, S. Purcell, P. Almgren, T. Tuomi, D. Gaudet, K. B. Bostrom, M. Walker, et al. Evaluation of Common Variants in the Six Known Maturity-Onset Diabetes of the Young (MODY) Genes for Association With Type 2 Diabetes Diabetes, March 1, 2007; 56(3): 685 - 693. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. J. Willer, L. L. Bonnycastle, K. N. Conneely, W. L. Duren, A. U. Jackson, L. J. Scott, N. Narisu, P. S. Chines, A. Skol, H. M. Stringham, et al. Screening of 134 Single Nucleotide Polymorphisms (SNPs) Previously Associated With Type 2 Diabetes Replicates Association With 12 SNPs in Nine Genes Diabetes, January 1, 2007; 56(1): 256 - 264. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. N. Weedon, B. Shields, G. Hitman, M. Walker, M. I. McCarthy, A. T. Hattersley, and T. M. Frayling No Evidence of Association of ENPP1 Variants With Type 2 Diabetes or Obesity in a Study of 8,089 U.K. Caucasians Diabetes, November 1, 2006; 55(11): 3175 - 3179. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. L. Bonnycastle, C. J. Willer, K. N. Conneely, A. U. Jackson, C. P. Burrill, R. M. Watanabe, P. S. Chines, N. Narisu, L. J. Scott, S. T. Enloe, et al. Common Variants in Maturity-Onset Diabetes of the Young Genes Contribute to Risk of Type 2 Diabetes in Finns Diabetes, September 1, 2006; 55(9): 2534 - 2540. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. L. Gloyn, D. J.G. Mackay, M. N. Weedon, M. I. McCarthy, M. Walker, G. Hitman, B. A. Knight, K. R. Owen, A. T. Hattersley, and T. M. Frayling Assessment of the Role of Common Genetic Variation in the Transient Neonatal Diabetes Mellitus (TNDM) Region in Type 2 Diabetes: A Comparative Genomic and Tagging Single Nucleotide Polymorphism Approach. Diabetes, August 1, 2006; 55(8): 2272 - 2276. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Winckler, N. P. Burtt, J. Holmkvist, C. Cervin, P. I.W. de Bakker, M. Sun, P. Almgren, T. Tuomi, D. Gaudet, T. J. Hudson, et al. Association of Common Variation in the HNF1{alpha} Gene Region With Risk of Type 2 Diabetes Diabetes, August 1, 2005; 54(8): 2336 - 2342. [Abstract] [Full Text] [PDF] |
||||
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| Diabetes | Diabetes Care | Clinical Diabetes | Diabetes Spectrum |