Diabetes 57:1753-1756, 2008 DOI: 10.2337/db07-1402 © 2008 by the American Diabetes Association
Sequencing-Based Genotyping and Association Analysis of the MICA and MICB Genes in Type 1 DiabetesFrom the Juvenile Diabetes Research Foundation/Wellcome Trust Diabetes and Inflammation Laboratory, Department of Medical Genetics, Cambridge Institute for Medical Research, University of Cambridge, Addenbrooke's Hospital, Cambridge, U.K Corresponding author: John A. Todd, JDRF/WT Diabetes and Inflammation Laboratory, Cambridge Institute for Medical Research, Wellcome Trust/MRC Building, Addenbrooke's Hospital, Hills Road, Cambridge, CB2 0XY, U.K. E-mail: john.todd{at}cimr.cam.ac.uk
Abbreviations:
LD, linkage disequilibrium; MHC, major histocompatibility complex; MIC, MHC class I chain-related molecule, nsSNP, nonsynonymous single nucleotide polymorphism; rpart, recursive partitioning; SBT, sequencing-based typing; SNP, single nucleotide polymorphism; STR, short tandem repeat
OBJECTIVE— The nonclassical major histocompatibility complex (MHC) class I chain-related molecules (MICs), encoded within the MHC, function in immunity. The transmembrane polymorphism in MICA (MICA-STR) has been reported to be associated with type 1 diabetes. In this study, we directly sequenced both of the highly polymorphic MIC genes (MICA and MICB) in order to establish whether they are associated with type 1 diabetes independently of the known type 1 diabetes MHC class II genes HLA-DRB1 and HLA-DQB1. RESEARCH DESIGN AND METHODS— We developed a sequencing-based typing method and genotyped MICA and MICB in 818 families (2,944 individuals) with type 1 diabetes from the U.K. and U.S. (constructing the genotype from single nucleotide polymorphisms in exons 2–4 of MICA and 2–5 of MICB) and additionally genotyped the MICA-STR in 2,023 type 1 diabetic case subjects and 1,748 control subjects from the U.K. We analyzed the association of the MICA and MICB alleles and genotypes with type 1 diabetes using regression methods. RESULTS— We identified known MICA and MICB alleles and discovered four new MICB alleles. Based on this large-scale and detailed genotype data, we found no evidence for association of MICA and MICB with type 1 diabetes independently of the MHC class II genes (MICA P = 0.08, MICA-STR P = 0.76, MICB P = 0.03, after conditioning on HLA-DRB1 and HLA-DQB1). CONCLUSIONS— Common MICA and MICB genetic variations including the MICA-STR are not associated, in a primary way, with susceptibility to type 1 diabetes. Approximately 50% of the familial clustering of type 1 diabetes is attributable to the major histocompatibility complex (MHC) region on human chromosome 6p21, and the MHC class II genes HLA-DRB1 and HLA-DQB1 account for a large proportion of this clustering (1). However, association studies in the MHC region indicate that other genes have effects in type 1 diabetes independently of the class II genes (2–4). Identification of these genes is complicated because genes in the MHC have multiple alleles and exhibit extensive linkage disequilibrium (LD), and the class II genes HLA-DQB1 and HLA-DRB1 show complicated dominance and epistatic effects. Importantly, any putative new effect must be distinguished from the effect of these class II genes (2,3).
The nonclassical MHC class I chain-related molecule (MIC) genes MICA and MICB are located 46.4 and 141.2 kb, respectively, centromeric of HLA-B and
MICA and MICB share 83% DNA sequence homology. Each gene consists of six exons: exon 1, which codes for a signal peptide, exons 2–4 for three extracellular domains ( 1, 2, and 3), exon 5 for a transmembrane domain, and exon 6 for a carboxy-terminal cytoplasmic tail (5).
Genotyping of MICA-STR.
SBT of MICA and MICB. We aligned and scored sequence reads using the Gap4 program from the Staden package (http://www.mrc-lmb.cam.ac.uk/pubseq/). To increase throughput we included a consensus trace in the alignment. Sequence data are not phased, and therefore a method was needed to determine which pair of alleles was present. While there are many nsSNPs in the MIC genes, they give rise to relatively few alleles per gene. The common alleles are listed on the IMGT/HLA Sequence Database (http://www.anthonynolan.org.uk/HIG/data.html) (18). Since the possible genotypes were limited and known, we used in-house software similar to the Helmberg Score software (19). Positions that varied were extracted from the static alignments of the IMGT/HLA Sequence Database. The alleles were reduced to just those SNPs that are present in the regions we sequenced. A file was then constructed of all possible pairs of MICA or MICB alleles. Our sequence data were extracted in the same format, and the two files were compared. Ambiguity created by missing SNP values was clarified by examining familial relationship and the MICA-STR data. We validated this method in a reference panel of 40 cell lines received from the International Histocompatibility Working Group (IHWG) International Cell and Gene Bank (http://www.ihwg.org/cellbank/dna/refpan.html) and eight artificial heterozygotes made by pooling DNA from different homozygous cell lines. We obtained 100% concordance with the published MICA genotypes.
Sample populations.
Statistical analysis. Each MIC locus was first tested for association with type 1 diabetes without conditioning on the MHC class II loci. Sets of case and matched pseudo-control subjects were generated from the affected sib-pair families and analyzed using conditional logistic regression (25) with Huber-White sandwich estimators to correct for the nonindependence of siblings. In the case-control collection, case and control subjects were matched to 12 broad geographical regions across the U.K. (Northern, East and West Ridings, North Midland, Eastern, Southeastern, Southern, Southwestern, Wales, Midlands, Scotland, London, and Northwestern) (22) and analyzed using logistic regression (2). The alleles of the MIC loci were modeled assuming a multiplicative mode of inheritance.
To establish whether any observed associations of MICA and MICB were independent of, and not due to LD with, the class II genes HLA-DRB1 and HLA-DQB1, the effects of these highly associated loci must be taken into account in the analysis of MICA and MICB. The overall number of class II alleles, which includes many rare alleles, and the well-established nonmultiplicative effects of these loci complicate their modeling. We found that rpart was an effective method to model the genotypes of HLA-DRB1 and HLA-DQB1 (2). This is a risk-based grouping method that categorizes individuals as affected or unaffected based on their MHC class II genotypes (2). The resulting groups of MHC class II genotypes define strata within which additional, non–class II associations can be tested, using either conditional logistic regression (families) or logistic regression (case/control) (2). The rpart method was performed both in the case/controls and in the families (cases/pseudo-controls). The additional MHC class II independent effect of each MIC locus was tested by adding the alleles of the test locus to the logistic model and testing their independent effects by a Wald test (families) or a likelihood ratio test (case/control). Note that all the alleles of MICA or of MICB were included in the logistic model, which allows testing of the conditional association of the locus and calculation of the conditional relative risks (RRs)/odds ratios (ORs) of the individual alleles simultaneously, relative to a common reference allele. For the most common allele of the MICA-STR (MICA-STR*A5.1), which has a minor allele frequency of 0.5, our power to detect an effect size of OR 2 was >98% at
Alleles and haplotypes. In MICA, 40 SNPs plus the MICA-STR yielded 55 distinct alleles in IMGT/HLA release 2.8 (Jan 2005) (18). We limited ourselves to SNPs in exons 2–4 so that MICA*009 and MICA*049 were indistinguishable, as were MICA*027 and MICA*048. Genotypes of the three loci were in Hardy-Weinberg equilibrium in unaffected parents and control subjects (P > 0.05). Frequencies for the MICA and MICB alleles were consistent in the U.K. and U.S. families and similar to those reported in other European populations (9,10,15,16). In online appendix Table 1, we list the combinations of MICA gene alleles and MICA-STR alleles observed in this dataset.
Novel alleles.
Association with type 1 diabetes. We analyzed the MICA and MICB alleles for association with type 1 diabetes in the family collections and the MICA-STR in the family and case control collections. For each locus we calculated two statistics: the P value of association for each locus overall and the RR/OR for each individual allele. When the MIC loci are analyzed without conditioning on HLA-DRB1 and HLA-DQB1, several alleles of both genes showed a strong association with type 1 diabetes (MICB P = 4.21 x 10–19, MICA P = 3.74 x 10–15, MICA-STR P = 5.44 x 10–9). However, if the effect of these genes is conditioned on HLA-DRB1 and HLA-DQB1, the association with type 1 diabetes disappears (MICB P = 0.03, MICA P = 0.08, MICA-STR P = 0.76). Note that P = 0.03 cannot be considered as evidence of association because it is produced as a result of using a Wald test that is artificially biased toward the alternative hypothesis with rare data (rare alleles, below 0.01, were grouped together, the resulting group had frequency 0.014). Use of a likelihood ratio test without robust variance estimates produces a less significant P value (P = 0.1). In Table 2 we present the RR for the alleles of MICA and MICB in the U.K. and U.S. families using the most common alleles (MICA*008 and MICB*005) as reference, as they give the tightest 95% CIs. Since none of the other alleles were significantly different to the reference, the reference alleles themselves are also not associated. MICA-STR was also not associated with type 1 diabetes after conditioning on HLA-DRB1 and HLA-DQB1 in the case/control collection (P = 0.65 for the individual alleles and P = 0.91 for the genotypes) or in the families (P = 0.76) (Table 3). We also tested the association of MICA-STR*A5.1 genotypes against all the other genotypes grouped together. They were not associated with type 1 diabetes risk after conditioning (Table 4). We found that all MICA and MICB alleles were in strong LD (D' >0.9) with one or more HLA-DRB1 or HLA-DQB1 alleles (online appendix Tables 2 and 3).
Our SBT method allows for large-scale and detailed genotyping of the full MICA and MICB alleles. SBT also allows discovery of novel alleles and, indeed, we found four new rare variants for MICB. These results illustrate the high level of allelic variability found in the MICA and MICB genes, implying that this variability has yet to be fully characterized. Previous and ongoing studies of type 1 diabetes associations, including the Type 1 Diabetes Genetics Consortium (www.t1dgc.org) and the IHWG (http://www.ncbi.nlm.nih.gov/projects/mhc/ihwg.cgi?cmd=DS&ID=11), have either omitted MICA and MICB or focused on the MICA-STR because of the costs and complications associated with obtaining the full genotypes of MICA and MICB. Many published reports have identified one or more of the alleles or genotypes of the MICA-STR as being associated with type 1 diabetes. However, some of these studies did not condition adequately on the MHC class II genes (12–14), while others used subgroup analysis to analyze the MICA-STR in samples chosen to have particular class II genotypes (7–9,11). This subgrouping causes a multiple testing problem and reduces the sample set and hence the power to detect MHC class II–independent effects. We conclude that, in the European populations analyzed here, the common MICA and MICB alleles are unlikely to affect type 1 diabetes susceptibility, nor are they markers for the strong independent associations of the common alleles of HLA-B (HLA-B*39) and HLA-A (2).
This work was funded by the Wellcome Trust (WT), the Juvenile Diabetes Research Foundation (JDRF), the Medical Research Council, and the National Institute for Health Research Cambridge Biomedical Resource Center. We acknowledge use of DNA from the British 1958 Birth Cohort collection (D. Strachan, P. Barton, S. Ring, W. McArdle, and M. Pembrey), funded by Medical Research Council Grant G0000934 and Wellcome Trust Grant 068545/Z/02, and DNA from the Human Biological Data Interchange and Diabetes UK for the U.S. and U.K. multiplex families, respectively. We thank B. Widmer, H. Stevens, and the JDRF/WT Diabetes and Inflammation Laboratory DNA team for collecting and preparing the type 1 diabetes case samples and D. Smyth and R. Bailey for help with genotyping. S.N. is a Diabetes Research and Wellness Foundation Non-Clinical Fellow.
Published ahead of print at http://diabetes.diabetesjournals.org on 10 March 2008. DOI: 10.2337/db07-1402. Additional information for this article can be found in an online appendix at http://dx.doi.org/10.2337/db07-1402. The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. Received for publication October 3, 2007 and accepted in revised form February 29, 2008
|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||