Diabetes 53:505-509, 2004 © 2004 by the American Diabetes Association, Inc.
Haplotype Tag Single Nucleotide Polymorphism Analysis of the Human Orthologues of the Rat Type 1 Diabetes Genes Ian4 (Lyp/Iddm1) and CblbJuvenile Diabetes Research Foundation/Wellcome Trust Diabetes and Inflammation Laboratory, Cambridge Institute for Medical Research, University of Cambridge, Addenbrookes Hospital, Cambridge, U.K
The diabetes-prone BioBreeding (BB) and Komeda diabetes-prone (KDP) rats are both spontaneous animal models of human autoimmune, T-cell-associated type 1 diabetes. Both resemble the human disease, and consequently, susceptibility genes for diabetes found in these two strains can be considered as potential candidate genes in humans. Recently, a frameshift deletion in Ian4, a member of the immune-associated nucleotide (Ian)-related gene family, has been shown to map to BB rat Iddm1. In the KDP rat, a nonsense mutation in the T-cell regulatory gene, Cblb, has been described as a major susceptibility locus. Following a strategy of examining the human orthologues of susceptibility genes identified in animal models for association with type 1 diabetes, we identified single nucleotide polymorphisms (SNPs) from each gene by resequencing PCR product from at least 32 type 1 diabetic patients. Haplotype tag SNPs (htSNPs) were selected and genotyped in 754 affected sib-pair families from the U.K. and U.S. Evaluation of disease association by a multilocus transmission/disequilibrium test (TDT) gave a P value of 0.484 for IAN4L1 and 0.692 for CBLB, suggesting that neither gene influences susceptibility to common alleles of human type 1 diabetes in these populations.
Development of diabetes in the BB rat involves at least three genes: Iddm1/lyp on chromosome 4, RT1u (at Iddm2) in the major histocompatibility complex (MHC) on chromosome 20, and a third unmapped gene (1,2). One unusual feature of this animal model is the severe lymphopenia that is essential for the development of the diabetic phenotype and that is inherited as a Mendelian trait (3). Life-long and profound T-cell lymphopenia is characterized by a reduction in peripheral CD4+ T-cells, an even greater reduction of CD8+ T-cells (4), and an almost total absence of RT6+ T-cells (5). The lymphopenia gene is involved in the regulation of apoptosis in the T-cell lineage and is, therefore, responsible for loss of critical T-cells, resulting in autoimmunity (6). Recently, two groups have independently shown, by positional cloning of Iddm1/lyp, that lymphopenia is due to a frameshift deletion in Ian4 (also called Ian5) of the immune-associated nucleotide (Ian)-related gene family (6,7), resulting in a truncated protein product. This deletion was only found in strains that have lymphopenia and diabetes (6). The human orthologue of Ian4 (IAN4L1) belongs to a family of at least 10 genes that encode GTP-binding proteins and are located in a 300-kb interval of human chromosome 7q36. The KDP rat was derived as a substrain of the Long-Evans Tokushima lean (LETL) rat and shows 100% development of moderate to severe insulitis within 220 days of age (8,9). The LETL rat is characterized by sudden onset of polyuria, polyphagia, hyperglycemia, weight loss, and autoimmune destruction of pancreatic B-cells, while showing no significant T-cell lymphopenia and no sex-specific differences in rate of onset or severity (8). As with the BB rat, the KDP rat possesses the diabetogenic RT1u haplotype, adding to its relevance as a model of type 1 diabetes. In addition to the MHC, another unlinked locus, Iddm/kdp1, is essential in the development of moderate to severe insulitis and the onset of diabetes (10). Iddm/kdp1 has been mapped to a nonsense mutation in CBLB (Casitas B-lineage lymphoma b, or Cas-Br-M murine ecotropic retroviral transforming sequence b), a gene shown to have a role in the regulation of tyrosine kinase signaling pathways (1114). This mutation results in the removal of 484 amino acids, including the proline-rich and leucine zipper domains of the protein, and is specific to the KDP rat and the original LETL strain. It is not found in the nondiabetic KND (Komeda nondiabetic) or LETO (Long-Evans Tokushima Otsuka) strains (15). Homozygous mice generated to be deficient in Cblb develop spontaneous autoimmunity, characterized by T- and B-cell infiltration of multiple organs (16). Taken together, this evidence suggests that Cblb is probably the disease susceptibility gene at Iddm/Kdp1 and, consequently, a major susceptibility gene for diabetes in the rat. We, therefore, resequenced both IAN4L1 and CBLB as candidates for human type 1 diabetes susceptibility. For IAN4L1, we resequenced the entire gene, covering 12.2 kb, comprising three exons and introns and 3 kb 3' and 5' of the gene in 32 type 1 diabetic subjects, identifying 30 single nucleotide polymorphisms (SNPs), 19 of which were novel (Table 1). Of the 30 SNPs, 7 were exonic: 1 in exon 1, which contains the 5' untranslated region, and 6 in exon 3. At CBLB, which extends over 230 kb (including three alternative, untranslated exon 1s), we resequenced 12.6 kb in 96 type 1 diabetic subjects, encompassing exons, intron/exon boundaries, and 2.5 kb 3' and 5' of the gene. From the CBLB sequence data, we identified 37 polymorphisms, of which 26 were novel (Table 2). These comprised 32 SNPs and five insertion/deletions. Of the 37 polymorphisms, 7 were exonic: 1 in each of exons 6, 9, 11, and 12 and 3 in exon 10. However, no nonsynonymous variants were observed in either gene, nor were there any other obvious candidates for variants that might change function or expression (Tables 1 and 2). For CBLB, we were unable to sequence exons 18, 1A, or 1B (although we covered 135 of 195 bp of exon 1C), and consequently, it was not possible to fully represent them directly with our haplotype tag SNP (htSNP) selection.
From the 21 polymorphisms in CBLB and 25 in IAN4L1 with allele frequencies >3%, we selected nine htSNPs for each, capturing the allelic variation within the genes with a minimum R2 of 0.8 (Tables 1 and 2), using the htSNP selection method described by Chapman et al. (17). To further reduce genotyping costs, we adopted a two-stage strategy, in which we only proceed to the second stage of genotyping if the results from the first stage offered some possibility of an overall significant result. In stage 1, a collection of 754 affected sib-pairs, comprising 472 U.K. and 282 U.S. multiplex type 1 diabetic families (equivalent to 1,400 trios; set 1), are genotyped and tested for association using the multilocus TDT, which tests for association between disease and htSNPs due to linkage disequilibrium (LD) with one or more causal variants (17). Transmissions of SNP alleles not genotyped in stage 1 can also be predicted using multiple regression equations computed in the course of htSNP selection from the initial sequencing data (17). Stage 2, genotyping in 1,708 additional families (set 2) only proceeds if the stage-1 multilocus TDT P value is <0.1. By setting a threshold P value relatively high at the first stage, in order to avoid rejecting true positives, little power is lost when compared with a single-stage approach. After genotyping of set 2, statistical analysis is performed on the entire dataset (2,462 families). Given the currently available sample collection and the two-stage strategy adopted, we have over 90% power to detect an association with P = 1 x 10-4, assuming a relative risk of 1.5 conferred by each copy of the causal allele and a population frequency of the causal allele of 0.1, regardless of whether genotyping proceeds to stage 2. Approaches to the statistical analysis of htSNPs have been described by Chapman et al. (17). It was demonstrated that in regions of strong LD, simple models considering only the main effects of htSNP genotypes were optimal or near optimal for detecting disease association. Consequently, the multilocus TDT is considered the most appropriate test. In stage 1, the multilocus TDT P value for association between type 1 diabetes and IAN4L1 was 0.484 and for CBLB was 0.692. Therefore, we did not proceed to genotype the additional set 2 families in either gene. To illustrate the predictions of ungenotyped markers that are possible using this new approach, Tables 1 and 2 include single-locus tests for all the common polymorphisms in set 1 families.
These results suggest that common alleles of IAN4L1 and CBLB do not contribute significantly to the familial clustering of human type 1 diabetes in the two populations analyzed. We cannot exclude the possibility that a common variant exists in either gene with an effect that is too small to be detected in a study of this size or that there is an unidentified polymorphism that is in much weaker LD with the htSNPs we analyzed. Had we genotyped all identified markers, our probability of detecting disease association would not have been substantially increased. Large introns and more extensive flanking DNA regions can be analyzed for association in the future by using the genome-wide SNP map that is under construction (18). By adopting an htSNP and a two-stage strategy, these candidate genes were quickly and economically evaluated for association with type 1 diabetes. This approach has allowed us to significantly reduce the genotyping burden (by
The 754 type 1 diabetic families were white European or of Caucasian European descent, with two parents and at least one affected child (472 Diabetes U.K. Warren 1 multiplex [21] and 282 multiplex ascertained in the U.S., obtained from the Human Biological Data Interchange [22]).
SNP identification and genotyping. Genotyping was performed using either Taqman MGB chemistry (Applied Biosystems) (23) or the Invader biplex assay (Third Wave Technologies, Madison, WI) (24). All genotyping data were double scored to minimize error. All SNP sequences are in dbSNP; sequencing and genotyping data can be obtained upon request (http://www-gene.cimr.cam.ac.uk/todd/human_data.shtml).
Annotation.
Statistical analysis.
This work was funded by the Wellcome Trust and the Juvenile Diabetes Research Foundation International. We thank the Human Biological Data Interchange and Diabetes U.K. for U.S. and U.K. multiplex families, respectively.
F.P. and D.J.S. contributed equally to this study. Address correspondence and reprint requests to John A. Todd, Juvenile Diabetes Research Foundation/Wellcome Trust Diabetes and Inflammation Laboratory, Cambridge Institute for Medical Research, University of Cambridge, Wellcome Trust/MRC Building, Addenbrookes Hospital, Cambridge CB2 2XY, U.K. E-mail: john.todd{at}cimr.cam.ac.uk Received for publication September 24, 2003 and accepted in revised form October 31, 2003
Abbreviations: htSNP, haplotype tag single nucleotide polymorphism; LD, linkage disequilibrium; MHC, major histocompatibility complex; SNP, single nucleotide polymorphism
This article has been cited by other articles:
|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||