Diabetes 57:229-234, 2008 DOI: 10.2337/db07-0289 © 2008 by the American Diabetes Association
Further Evidence of a Primary, Causal Association of the PTPN22 620W Variant With Type 1 Diabetes![]()
1 Dipartimento di Scienze Biomediche, University of Sassari, Sassari, Italy Address correspondence and reprint requests to Francesco Cucca, Cattedra di Genetica Medica, Dipartimento di Scienze Biomediche, Università di Sassari, Viale S. Pietro, 07100 Sassari, Italy. E-mail: fcucca{at}uniss.it
Key Words: AFBAC, affected family–based control DIP, deletion insertion polymorphism HLA, human leukocyte antigen Lyp, lymphoid protein tyrosine phosphatase MAF, minor allele frequency SNP, single nucleotide polymorphism TDT, transmission disequilibrium test
OBJECTIVE— The minor allele of the nonsynonymous single nucleotide polymorphism (SNP) +1858C>T within the PTPN22 gene is positively associated with type 1 diabetes and other autoimmune diseases. Genetic and functional data underline its causal effect, but some studies suggest that this polymorphism does not entirely explain disease association of the PTPN22 region. The aim of this study was to evaluate type 1 diabetes association within this gene in the Sardinian population. RESEARCH DESIGN AND METHODS— We resequenced the exons and potentially relevant portions of PTPN22 and detected 24 polymorphisms (23 SNPs and 1 deletion insertion polymorphism [DIP]), 8 of which were novel. A representative set of 14 SNPs and the DIP were sequentially genotyped and assessed for disease association in 794 families, 490 sporadic patients, and 721 matched control subjects. RESULTS— The +1858C>T variant, albeit rare in the general Sardinian population (allele frequency 0.014), was positively associated with type 1 diabetes (Pone tail = 3.7 x 10–3). In contrast, the background haplotype in which this mutation occurred was common (haplotype frequency 0.117) and neutrally associated with disease. We did not confirm disease associations reported in other populations for non +1858C>T variants (rs2488457, rs1310182, and rs3811021), although they were present in appreciable frequencies in Sardinia. Additional weak disease associations with rare variants were detected in the Sardinian families but not confirmed in independent case-control sample sets and are most likely spurious. CONCLUSIONS— We provide further evidence that the +1858C>T polymorphism is primarily associated with type 1 diabetes and exclude major contributions from other purportedly relevant variants within this gene.
A large body of experimental evidence suggests that most cases of type 1 diabetes result from the autoimmune attack of T-cells on insulin-producing pancreatic β-cells. The lifetime disease risk for a monozygotic twin of an affected patient is The major histocompatibility complex/human leukocyte antigen (HLA) region on chromosome 6p21 contains the major component of disease-inherited risk (5,6). A second disease locus has been located (7) and fine mapped on chromosome 11p15.5 to a minisatellite (variable number of tandem repeats) locus in the insulin gene promoter region (8). Over the last few years, several new non-HLA, non-INS genes and variants have been found to be associated with type 1 diabetes (9,10). Among these, a +1858C>T polymorphism, localized in exon 14 of the PTPN22 gene (1p13.3), shows consistent remarkable size effects for type 1 diabetes susceptibility (odds ratio [OR] constantly >1.5 in all populations in which it is present at appreciable frequencies) (9,11,12). This variant is also positively associated with some other autoimmune diseases, including rheumatoid arthritis (13), Graves disease (14), and systemic lupus erythematosus (15), but not with other autoimmune disorders such as multiple sclerosis, celiac disease, Crohn's disease, and psoriasis (16,17). Although the reasons for the different association with distinct autoimmune traits are still unclear, the available data pose the +1858C>T variant as one of the few non-HLA polymorphisms with relevant size effects in the inherited risk for type 1 diabetes. This variant causes an amino acid change from arginine (R) to tryptophan (W) in codon 620 of the lymphoid protein tyrosine phosphatase (Lyp), which has a critical negative role on T-cell activation (18). Experimental evidence suggests that the Lyp-W620 variant causes a gain of physiological function of the wild-type phosphatase; that is, W620 has more negative regulatory activity than the more common wild-type R620 allele (19,20). The primary role of the +1858C>T variant in type 1 diabetes is also supported by a detailed haplotype analysis in a familial type 1 diabetes sample set of northern European origin (21). However, this study described some preliminary evidence of disease association of the very rare (minor allele frequency [MAF] = 0.006) +2250G>C, K750N variant. This variant was independent of +1858C>T and resulted in an RNA product in which exon 17 had been spliced directly to exon 19, with exon 18 containing position 2250 excluded (21). Furthermore, another study did not detect the +1858C>T variant in large Asian sample sets and suggested that another polymorphism located in the promoter region (rs2488457) was associated with the classical form of type 1 diabetes in both Asian- and European-derived family sample sets (22). After comparing the strength of disease association of the +1858C>T and rs2488457 single nucleotide polymorphisms (SNPs) in a European collection of families, the authors concluded that disease association was stronger at rs2488457 and suggested that this SNP was a more likely causative variant within the PTPN22 gene (22). Finally, after a comprehensive resequencing of the gene and conditional association analysis, Carlton et al. (23) also suggested that +1858C>T by itself is unlikely to account for all association observed between PTPN22 and rheumatoid arthritis. Three SNPs, located on intron 16 (ss38346943), in the 3'-untranslated (rs3811021) region and 1,496 bases downstream of PTPN22 at the 5' end of the nearby RSBN1 gene (rs3789604), were associated with rheumatoid arthritis independently of +1858C>T (23). Previous work found the +1858C>T mutation to be relatively rare in the general Sardinian population (9). This raises the possibility that other mutations in PTPN22 could be mediating type 1 diabetes risk in Sardinia. Here, we report a fine-mapping association analysis of the PTPN22 gene in type 1 diabetes sample sets from Sardinia. The aim of the study was to clarify the results of previous studies and determine whether additional, non +1858C>T variants are independent contributors to the inherited risk of type 1 diabetes in this island population.
We used a primary sample set consisting of 794 type 1 diabetic families, all of Sardinian origin. These families included 693 simplex families with one affected child and 101 multiplex families with at least two affected children. The average age of the patients at disease onset was 8.8 ± 5.5 years (±SD) with a range of 0.5–41 years. We also used an additional sample set of 490 Sardinian sporadic patients and 721 ethnically matched control subjects. These control subjects were healthy blood donor volunteers. The study was approved by the ethic committees of the Universities of Cagliari and Sassari. All participating individuals and their parents or legal guardians signed a statement of informed consent.
Resequencing.
Genotyping. Partial results of the transmission disequilibrium test (TDT) analysis for the +1858C>T variant in a subset of the type 1 diabetic families have been reported elsewhere (20).
Statistical analysis.
Parental haplotypes were reconstructed using the best option of the Merlin package, version 1.1- To determine the genetic architecture of the PTPN22 gene in the Sardinian population, the linkage disequilibrium measure r2 was calculated on parental genotypes between every pair of markers (30,31). The measure was built in a standard fashion from the basic pairwise-disequilibrium coefficient, D, computed using the HAPLOXT program in the GOLD software package (www.sph.umich.edu/csg/abecasis/GOLD). Intermarker pairwise r2 ranges from 0 to 1, with a value of 0 reflecting full independence between alleles at the two loci compared and a value of one occurring only when the alleles at the two marker loci always coincide.
Twenty-three SNPs and one DIP in the promoter region were detected after resequencing of the exons, the exon-intron boundary, and the region upstream and downstream of the gene in 64 parents of type 1 diabetic patients (Table 1). Eight of these variants (and the DIP) are still not present in public databases.
Fifteen markers spanning a region of 60,774 bp of the PTPN22 gene region were initially genotyped in 694 type 1 diabetic families. These 15 variants included 11 of the polymorphisms detected with the resequencing of the gene (rs2488457, ACTC DIP, rs1217419, rs1217418, rs3789608, rs2476602, ss38346945, rs1970559, rs3761935, ss38346943, and rs3811021), three additional SNPs that were genotyped before the resequencing stage (rs1235005, rs2476601, and rs2476599), and one variant (rs1310182) of prior interest (23) not included in the segments of the gene we resequenced. TDT analysis was performed for the 15 genotyped markers, and 4 of them were found to be positively associated with type 1 diabetes at a 5% level of significance (Table 2). These included the established +1858C>T polymorphism in exon 14 (marker 9 on Table 2) and three additional variants: allele A of ss38346945 (marker 8 on Table 2), allele C of ss38346943 (marker 13 on Table 2), and allele T of rs2476599 (marker 14 on Table 2), localized in exon 10 (+789G>A), in intron 17 (IVS17 +47T>C), and in intron 19 (IVS19 –1183C>T), respectively. The rs2488457/–1123G SNP (marker 2 on Table 2), found to be associated with type 1 diabetes in previous studies (21,22), is present and common in the Sardinian population (AFBAC frequency 0.128). Despite having >78% power to detect a primary association even as weak as an OR = 1.3, at a significance level of 5%, there was no evidence of association in our sample set (Table 2). Similarly, allele T of rs1310182 and allele C of rs3811021 (markers 11 and 15 on Table 2), which were found to be associated with rheumatoid arthritis in another study (23), were not associated with type 1 diabetes in our sample set (Table 2), although the powers to detect association at P = 0.05 assuming an OR 1.3 were 96 and 80%, respectively, for the two variants.
We then genotyped the four variants showing evidence of association in our survey study in the Sardinian population in an additional 100 type 1 diabetic families (380 individuals) collected during the course of the study. Evidence of association at the 5% level was still detected for all four alleles in this enlarged sample set (Table 3).
We next performed a haplotype analysis in all 15 variants genotyped in families. Considering haplotypes with a frequency >0.008 in the parental chromosomes, we detected seven major haplotypes accounting for >95% of haplotypes present in the familial sample set. Allele T of marker 9 (+1858C>T) was contained in just one haplotype (haplotype 6 in Table 4), which was positively associated with disease (P = 0.036). The background wild-type haplotype (haplotype 1 in Table 4) in which the +1858C>T mutation apparently occurred was common in this Sardinian sample set (AFBAC frequency 0.117) and was neutrally associated with type 1 diabetes. Finally, also a haplotype containing markers 8, 13, and 14 (haplotype 7 in Table 4) showed some degree of significant association with type 1 diabetes in this familial sample set (P = 0.016). Allele T of marker 14 was only associated with disease when present together with allele A of marker 8 and allele C of marker 13 on the same haplotype.
Genetic relationships between the variants at these 15 markers are also illustrated by the pairwise linkage disequilibrium of their MAFs measured by r2 values (Fig. 1). The critical allele T of marker 9 was independent from all other genotyped variants. Moreover, allele A of marker 8 and allele C of marker 13 on the one hand, and allele T of marker 7, allele G of marker 10 and allele T of marker 14 on the other hand, constituted distinct haplotype blocks (r2 > 0.8). The same patterns of linkage disequilibrium were also detected on samples of northern European origin (23).
To further investigate the putative novel evidence of disease association, we then assessed allele A of marker 8 and allele C of marker 13 in an additional collection of sporadic patients and matched control subjects from the same population. However, in this independent sample set, these markers showed nearly overlapping frequencies in patients and in control subjects; allele A of marker 8 was present in 27 of 962 fully genotyped patients chromosomes (0.028) and in 39 of 1,442 control subjects chromosomes (0.027), whereas allele C of marker 13 was counted in 36 of 976 patients chromosomes (0.036) and in 54 of 1,442 control subjects chromosomes (0.037).
Using a collection of type 1 diabetic families from Sardinia, we provide further evidence for a primary association with type 1 diabetes of the +1858C>T polymorphism within the PTPN22 gene. This is underlined by the fact that the background haplotype in which the +1858C>T mutation occurred is retained and still relatively common in the Sardinian population but neutrally associated with type 1 diabetes. The data are in full agreement with evidence from type 1 diabetic families of the Human Biological Data Interchange and the British Diabetes Association repositories (21). These results are also consistent with a reported primary association of +1858C>T with rheumatoid arthritis in two northern European–derived case-control sample sets from the U.S. (23). Primary association of the +1858T with type 1 diabetes is further supported by experimental data indicating the relevant functional consequences of this polymorphism at the protein level (19,20). Conversely, our observations are at odds with the results of a study suggesting that association of the +1858T was in fact secondary to association of a polymorphism, rs2488457 (marker 2 on Table 2), located in the promoter region of the gene and frequent in both the Asian and northern European populations (22). This latter variant was present and common in our Sardinian sample set but was not associated with type 1 diabetes. Note that in contrast with some other populations, the distribution of alleles +1858C>T and rs2488457 is informative in Sardinia, most likely owing to their distinct distribution on positively and neutrally associated haplotypes, respectively. Similar conclusions about the role of rs2488457 were recently reached by Chelala et al. (32) after analyzing a pooled French, Danish, and U.S. sample set. Although Carlton et al. (23) suggested an association of rheumatoid arthritis with polymorphisms rs1310182, rs3811021, and rs3789604, with the latter two alleles in nearly complete linkage disequilibrium with each other, when we genotyped two of these variants, rs1310182 (marker 11) and rs3811021 (marker 15), in the Sardinian families, we found no evidence of association with type 1 diabetes. This result is consistent with other recent studies concerning the association of these variants with rheumatoid arthritis and Graves disease (33,34). The recently described, positively associated rare variant +2250C causing the amino acid change K750N and alternative splicing with exclusion of exon 18 of PTPN22 (21) was not detected in the Sardinian samples resequenced and hence was not genotyped in the enlarged sample set. We did, however, obtain some initial evidence that variants within the PTPN22 gene, other than +1858C, were positively associated with type 1 diabetes independently of the +1858C>T variant. These included the minor alleles of ss38346945 (marker 8), which causes an amino acid change from arginine to glutamine in codon 263 of Lyp (R263Q) and of ss38346943 (marker 13), in strong linkage disequilibrium with each other. Nevertheless, these novel associations were borderline significant after correction for number of tests performed and were not confirmed by further analyses in independent case-control sample sets from Sardinia and from the U.K. (J.A. Todd and D. Smyth, personal communication). Furthermore, these variants did not show any significant evidence of association with rheumatoid arthritis in a previous study (23). It appears likely, therefore, that these inconsistent additional disease associations with non +1858C>T PTPN22 variants may in fact be spurious and related to stochastic fluctuations in the frequencies or in the transmission of variants to affected children. The role of the +2250G>C (K750N) variant remains to be evaluated in larger sample sets from other populations (22). We can conclude that a comprehensive analysis in a representative sample set from Sardinia provides further evidence that the +1858C>T is primarily associated with type 1 diabetes, even in a population in which this variant is rare. Furthermore, we excluded a major role for other potentially relevant variants within this gene. These results underline the need of the systematic replication of any evidence of disease association in sample sets offering adequate statistical power.
N.B. has received Juvenile Diabetes Research Foundation (JDRF) Grant 1-2005-342. F.C. has received Telethon-JDRF Grant GJT030477 and Ministero dellUniversità e della Ricerca (MIUR) Grant 2005060193 from the Italian Ministry of Scientific Research. We thank John Todd and Debbie Smyth for sharing unpublished data with us, Michael Whalen and John Todd for useful suggestions, Patrizia Zavattari for technical advice, and Rebecca Lewis for editorial assistance in the preparation of the manuscript. This study is dedicated to the memory of Chiara Perra.
Published ahead of print at http://diabetes.diabetesjournals.org on 12 October 2007. DOI: 10.2337/db07-0289. M.Z., C.P., and V.O. contributed equally to this work. The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. Received for publication March 1, 2007 and accepted in revised form October 7, 2007
|
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||