Abstract
Background We undertook a genome wide single nucleotide polymorphism analysis of a spectrum of patients with myelodysplastic syndrome del(5q) in order to investigate whether additional genomic abnormalities occur. Single nucleotide polymorphism array analysis has been shown to detect not only gene deletions but also regions of uniparental disomy that can pinpoint particular regions for mutation analysis.Design and Methods We studied 42 cases of myelodysplastic syndrome with del(5q), comprising 21 patients with 5q- syndrome and 21 with del(5q) (not 5q- syndrome), and 45 healthy controls by genome wide single nucleotide polymorphism analysis with the 50K Affymetrix single nucleotide polymorphism arrays.Results The del(5q) was characterized in all cases. The commonly deleted region of the 5q- syndrome extends between the genes SH3TC2 (proximal boundary) and GLRA1 (distal boundary) and measures 2.9 Mb. Copy number changes in addition to the del(5q) were observed in 10 of 21 patients with del(5q) myelodysplastic syndrome but in none of the patients with the 5q- syndrome. A total of 63 regions of uniparental disomy greater than 2 Mb were detected in 40 of 42 patients, dispersed on 18/23 chromosomes. In the 5q- syndrome group 31 regions of uniparental disomy were identified in 19 of 21 patients, the largest one being 7.6 Mb. All 21 patients with del(5q) myelodysplastic syndrome had uniparental disomy; in total 32 regions of uniparental disomy were identified in the 21 patients, including six regions of uniparental disomy > 10 Mb. Eight recurrent regions of uniparental disomy were observed among the 42 patients. For eight patients we had T-cell DNA as a germline control and four recurrent regions of uniparental disomy were identified that were present only in the neutrophil and not T-cell DNA. One small region of uniparental disomy at 10p12.31-p12.2 was observed in four patients with the 5q- syndrome.Conclusions This study shows that regions of uniparental disomy greater than 2 Mb are found in the 5q-syndrome and del(5q) myelodysplastic syndrome, although large regions of uniparental disomy (>10 Mb) are only found in the latter group. The recurrent regions of uniparental disomy may indicate the position of novel leukemia-associated genes.Introduction
The myelodysplastic syndromes (MDS) are a heterogeneous group of clonal disorders of the hematopoietic stem cell characterized by ineffective hematopoiesis,1 with approximately 30% to 40% of cases transforming to acute myeloid leukemia (AML).2,3 An interstitial deletion of the long arm of chromosome 5, del(5q), is the most common chromosomal abnormality in MDS with a frequency of 10–15% among all patients.4 del(5q) also occurs in AML4 and several other cancers.5–7 The 5q- syndrome is defined as an MDS with a bone marrow blast count <5% and the presence of the del(5q) as the sole chromosomal abnormality.8,9 The 5q- syndrome has a good prognosis and a low probability of transformation to AML.10,11 However, in high-risk MDS and de novo AML a del(5q) is associated with a poor prognosis.12
Single nucleotide polymorphism (SNP) microarrays allow the simultaneous analysis of large numbers of polymorphic loci across the genome, which provides information on copy number change and/or loss of heterozygosity (LOH),13,14 LOH can result from deletion of all copies of one homolog or from uniparental disomy (UPD) in which both copies of a chromosome pair have the same parental origin. UPD can arise through a number of mechanisms including incomplete segregation of chromosomes and mitotic recombination. Conventional cytogenetic methods do not detect UPD, whereas UPD can be efficiently detected by SNP microarrays.15,16 UPD has been described in many cancers, including AML, in which an association between UPD and homozygous gene mutation was found. In seven of 13 cases of AML with UPD, homozygous mutations at four distinct loci were found in the regions affected by UPD.17 A recent study showed that UPD is a prominent feature of some diseases such as chronic lymphocytic leukemia and myeloma.18,19
In this study we used SNP arrays to identify cryptic copy number changes and regions of UPD in patients with MDS and a del(5q), including patients with the 5q- syndrome.
Design and Methods
Patients and samples
Forty-two patients with MDS and 45 healthy volunteers were included in this study. The MDS patients were classified according to French-American-British (FAB) criteria. All MDS patients had a deletion of the long arm of chromosome 5 [del(5q)], and 21 patients had the 5q- syndrome. The samples from MDS patients were obtained from several sources: Oxford and Bournemouth (United Kingdom), Duisburg (Germany), Stockholm (Sweden) and Pavia (Italy). The study was approved by the ethics committees of the institutes involved (Oxford C00.196, Bournemouth 9991/03/E, Duisburg 2283/03, Stockholm 410/03, Pavia 26264/2002) and informed consent was obtained. The patients’ characteristics are shown in Table 1.
DNA preparation
Peripheral blood neutrophils were isolated using Histopaque (Sigma-Aldrich) and pelleted after hypotonic lysis of erythrocytes and two washes in phosphate-buffered saline. The purity of the neutrophil populations was high, >95%, as assessed by standard morphology on Wright-Giemsa-stained cytospin preparations. For eight MDS patients, T cells were isolated from peripheral blood by MACS with CD3 microbeads, for paired sample analysis. Genomic DNA was isolated by phenolchloroform extraction using standard methods and evaluated for quality and concentration using an ND-1000 spectrophotometer (Nanodrop, Wilmington, USA).
SNP mapping assay
The SNP mapping assay was performed according to the protocol supplied by the manufacturer (Affymetrix, Santa Clara, CA, USA). Briefly, 250 ng DNA were digested with Hind III, ligated to the adaptor, and amplified by polymerase chain reaction (PCR) using a single primer. PCR products were purified with the DNA amplification clean-up kit (Clontech) and the amplicons were quantified. The 40 μg of purified amplicons were fragmented, end-labeled and hybridized to a Genechip Mapping 50K Hind III array at 48°C for 16–18 hours in a Hybridization Oven 640 (Affymetrix). After washing and staining in a Fluidics Station 450 (Affymetrix), the arrays were scanned with a GeneChip Scanner 3000 (Affymetrix).
SNP array data analysis
Cell intensity calculations and scaling were performed using GeneChip Operating Software (GCOS). Data were analyzed using GeneChip Genotyping Analysis Software Version 4.0 (Affymetrix) and CNAG software version 2.0. Quality control was performed within the Genotyping software after scaling the signal intensities of all arrays to a target of 100%. DNA copy number was analyzed with both the chromosome copy number tool (CNAT) version 3.0 and CNAG version 2.0. CNAT compares obtained SNP hybridization signal intensities with SNP intensity distributions of a reference set from more than 100 healthy individuals of different ethnicity. For analysis with CNAG we used the pool of 45 healthy controls included in the study as a reference set.
Mutation analysis
Coding exons of SPAG-6 were amplified from the neutrophil DNA of 12 patients with del(5q) MDS, including ten patients with the 5q- syndrome, by PCR. Primers and amplifications conditions are available on request. PCR products were purified using ExoSAP-IT (GE Healthcare) and directly sequenced using a BigDye Terminator v1.1 cycle sequencing kit and an ABI 3100 Genetic analyzer. Sequence data were analyzed using Sequence Analysis 1.6.0 (Informagen).
Results
Quality control of SNP genotype arrays
The accuracy of whole-genome sampling analysis was determined by the SNP call rate generated via GCOS software. The mean genotyping call rates were 96.72% for the patients’ samples and 95.40% for the control samples. By calculation of heterozygous SNP calls within the X chromosome in 28 males, the accuracy of the SNP array was estimated to be 99.68%.
Del(5q) confirmed and 5q- syndrome commonly deleted region defined
DNA copy number at each SNP locus is calculated from the corresponding signal intensity on the array. The del(5q) was observed in all 42 patients included in the study; the extent and size of the deletions are given in Online Supplementary Table S1. The boundaries of the commonly deleted region in the 21 patients with the 5q- syndrome were between the genes SH3TC2 at the proximal boundary (physical position 148362828 bp) and GLRA1 at the distal boundary (physical position 151297473 bp) (Figure 1A). Using SNP array technology the commonly deleted region measures approximately 2.9 Mb.This confirms our previous mapping data of the 5q- syndrome commonly deleted region,20 provides more precise definition of the region’s boundaries and a more accurate estimation of its size. When taking all 42 patients into consideration the commonly deleted region remained unchanged.
In one patient with the 5q- syndrome the del(5q) was shown to be non-contiguous, (Table 2 and Figure 1B), an abnormality not revealed by conventional cytogenetic methods.
Detection of additional copy number changes in myelodysplastic syndromes patients with del(5q)
None of the 21 patients with the 5q- syndrome had copy number changes in addition to the del(5q). Copy number changes in addition to the del(5q) were observed in ten of the other 21 MDS patients. In five of these patients the copy number changes were not identified by cytogenetics (Table 2). For example, patient #26 was shown to have amplification of 13q31.2-q34 by SNP array (Table 2). Nine patients had abnormalities documented by cytogenetics that were not seen by SNP array, the majority of these abnormalities were translocations or inversions.
In three patients with chromosome translocations (see Table 2, patients #7, 17 and 35), SNP array analysis identified a deletion that encompassed the translocation breakpoint defined by karyotyping.
Regions of LOH with normal copy number - UPD
Regions of UPD were defined as continuous stretches of homozygous SNP calls >2 Mb without copy number loss.21 In this study a total of 90 regions of UPD >2 Mb, dispersed across 17 chromosomes were identified in 36/45 normal control subjects. The majority of these regions were between 2 Mb and 3 Mb.
For the patients we report only those regions of UPD that were not observed in any of the normal healthy controls included in our study or in normal healthy control subjects reported in the literature.21,22 A total of 63 regions of UPD > 2 Mb, dispersed on 18/23 chromosomes, were detected in 40 of 42 patients (Online Supplementary Table S2) (Figure 2a). In the 5q- syndrome group, 31 regions of UPD > 2 Mb were identified in 19 of 21 patients, the largest one being 7.6 Mb. Thirty-two regions of UPD > 2 Mb were identified among all 21 del(5q) MDS patients, including six regions of UPD >10 Mb (Figure 2b). In the 5q- syndrome group the frequency of UPD regions was highest on chromosome 4 and in the del(5q) MDS group on chromosomes 6 and 13 (Figure 2A).
Recurrent regions of UPD
We identified eight regions of UPD > 2 Mb in two or more patients and in none of the controls; 4q21.21 (two patients), 4q13.1 (three patients), 4q26-q27 (two patients), 4q13.1- q13.2 (two patients), 6q13- q14.1 (four patients), 6q14.3-q15 (two patients); 8q21.11 (two patients), 9p21.1 (two patients). The sizes of these recurrent regions of UPD and the genes that map within them are shown in Table 3.
Paired sample analysis
For eight patients in the study we obtained DNA from their T cells as germline control material. For this paired sample analysis we recorded all regions of UPD >1 Mb that were present in the neutrophil, but not T-cell sample of each patient. In total we identified 50 regions of UPD > 1 Mb. Four regions of UPD > 1 Mb were identified in two or more patients: 3q25.32 (two patients), 5p14.3-p14.2 (two patients), 6q12 (two patients) and 10p12.31-p12.2 (four patients).
The four patients with the recurrent UPD region at 10p12.31-p12.2 were all classified as having 5q- syndrome. Seven genes map to this region; SPAG6, MLLT10, PIP5K2A, BM1, COMMD3, NEBL and DNAJC1.
Candidate gene SPAG6 mapping to this recurrent region of UPD at 10p12.31-p12.2 was analyzed for the presence of mutations. All 11 exons of SPAG6 were sequenced in ten patients with the 5q- syndrome, including the four patients with the corresponding UPD, and two patients with del(5q) MDS. No mutations were identified.
In addition, two regions of LOH with associated deletions at 12p13.2-p13.1 and 12q21.31-q22 were detected in the neutrophil but not T-cell sample of a patient with del(5q) MDS.
Discussion
Using SNP array analysis, we analyzed a group of 42 patients with MDS and del(5q) for the presence of copy number changes and LOH. Twenty-one patients with the 5q- syndrome were compared with 21 patients with del(5q) MDS. Patients with the 5q- syndrome have a good prognosis and this may be related to their relative genetic stability as evidenced by an absence of other cytogenetic abnormalities and a failure to date to find other leukemia-associated mutations in this group of patients in contrast to other MDS patients with the del(5q).23
Nevertheless hitherto unidentified genetic abnormalities in addition to the del(5q) may exist in the 5q- syndrome. The high density SNP arrays offer a very good method of analysis for determining genomic abnormalities which may have been missed by earlier technologies. SNP analysis also provides for an accurate, efficient comparative method of examining large genomic deletions - giving us the opportunity for a new analysis of the critical deleted region in the 5q- syndrome, which has confirmed and refined our previous analysis.
Additional copy number changes were detected in 10/21 del(5q) MDS patients. In general the karyotypic abnormalities detected by cytogenetic analysis were confirmed by SNP array analysis. Some additional abnormalities were detected by SNP array and the greater resolution achieved by this method provided a more accurate definition of abnormalities, particularly in cases with a complex karyotype. A proportion of lesions detected by cytogenetic analysis were not found by SNP array. A limitation of SNP arrays is the inability to distinguish multiple clones, and minor clones may be detected by cytogenetics but not by SNP array analysis. In this study, the majority of lesions missed by the SNP array were translocations; this discrepancy is not unexpected as SNP array technology is unable to detect balanced rearrangements. Interestingly, cryptic deletions were detected at the location of translocation breakpoints in three cases. Such cryptic deletions associated with balanced translocations have been reported previously.24 For example, a deletion associated with the 13q14 translocation in B-cell leukemia pin-pointed a region of genetic importance in this disorder.25
In marked contrast to del(5q) MDS patients none of the 5q- syndrome patients showed any additional copy number changes. Again, as with the mutation study referred to above,23 this result is consistent with the good prognosis of the 5q- syndrome.
In this study, a total of 90 regions of UPD, dispersed across the genome on 17/23 chromosomes, were identified in 36 control subjects. The majority of the regions of UPD were between 2 and 3 Mb in length. The frequency of regions of UPD in normal individuals observed in this study is comparable with that observed by Gibson et al., who examined the genome-wide SNP genotypic data from the 209 unrelated individuals from the four HapMap populations. They observed 1393 tracts of consecutive homozygous SNP exceeding 1 Mb in length among the 209 individuals.22 These studies show homozygous tracts are common in normal individuals and that a control pool is essential for the analysis of SNP array data.
SNP analysis that enables the detection of UPD has already proven very informative in the study of hematologic malignancies. An illustrative example is the UPD seen in some cases of polycythemia vera that corresponds to the location of the JAK2 gene on chromosome 9p.26,27 By analogy with polycythemia vera, it is considered that SNP analysis may give a clue to the location of disease genes in MDS, as some studies have already confirmed in AML.15,17
For the patient group as a whole we determined disease-related UPD on the basis of size (>2 Mb) and also the absence of the corresponding UPD in control groups. We found that regions of UPD were very common in the patients although the del(5q) MDS group clearly showed a greater number of large UPD regions than did the 5q- syndrome group. There was no correlation between the number of copy number changes and number of regions of UPD per patient. However, the majority of large regions of UPD (>10 Mb) were in patients who also had regions of copy number change, suggesting a common underlying genetic instability. The regions of UPD are not evenly distributed across the various chromosomes, since their frequencies are higher on chromosomes 4, 6 and 13. Eight regions of UPD were observed in two or more patients: four different regions on 4q, two different regions on 6q, one region on 8q and one on 9p. A recent study of low-risk MDS by SNP array analysis also highlighted chromosome 4 as having the highest number of regions of UPD.21 Twelve regions of UPD ranging from 2.6–138.5 Mb were identified on chromosome 4 by Mohamedali et al., some of which overlap with the regions identified in our study.
For the subset of patients for whom we had corresponding T-cell DNA for direct intrapatient comparisons we were able to lower the stringency of the UPD definition to regions greater than 1 Mb. In this analysis we identified four recurrent regions of UPD. Four patients with the 5q- syndrome had a region of UPD at 10p12.2-10p12.3. identified only in the neutrophil but not T-cell samples. Seven genes mapping to this recurrent region of UPD, SPAG6, MLLT10, PIP5K2A, BM1, CommD3, NEBL and DNAJG1, are now to be considered as potential candidate genes for mutation in the 5q- syndrome. Gene expression profiling studies identified SPAG-6 as one of the most significantly upregulated genes in the CD34 cells from patients with the 5q- syndrome when compared with both normal individuals and patients with refractory anemia and a normal karyotype.28 In addition, SPAG-6 has recently been shown to be markedly overexpressed in pediatric AML,29 hence this gene was a good candidate to search for mutations which would be rendered homozygous by UPD. We analyzed the 11 exons of SPAG-6 for the presence of mutations in the four 5q- syndrome patients with chromosome 10p12.2-10p12.3 UPD, six additional patients with the 5q- syndrome and two patients with del(5q) MDS. No mutations were identified, although further investigation of SPAG6 in MDS is warranted.
In conclusion, we have identified recurrent regions of UPD in patients with MDS which may be of pathogenetic importance. In addition we have shown a clear distinction between 5q- syndrome and other del(5q) MDS cases with regards to both copy number changes and the presence of large UPD, consistent with the different clinical behavior of patients with these two disorders.
Footnotes
- Funding: this work was supported by the Leukaemia Research Fund of the United Kingdom and supported in part by grants from the Associazone Italiana per la Ricerca sul Cancro (AIRC), Milan and Fondazione Cariplo, IRCCS Policlinico San Matteo, Pavia, Italy.
- The online version of this article contains a supplementary appendix.
- Authorship and Disclosures CF, JSW and JB designed the research; LW, CF, AG, MC, LM, SK and CA performed the research; LW and CF analyzed the data; LW, CF, JSW and JB wrote the paper. The authors reported no potential conflicts of interest.
- Received December 10, 2007.
- Revision received February 12, 2008.
- Accepted March 6, 2008.
References
- Goasguen JE, Bennett JM. Classification and morphologic features of the myelodysplastic syndromes. Semin Oncol. 1992; 19:4-13. Google Scholar
- Aul C, Bowen DT, Yoshida Y. Pathogenesis, etiology and epidemiology of myelodysplastic syndromes. Haematologica. 1998; 83:71-86. Google Scholar
- Heaney ML, Golde DW. Myelodysplasia. N Engl J Med. 1999; 340:1649-60. Google Scholar
- Mauritzson N, Albin M, Rylander L, Billstrom R, Ahlgren T, Mikoczy Z. Pooled analysis of clinical and cytogenetic features in treatment-related and de novo adult acute myeloid leukemia and myelodysplastic syndromes based on a consecutive series of 761 patients analyzed 1976–1993 and on 5098 unselected cases reported in the literature 1974–2001. Leukemia. 2002; 16:2366-78. Google Scholar
- Johannesdottir HK, Jonsson G, Johannesdottir G, Agnarsson BA, Eerola H, Arason A. Chromosome 5 imbalance mapping in breast tumors from BRCA1 and BRCA2 mutation carriers and sporadic breast tumors. Int J Cancer. 2006; 119:1052-60. Google Scholar
- Jones AM, Douglas EJ, Halford SE, Fiegler H, Gorman PA, Roylance RR. Array-CGH analysis of microsatellite-stable, near-diploid bowel cancers and comparison with other types of colorectal carcinoma. Oncogene. 2005; 24:118-29. Google Scholar
- Shin JH, Kang SM, Kim YS, Shin DH, Chang J, Kim SK. Identification of tumor suppressor loci on the long arm of chromosome 5 in pulmonary large cell neuroendocrine carcinoma. Chest. 2005; 128:2999-3003. Google Scholar
- Boultwood J, Lewis S, Wainscoat JS. The 5q-syndrome. Blood. 1994; 84:3253-60. Google Scholar
- Fenaux P, Kelaidi C. Treatment of the 5q- syndrome. Hematology Am Soc Hematol Educ Program. 2006;192-8. Google Scholar
- Van den Berghe H, Michaux L. 5q-, twenty-five years later: a synopsis. Cancer Genet Cytogenet. 1997; 94:1-7. Google Scholar
- Van den Berghe H, Vermaelen K, Mecucci C, Barbieri D, Tricot G. The 5q-anomaly. Cancer Genet Cytogenet. 1985; 17:189-255. Google Scholar
- Giagounidis AA, Germing U, Aul C. Biological and prognostic significance of chromosome 5q deletions in myeloid malignancies. Clin Cancer Res. 2006; 12:5-10. Google Scholar
- Nannya Y, Sanada M, Nakazaki K, Hosoya N, Wang L, Hangaishi A. A robust algorithm for copy number detection using high-density oligonucleotide single nucleotide polymorphism genotyping arrays. Cancer Res. 2005; 65:6071-9. Google Scholar
- Yamamoto G, Nannya Y, Kato M, Sanada M, Levine RL, Kawamata N. Highly sensitive method for genomewide detection of allelic composition in nonpaired, primary tumor specimens by use of affymetrix single-nucleotide-polymorphism genotyping microarrays. Am J Hum Genet. 2007; 81:114-26. Google Scholar
- Raghavan M, Lillington DM, Skoulakis S, Debernardi S, Chaplin T, Foot NJ. Genome-wide single nucleotide polymorphism analysis reveals frequent partial uniparental disomy due to somatic recombination in acute myeloid leukemias. Cancer Res. 2005; 65:375-8. Google Scholar
- Teh MT, Blaydon D, Chaplin T, Foot NJ, Skoulakis S, Raghavan M. Genomewide single nucleotide polymorphism microarray mapping in basal cell carcinomas unveils uniparental disomy as a key somatic event. Cancer Res. 2005; 65:8597-603. Google Scholar
- Fitzgibbon J, Smith LL, Raghavan M, Smith ML, Debernardi S, Skoulakis S. Association between acquired uniparental disomy and homozygous gene mutation in acute myeloid leukemias. Cancer Res. 2005; 65:9152-4. Google Scholar
- Pfeifer D, Pantic M, Skatulla I, Rawluk J, Kreutz C, Martens UM. Genome-wide analysis of DNA copy number changes and LOH in CLL using high-density SNP arrays. Blood. 2007; 109:1202-10. Google Scholar
- Walker BA, Leone PE, Jenner MW, Li C, Gonzalez D, Johnson DC. Integration of global SNP-based mapping and expression arrays reveals key regions, mechanisms, and genes important in the pathogenesis of multiple myeloma. Blood. 2006; 108:1733-43. Google Scholar
- Boultwood J, Fidler C, Strickson AJ, Watkins F, Gama S, Kearney L. Narrowing and genomic annotation of the commonly deleted region of the 5q- syndrome. Blood. 2002; 99:4638-41. Google Scholar
- Mohamedali A, Gaken J, Twine NA, Ingram W, Westwood N, Lea NC. Prevalence and prognostic significance of allelic imbalance by single nucleotide polymorphism analysis in low risk myelodysplastic syndromes. Blood. 2007; 110:3365-73. Google Scholar
- Gibson J, Morton NE, Collins A. Extended tracts of homozygosity in outbred human populations. Hum Mol Genet. 2006; 15:789-95. Google Scholar
- Fidler C, Watkins F, Bowen DT, Littlewood TJ, Wainscoat JS, Boultwood J. NRAS, FLT3 and TP53 mutations in patients with myelodysplastic syndrome and a del(5q). Haematologica. 2004; 89:865-6. Google Scholar
- De Gregori M, Ciccone R, Magini P, Pramparo T, Gimelli S, Messa J. Cryptic deletions are a common finding in “balanced” reciprocal and complex chromosome rearrangements: a study of 59 cases. J Med Genet. 2007; 44:750-62. Google Scholar
- Herholz H, Kern W, Schnittger S, Haferlach T, Dicker F, Haferlach C. Translocations as a mechanism for homozygous deletion of 13q14 and loss of the ATM gene in a patient with B-cell chronic lymphocytic leukemia. Cancer Genet Cytogenet. 2007; 174:57-60. Google Scholar
- Kralovics R, Guan Y, Prchal JT. Acquired uniparental disomy of chromosome 9p is a frequent stem cell defect in polycythemia vera. Exp Hematol. 2002; 30:229-36. Google Scholar
- Kralovics R, Passamonti F, Buser AS, Teo SS, Tiedt R, Passweg JR. A gain-of-function mutation of JAK2 in myeloproliferative disorders. N Engl J Med. 2005; 352:1779-90. Google Scholar
- Boultwood J, Pellagatti A, Cattan H, Lawrie CH, Giagounidis A, Malcovati L. Gene expression profiling of CD34(+) cells in patients with the 5q-syndrome. Br J Haematol. 2007; 139:578-89. Google Scholar
- Steinbach D, Schramm A, Eggert A, Onda M, Dawczynski K, Rump A. Identification of a set of seven genes for the monitoring of minimal residual disease in pediatric acute myeloid leukemia. Clin Cancer Res. 2006; 12:2434-41. Google Scholar