Categorization of 77 dystrophinexons into 5 groups by a decision tree using indexes of splicing regulatory factors as decision markers
© Malueka et al; licensee BioMed Central Ltd. 2012
Received: 18 January 2012
Accepted: 31 March 2012
Published: 31 March 2012
Duchenne muscular dystrophy, a fatal muscle-wasting disease, is characterized by dystrophin deficiency caused by mutations in the dystrophin gene. Skipping of a target dystrophin exon during splicing with antisense oligonucleotides is attracting much attention as the most plausible way to express dystrophin in DMD. Antisense oligonucleotides have been designed against splicing regulatory sequences such as splicing enhancer sequences of target exons. Recently, we reported that a chemical kinase inhibitor specifically enhances the skipping of mutated dystrophin exon 31, indicating the existence of exon-specific splicing regulatory systems. However, the basis for such individual regulatory systems is largely unknown. Here, we categorized the dystrophin exons in terms of their splicing regulatory factors.
Using a computer-based machine learning system, we first constructed a decision tree separating 77 authentic from 14 known cryptic exons using 25 indexes of splicing regulatory factors as decision markers. We evaluated the classification accuracy of a novel cryptic exon (exon 11a) identified in this study. However, the tree mislabeled exon 11a as a true exon. Therefore, we re-constructed the decision tree to separate all 15 cryptic exons. The revised decision tree categorized the 77 authentic exons into five groups. Furthermore, all nine disease-associated novel exons were successfully categorized as exons, validating the decision tree. One group, consisting of 30 exons, was characterized by a high density of exonic splicing enhancer sequences. This suggests that AOs targeting splicing enhancer sequences would efficiently induce skipping of exons belonging to this group.
The decision tree categorized the 77 authentic exons into five groups. Our classification may help to establish the strategy for exon skipping therapy for Duchenne muscular dystrophy.
KeywordsSplicing Dystrophin Exon Splicing enhancer Decision tree
Duchenne muscular dystrophy (DMD), a fatal muscle-wasting disease, is the most common inherited muscle disease, affecting one in every 3500 male births. DMD is characterized by dystrophin deficiency caused by mutations in the dystrophin gene, the largest human gene that spans over 2500 kb on the X-chromosome. For the treatment of DMD, antisense oligonucleotides (AOs) against splicing regulatory sequences have been proposed to produce in-frame dystrophin mRNA from the out-of-frame mRNA by inducing exon skipping during splicing . The newly generated in-frame dystrophin mRNA is expected to produce semi-functional, internally deleted dystrophin protein. Currently, induction of exon skipping with AOs is considered one of the most promising treatments for DMD [2, 3].
The dystrophin gene encodes a 14-kb mRNA consisting of 79 exons and is characterized by its huge intron size; the largest, intron 44, is 248 kb long. In addition, eight alternative promoters that are activated in a tissue-specific manner have been identified. Each tissue-specific exon 1 under the control of a cryptic promoter is spliced correctly to one of the downstream authentic dystrophin exons, producing a tissue-specific dystrophin isoform [4, 5]. Furthermore, alternative splicings of some exons lead to the production of additional isoforms of the tissue-specific transcripts [6–8]. Remarkably, 14 cryptic exons that resemble authentic exons in terms of length and splice site strength, but are very rarely if ever spliced, have been reported within the huge introns [9, 10].
Splicing is the process that removes introns from pre-mRNA, and is performed in the spliceosome, a ribonucleoprotein assembly. The spliceosome is one of the most complex cellular machineries, comprising approximately 150 proteins and five small nuclear RNAs (snRNAs U1, U2, U4, U5, and U6) . Three sites, the splice donor site (5'ss), the splice acceptor site (3'ss), and the branch point sequences are the core splice site signals that are present in every intron. Despite the high potential for errors, the splicing process appears to occur with high fidelity, implying the widespread involvement of additional transcript features. These exonic elements are conventionally classified as exonic splicing enhancers (ESEs) or silencers (ESSs) and they function to promote or inhibit inclusion of the exon in which they reside, respectively. These splicing regulatory elements function by recruiting trans-acting splicing factors that activate or suppress splice site recognition or spliceosome assembly by various mechanisms . AOs that induce skipping of dystrophin exons have been mainly designed against ESEs of target exons to hamper recruitment of splicing factors [2, 3].
Recently, we reported that a small chemical enhances skipping of mutated dystrophin exon 31 in a sequence-specific manner, not altering the splicing of other dystrophin exons . It was strongly suggested that dystrophin exons have their own splicing regulatory systems. However, the characteristics of the splicing regulatory systems of individual dystrophin exons are not well understood.
Decision trees are classifiers that predict class labels for data items and can make very accurate predictions [14, 15]. They have been used to establish an integrated method that is one of the best available ways to find genes in the human genome .
Here, we identified a novel cryptic exon in intron 11 of the dystrophin gene in a DMD patient, and constructed decision trees to discriminate authentic exons from cryptic exons. Finally, we categorized 77 authentic exons into five groups based on indexes of their splicing regulatory factors. From the decision tree, we suppose that one group of exons showing high density for ESE sequences are a good target for exon skipping therapy.
We evaluated our re-constructed tree using nine disease-causing exons that have been identified in the dystrophin mRNAs of dystrophinopathy patients. These pathological exons are created by the effect of deep-intron single nucleotide mutations. These mutations confer the characteristics of an authentic exon on a portion of intronic sequence, and thus the intron segment becomes recognized as an exon during the splicing process. Therefore, the decision tree should categorize these pathological exons as authentic exons. Remarkably, all nine pathological exons were correctly categorized into one of the four exon groups: four in group A, three in group C, and one in each of groups D and E. Therefore, we consider our decision tree a suitable classifier of pathological dystrophin exons.
Seven indexes were used in common by both the preliminary and re-constructed trees. However, their values in each tree were different, except for one (≤ 1.39 ME3'ss score at the first node). When we compared the two trees, we found that three exons (2, 70, and 78) and two exons (40 and 56) were re-categorized into small groups (groups E and D) and the largest group (group C), respectively. This indicated slight changes in the categorizing pathways between the two trees. The cryptic exons were ultimately categorized into four groups, and exon 11a belonged to group d, which comprises exons 1a, 2a, and 3a-l. It was interesting that group D was categorized by a SIZE score of ≤ 114 (Figure 3). This indicates that a small exon can be real if the splicing silencer density is high.
Summary of categorization of exons by the decision tree
Number of Exon
Next, we used the tree to characterize exons that are subjected to specific splicing patterns. First, we marked exons known to be alternatively spliced (data not shown). However, they were found in all five groups. This indicates that no particular cis-elements predispose to alternative splicing. Second, we examined exons that were prone to splicing errors caused by intra-exon mutation. We have previously reported nonsense mutation-induced exon skipping in seven exons (exons 14, 15, 17, 27, 41, 42, and 70) . Among these, only four (exons 17, 27, 41, and 42) cause disruption of an ESE. Remarkably, exons 17, 41, and 42 were categorized in group A, while all those that do not disrupt an ESE were categorized in other groups. This indicates that, for exons in group A, nonsense mutation-induced exon skipping is caused by disruption of an ESE, while for exons in other groups, nonsense mutation-induced exon skipping is caused by other factors. Third, we examined exons that splice to tissue-specific exon 1 (exons 2, 30, 45, 56, and 63). We did not identify any particular categorization characteristics for these exons.
The dystrophin gene produces many mRNAs because it is so complex: it contains 79 exons, huge introns, eight tissue-specific promoters, 15 cryptic exons, and generates many alternatively spliced products. It has been reported that there is a gradient in exon and intron definition at the level of pre-mRNA splicing . For example, efficient use of intrinsically weak cryptic splice sites in exons is facilitated by a higher-than-average density of ESSs and a high density of SF2/ASF ESE motifs . This suggests that each exon has a specific splicing regulatory mechanism in which particular splicing factors could compensate for the lack of other splicing factors. It is important to understand more about the splicing regulatory mechanisms for each exon of the dystrophin gene because treatment with exon skipping is a promising therapeutic approach.
Decision trees have been used in many kinds of classifications, such as for the management of Parkinson's disease, disease severity profiling, large-scale proteomic studies, and microarray data classification [14, 18]. Here, we classified 77 authentic and 15 cryptic dystrophin exons into five groups based on their splicing regulatory factor characteristics. In this study, we needed to modify our preliminary decision tree (Figure 1) based on data from the novel exon 11a (Figure 2). This suggested a possibility that the tree needs further modification when another cryptic exon is identified in the future construction. However, we believe any modifications, if necessary, will be minimal as exon 11a was additionally analyzed (Figure 3). Moreover, our trees were constructed using indexes obtained only from cis-elements; further modification incorporating information from trans-elements may further improve the reliability of the tree.
We used our decision tree to determine the parameters that are most useful for the discrimination of authentic and cryptic dystrophin exons. We separated the two categories completely using eight variables: (1) strength of the 3'ss measured by maximum entropy score; (2) density of ESEs particularly SF2/ASF; (3) strength of the 5'ss measured by maximum entropy score; (4) free energy in U1-snRNP binding to the 5'ss; (5) density of ESSs as identified by FESS-D; (6) GC content at the 5'ss; (7) exon size; and (8) strength of the 3'ss as measured by the Shapiro score. In other words, the strength of the 3'ss consensus sequence as indicated by maximum entropy score is the most critical, as all authentic dystrophin exons must have a maximum entropy score of more than 1.39 at the 3'ss. The next most important deciding factor is ESE density, particularly for SF2/ASF; almost half (30) of dystrophin exons can be classified based on this parameter and 3'ss strength only. These findings are in line with those of a previous report, which showed that exons that are skipped because of splice site mutations have a weaker 3'ss and a lower-than-average density of ESEs .
Our exon categorization suggested that there are at least five different patterns of splicing regulatory mechanisms for dystrophin pre-mRNAs (Table 1). At the extreme, one group contained only one exon (exon 27; group B), implying that this exon has a unique splicing regulatory mechanism. In fact, alteration of exon 27 splicing has been reported for two nonsense mutations, resulting in a mild phenotype . Exon 27 may be particularly vulnerable to splicing alterations caused by intra-exonic mutations.
On the other hand, two groups (A and C) contained 72 out of 77 exons, suggesting that most dystrophin exons are under similar splicing control. Group A, consisting of 32 exons, was characterized by a high density of ESEs, particularly SF2/ASF (more than 10.53) (Figure 3) (Table 1), indicating that ESE density plays a critical role in the splicing of these exons. Disruption of ESEs for exons in this group is more likely to cause exon skipping compared with disruption of ESEs in the other exon groups. Nonsense mutations within exons 17, 41, and 42, all belonging to group A, have been shown to induce exon skipping . Accordingly, the nonsense mutations in these three exons disrupt an ESE sequence and thus cause exon skipping, while nonsense mutations that induce skipping of exons in other groups do not disrupt an ESE . However, not all ESE-disrupting nonsense mutations identified in these exons induce exon skipping . It may be necessary to consider trans-elements to explain these differences, along with the strength of splicing factor binding to the ESEs and their positions in the pre-mRNA secondary structure.
Intronic pseudo-exons that look like perfect exons, maintaining splicing consensus sequences, are now under intensive investigation [10, 20–22]. There is evidence that inclusion of many of these sequences in mRNAs is actively inhibited because of the presence of intrinsic defects, the presence of silencer elements, or the formation of an inhibitory RNA secondary structure . We categorized the cryptic exons into four groups (Figure 3), indicating a heterogeneous contribution of splicing factors required for their activation.
Group C contained 42 exons and was separated at the fifth node by a score of less than 46.8% GC content at the 5'ss. More than half of the dystrophin exons belonged to this group and are therefore presumed to be subject to similar splicing regulation. From the decision tree we can see that this group is characterized also by an ESS density ≤ 4.45 (FESS-D), indicating that splicing silencer signals play a critical role in the splicing of these exons. Indeed, we showed previously that nonsense mutation-induced exon skipping of exon 31 was caused by the creation of a splicing silencer-binding site for hnRNP A1 . It has been reported that negative elements play important roles in distinguishing real splicing signals from the vast number of false-positive splicing signals .
Because the dystrophin gene is so complex, it has the potential to produce many alternatively spliced transcripts that are translated into protein variants . However, studies on alternative splicing are limited [6–8]. In one study, Sironi et al. identified 16 alternative transcripts and examined splicing regulatory factors such as the 3' and 5' consensus values and exonic splicing enhancer scoring matrices; however, no reasonable explanation of the alternative splicing was identified . This is consistent with our findings that the cryptic exons fell into five different groups. It is possible that alternative splicing does not rely completely on specific sequence elements and is regulated by trans-acting factors.
Interest in the splicing regulation of dystrophin pre-mRNA was first aroused when exon skipping caused by a small intra-exonic deletion was identified in a DMD patient . Subsequently it was revealed that the deleted region was an ESE sequence within exon 19  and that AOs against this ESE successfully induced the skipping of exon 19 . This has led to the establishment of exon-skipping therapy . In this study, we demonstrated that exon 19 was categorized into group A, characterized by a high ESE density; hence it is reasonable that AOs against the exon 19 ESE would induce exon skipping efficiently. Currently exon skipping is recognized as the most promising way to express dystrophin in DMD. The main targets for exon skipping therapy are exons 44, 45, 51, and 53  and AOs against exon 51 are now in phase II or III clinical trials [2, 3]. Because exons 45, 51, and 53 belong to group A, we would expect AOs against the ESEs in these exons to work well.
The decision tree categorized the 77 authentic dystrophin exons into five groups. Our classification may help to establish the strategy for exon skipping therapy for DMD.
Indexes of splicing regulatory factors
Indexes of splicing regulatory factors
5' splice site strength (Shapiro score)
3' splice site strength (Shapiro score)
5' spice site strength (maximum entropy)
3' spice site strength (maximum entropy)
5' splice site strength (information content/Ri)
3' splice site strength (information content/Ri)
U1 SnRNA binding free energy
ESE density (RESCUE ESE/RESE)
ESE density (PESE)
ESS density (FAS-ESS/FESS)
ESS density (PESS)
SF2/ASF (IgM/BRCA1) number
SF2/ASF (IgM, BRCA1)-N
SF2/ASF score density
SF2/ASF (IgM/BRCA1) score density
SC35 score density
SRp40 score density
SRp55 score density
5' splice site pre-mRNA secondary structure free energy
3' splice site pre-mRNA secondary structure free energy
5' splice site GC content
3' splice site GC content
The number of nucleotides in exon
Splice site strength
Splice site strength was determined in three ways: Shapiro's splicing probability matrix scores at the 5'ss and the 3'ss were calculated using published formulae [17, 31] (SH5'ss and SH3'ss, respectively). Maximum entropy scores at the 5'ss and the 3'ss were obtained using online tools available at http://genes.mit.edu/burgelab/maxent/Xmaxentscan_scoreseq.html (ME5'ss and ME3'ss, respectively) . Information content values at the 5'ss and the 3'ss were obtained using the Delila server at https://splice.uwo.ca/ (Ri5'ss and Ri3'ss, respectively) .
Free energy of U1 snRNA binding to the 5'ss
U1 snRNA binding to constitutive splice sites has lower free energy than that of its binding to alternatively spliced exon splice sites . Free energy was analyzed using the SROOGLE server at http://sroogle.tau.ac.il/ (FE) .
Numbers and densities of ESEs and ESSs
The number of ESEs in each exon was calculated using the prediction algorithm at http://genes.mit.edu/burgelab/rescue-ese/ (RESE). The number of ESSs was calculated using two algorithms: at http://genes.mit.edu/fas-ess/ (FESS) and http://cubweb.biology.columbia.edu/pesx/ (PESS) [17, 36–38]. To calculate the densities of ESSs/ESEs, the RESE, FESS, and PESS numbers were divided by the sequence length (in nucleotides) and this figure was multiplied by 100  to give the RESE-D, FESS-D, and PESS-D scores.
The numbers of binding sites for the four SR proteins (SF2/ASF, SRp40, SC35, and SRp55) were obtained using ESEfinder (v. 3.0) available at http://rulai.cshl.edu/cgi-bin/tools/ESE3/esefinder.cgi?process=home (SF2/ASF-N, SRp40-N, SC35-N, and SRp55-N) . The density of SR protein-binding sites (SF2/ASF-D, SC35-D, SRp40-D, SRp55-D) was obtained by dividing each number by its nucleotide length then multiplying it by 100 .
Minimum free energy value of pre-mRNA secondary structure and GC content around splice sites
Minimum free energy values of the pre-mRNA secondary structure at the 5'ss and 3'ss were obtained using the free energy minimization program RNAfold http://rna.tbi.univie.ac.at/cgi-bin/RNAfold.cgi using the 70 nucleotides both up- and down-stream of each splice site (RSS5'ss and RSS3'ss, respectively) [40, 41].
GC content around splice sites has been reported to affect splicing . Thus, the percentage of GC content for each 70 nucleotides both up- and down-stream of the 5'ss and the 3'ss was calculated (GC5'ss and GC3'ss, respectively) .
The number of nucleotides in each exon was also taken into account (SIZE).
Construction of decision trees
The C4.5 algorithm was used to construct decision trees. The C4.5 decision tree algorithm is an approach for pattern recognition and data mining [42, 43] and was developed by Quinlan [44, 45]. The algorithm uses information gain, which is an entropy-based criterion in information theory for decision tree construction. In this study, the conditions were as follows: (1) at least one object was to be contained in each branch, (2) the pruning confidence level was set to 100%, and (3) the iterative mode was used to avoid a local optimum.
We used 92 out of 94 data points (79 exons and 15 cryptic exons) for decision tree construction. We excluded two exons, exons 1 and 79, because they lacked some feature values.
A 1-year-old Japanese boy (KUCG 952) without any family history of neuromuscular disorders was referred to Kobe University Hospital (Kobe, Japan) because of a high serum creatine kinase level (37,110 IU/l). DMD was tentatively diagnosed, and mutation in the dystrophin gene was analyzed after obtaining informed consent from his parents. Dystrophin mRNA expressed in his peripheral lymphocytes was analyzed as described previously [10, 25]. A fragment spanning exons 10 to 14 was amplified by reverse transcription (RT)-nested PCR using two sets of primers. The outer primer set comprised a forward primer, 1A 5'-TTTTTATCGCTGCCTTGATATACA-3' and a reverse primer, 1B 5'-ACTCTGCAACACAGCTTCTGAG-3'; the inner primer set comprised a forward primer, 1E 5'-TTGCAAGCACAAGGAGAGATT-3' and a reverse primer, c14r 5'-ACGTTGCCATTTGAGAAGGAT-3'. The amplified fragments were resolved by agarose gel electrophoresis. Sequencing of the amplified products was performed by subcloning sequencing as described previously .
The mutation study was approved by our ethical committee and mutation analysis of the dystrophin gene was done after obtaining the informed consent from the parents of the patient.
Duchenne muscular dystrophy
Exonic splicing enhancer
Exonic splicing silencer
Maximum entropy of 3' splice site
SF2/ASF score density
Maximum entropy of 5' splice site
Free energy of U1 SnRNA binding
Fluorescence-activated screen-for exonic splicing silencers density
GC content of 5' splice site
Shapiro score of 3' splice site
Size of exon.
This work was supported by a Grant-in-Aid for Scientific Research (B), and a Grant-in-Aid for Exploratory Research from the Japan Society for the Promotion of Science, a Health and Labour Sciences Research Grant for Research on Psychiatric and Neurological Diseases and Mental Health, and a research grant for Nervous and Mental Disorders from the Ministry of Health, Labour and Welfare, Japan.
- Takeshima Y, Nishio H, Sakamoto H, Nakamura H, Matsuo M: Modulation of in vitro splicing of the upstream intron by modifying an intra-exon sequence which is deleted from the dystrophin gene in dystrophin Kobe. J Clin Invest. 1995, 95: 515-520. 10.1172/JCI117693.PubMed CentralView ArticlePubMed
- Aartsma-Rus A: Antisense-mediated modulation of splicing: Therapeutic implications for duchenne muscular dystrophy. RNA Biol. 2010, 7 (4): 453-461. 10.4161/rna.7.4.12264.View ArticlePubMed
- Lu QL, Yokota T, Takeda S, Garcia L, Muntoni F, Partridge T: The Status of Exon Skipping as a Therapeutic Approach to Duchenne Muscular Dystrophy. Mol Ther. 2011, 19: 9-15. 10.1038/mt.2010.219.PubMed CentralView ArticlePubMed
- Ahn AH, Kunkel LM: The structural and functional diversity of dystrophin. Nat Genet. 1993, 3: 283-291. 10.1038/ng0493-283.View ArticlePubMed
- Nishio H, Takeshima Y, Narita N, Yanagawa H, Suzuki Y, Ishikawa Y, Minami R, Nakamura H, Matsuo M: Identification of a novel first exon in the human dystrophin gene and of a new promoter located more than 500 kb upstream of the nearest known promoter. J Clin Invest. 1994, 94: 1037-1042. 10.1172/JCI117417.PubMed CentralView ArticlePubMed
- Feener CA, Koenig M, Kunkel LM: Alternative splicing of human dystrophin mRNA generates isoforms at the carboxy terminus. Nature. 1989, 338: 509-511. 10.1038/338509a0.View ArticlePubMed
- Sironi M, Cagliani R, Pozzoli U, Bardoni A, Comi GP, Giorda R, Bresolin N: The dystrophin gene is alternatively spliced throughout its coding sequence. FEBS Lett. 2002, 517: 163-166. 10.1016/S0014-5793(02)02613-3.View ArticlePubMed
- Kubokawa I, Takeshima Y, Ota M, Enomoto M, Okizuka Y, Mori T, Nishimura N, Awano H, Yagi M, Matsuo M: Molecular characterization of the 5'-UTR of retinal dystrophin reveals a cryptic intron that regulates translational activity. Mol Vis. 2010, 16: 2590-2597.PubMed CentralPubMed
- Sun H, Chasin LA: Multiple splicing defects in an intronic false exon. Mol Cell Biol. 2000, 20: 6414-6425. 10.1128/MCB.20.17.6414-6425.2000.PubMed CentralView ArticlePubMed
- Zhang Z, Habara Y, Nishiyama A, Oyazato Y, Yagi M, Takeshima Y, Matsuo M: Identification of seven novel cryptic exons embedded in the dystrophin gene and characterization of 14 cryptic dystrophin exons. J Hum Genet. 2007, 52: 607-617. 10.1007/s10038-007-0163-0.View ArticlePubMed
- Valadkhan S: Role of the snRNAs in spliceosomal active site. RNA Biol. 2010, 7: 345-353. 10.4161/rna.7.3.12089.View ArticlePubMed
- Matlin AJ, Clark F, Smith CW: Understanding alternative splicing: towards a cellular code. Nat Rev Mol Cell Biol. 2005, 6: 386-398. 10.1038/nrm1645.View ArticlePubMed
- Nishida A, Kataoka N, Takeshima Y, Yagi M, Awano H, Ota M, Itoh K, Hagiwara M, Matsuo M: Chemical treatment enhances skipping of a mutated exon in the dystrophin gene. Nat Commun. 2011, 2: 308-PubMed CentralView ArticlePubMed
- Kingsford C, Salzberg SL: What are decision trees?. Nat Biotechnol. 2008, 26: 1011-1013. 10.1038/nbt0908-1011.PubMed CentralView ArticlePubMed
- Allen JE, Majoros WH, Pertea M, Salzberg SL: JIGSAW, GeneZilla, and GlimmerHMM: puzzling out the features of human genes in the ENCODE regions. Genome Biol. 2006, 7 (Suppl 1:S9): 1-13.PubMed
- Nishiyama A, Takeshima Y, Zhang Z, Habara Y, Tran TH, Yagi M, Matsuo M: Dystrophin nonsense mutations can generate alternative rescue transcripts in lymphocytes. Ann Hum Genet. 2008, 72: 717-724. 10.1111/j.1469-1809.2008.00468.x.View ArticlePubMed
- Královicová J, Vorechovsky I: Global control of aberrant splice-site activation by auxiliary splicing sequences: evidence for a gradient in exon and intron definition. Nucleic Acids Res. 2007, 35: 6399-6413. 10.1093/nar/gkm680.PubMed CentralView ArticlePubMed
- David MP, Concepcion GP, Padlan EA: Using simple artificial intelligence methods for predicting amyloidogenesis in antibodies. BMC Bioinforma. 2010, 11: 79-10.1186/1471-2105-11-79.View Article
- Shiga N, Takeshima Y, Sakamoto H, Inoue K, Yokota Y, Yokoyama M, Matsuo M: Disruption of the splicing enhancer sequence within exon 27 of the dystrophin gene by a nonsense mutation induces partial skipping of the exon and is responsible for Becker muscular dystrophy. J Clin Invest. 1997, 100: 2204-2210. 10.1172/JCI119757.PubMed CentralView ArticlePubMed
- Buratti E, Chivers M, Královicová J, Romano M, Baralle M, Krainer A, Vorechovsky I: Aberrant 5' splice sites in human disease genes: mutation pattern, nucleotide structure and comparison of computational tools that predict their utilization. Nucleic Acids Res. 2007, 35: 4250-4263. 10.1093/nar/gkm402.PubMed CentralView ArticlePubMed
- Dhir A, Buratti E: Alternative splicing: role of pseudoexons in human disease and potential therapeutic strategies. FEBS J. 2010, 277: 841-855. 10.1111/j.1742-4658.2009.07520.x.View ArticlePubMed
- Vorechovsky I: Transposable elements in disease-associated cryptic exons. Hum Genet. 2010, 127: 135-154. 10.1007/s00439-009-0752-4.View ArticlePubMed
- Dhir A, Buratti E, van Santen MA, Luhrmann R, Baralle FE: The intronic splicing code: multiple factors involved in ATM pseudoexon definition. EMBO J. 2010, 29: 749-760. 10.1038/emboj.2009.397.PubMed CentralView ArticlePubMed
- Wei B, Jin JP: Troponin T isoforms and posttranscriptional modifications: Evolution, regulation and function. Arch Biochem Biophys. 2011, 505: 144-154. 10.1016/j.abb.2010.10.013.PubMed CentralView ArticlePubMed
- Matsuo M, Masumura T, Nishio H, Nakajima T, Kitoh Y, Takumi T, Koga J, Nakamura H: Exon skipping during splicing of dystrophin mRNA precursor due to an intraexon deletion in the dystrophin gene of Duchenne muscular dystrophy Kobe. J Clin Invest. 1991, 87: 2127-2131. 10.1172/JCI115244.PubMed CentralView ArticlePubMed
- Pramono ZA, Takeshima Y, Alimsardjono H, Ishii A, Takeda S, Matsuo M: Induction of exon skipping of the dystrophin transcript in lymphoblastoid cells by transfecting an antisense oligodeoxynucleotide complementary to an exon recognition sequence. Biochem Biophys Res Commun. 1996, 226: 445-449. 10.1006/bbrc.1996.1375.View ArticlePubMed
- Takeshima Y, Yagi M, Wada H, Ishibashi K, Nishiyama A, Kakumoto M, Sakaeda T, Saura R, Okumura K, Matsuo M: Intravenous infusion of an antisense oligonucleotide results in exon skipping in muscle dystrophin mRNA of Duchenne muscular dystrophy. Pediatr Res. 2006, 59: 690-694. 10.1203/01.pdr.0000215047.51278.7c.View ArticlePubMed
- Takeshima Y, Yagi M, Okizuka Y, Awano H, Zhang Z, Yamauchi Y, Nishio H, Matsuo M: Mutation spectrum of the dystrophin gene in 442 Duchenne/Becker muscular dystrophy cases from one Japanese referral center. J Hum Genet. 2010, 55: 379-388. 10.1038/jhg.2010.49.View ArticlePubMed
- Yagi M, Takeshima Y, Wada H, Nakamura H, Matsuo M: Two alternative exons can result from activation of the cryptic splice acceptor site deep within intron 2 of the dystrophin gene in a patient with as yet asymptomatic dystrophinopathy. Hum Genet. 2003, 112: 164-170.PubMed
- Beroud C, Carrie A, Beldjord C, Deburgrave N, Llense S, Carelle N, Peccate C, Cuisset JM, Pandit F, Carre-Pigeon F, et al: Dystrophinopathy caused by mid-intronic substitutions activating cryptic exons in the DMD gene. Neuromuscul Disord. 2004, 14: 10-18. 10.1016/S0960-8966(03)00169-X.View ArticlePubMed
- Shapiro MB, Senapathy P: RNA splice junctions of different classes of eukaryotes: sequence statistics and functional implications in gene expression. Nucleic Acids Res. 1987, 15: 7155-7174. 10.1093/nar/15.17.7155.PubMed CentralView ArticlePubMed
- Yeo G, Burge CB: Maximum entrophy modeling of short sequence motifs with applications to RNA splicing signals. J Comput Biol. 2003, 11: 377-394.View Article
- Nalla VK, Rogan PK: Automated splicing mutation analysis by information theory. Hum Mutat. 2005, 25: 334-342. 10.1002/humu.20151.View ArticlePubMed
- Ast G: How did alternative splicing evolve?. Nat Rev Genet. 2004, 5: 773-782.View ArticlePubMed
- Schwartz S, Hall E, Ast G: SROOGLE: webserver for integrative, user-friendly visualization of splicing signals. Nucleic Acids Res. 2009, 37 (Web Server): W189-192. 10.1093/nar/gkp320.PubMed CentralView ArticlePubMed
- Fairbrother WG, Yeh RF, Sharp PA, Burge CB: Predictive identification of exonic splicing enhancers in human genes. Science. 2002, 297: 1007-1013. 10.1126/science.1073774.View ArticlePubMed
- Wang Z, Rolish ME, Yeo G, Tung V, Mawson M, Burge CB: Systematic identification and analysis of exonic splicing silencers. Cell. 2004, 119: 831-845. 10.1016/j.cell.2004.11.010.View ArticlePubMed
- Zhang XH, Chasin LA: Computational definition of sequence motifs governing constitutive exon splicing. Genes Dev. 2004, 18: 1241-1250. 10.1101/gad.1195304.PubMed CentralView ArticlePubMed
- Cartegni L, Wang J, Zhu Z, Zhang MQ, Krainer AR: ESEfinder: A web resource to identify exonic splicing enhancers. Nucleic Acids Res. 2003, 31: 3568-3571. 10.1093/nar/gkg616.PubMed CentralView ArticlePubMed
- Zhang J, Kuo CC, Chen L: GC content around splice sites affects splicing through pre-mRNA secondary structures. BMC Genomics. 2011, 12: 90-10.1186/1471-2164-12-90.PubMed CentralView ArticlePubMed
- Gruber AR, Lorenz R, Bernhart SH, Neubock R, Hofacker IL: The Vienna RNA websuite. Nucleic Acids Res. 2008, 36 (Web Server): W70-74. 10.1093/nar/gkn188.PubMed CentralView ArticlePubMed
- Han J, Kamber M, Pei J: Data Mining: Concepts and Techniques. 2011, Waltham, Massachusetts: Morgan Kaufmann Publishers
- Witten IH, Frank E: Practical Machine Learning Tools and Techniques. 2005, Waltham, Massachusetts: Morgan Kaufmann Publishers
- Quinlan JR: C4.5: Programs for Machine Learning. 1993, Waltham, Massachusetts: Morgan Kaufmann Publishers
- Quinlan JR: Improved Use of Continuous Attributes in C. J Artif Intell Res. 1996, 4: 77-90.
- Surono A, Takeshima Y, Wibawa T, Ikezawa M, Nonaka I, Matsuo M: Circular dystrophin RNAs consisting of exons that were skipped by alternative splicing. Hum Mol Genet. 1999, 8: 493-500. 10.1093/hmg/8.3.493.View ArticlePubMed
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.