High resolution analysis of the human transcriptome: detection of extensive alternative splicing independent of transcriptional activity
© Zhou et al; licensee BioMed Central Ltd. 2009
Received: 22 January 2009
Accepted: 5 October 2009
Published: 5 October 2009
Commercially available microarrays have been used in many settings to generate expression profiles for a variety of applications, including target selection for disease detection, classification, profiling for pharmacogenomic response to therapeutics, and potential disease staging. However, many commercially available microarray platforms fail to capture transcript diversity produced by alternative splicing, a major mechanism for driving proteomic diversity through transcript heterogeneity.
The human Genome-Wide SpliceArray™ (GWSA), a novel microarray platform, utilizes an existing probe design concept to monitor such transcript diversity on a genome scale. The human GWSA allows the detection of alternatively spliced events within the human genome through the use of exon body and exon junction probes to provide a direct measure of each transcript, through simple calculations derived from expression data. This report focuses on the performance and validation of the array when measured against standards recently published by the Microarray Quality Control (MAQC) Project. The array was shown to be highly quantitative, and displayed greater than 85% correlation with the HG-U133 Plus 2.0 array at the gene level while providing more extensive coverage of each gene. Almost 60% of splice events among genes demonstrating differential expression of greater than 3 fold also contained extensive splicing alterations. Importantly, almost 10% of splice events within the gene set displaying constant overall expression values had evidence of transcript diversity. Two examples illustrate the types of events identified: LIM domain 7 showed no differential expression at the gene level, but demonstrated deregulation of an exon skip event, while erythrocyte membrane protein band 4.1 -like 3 was differentially expressed and also displayed deregulation of a skipped exon isoform.
Significant changes were detected independent of transcriptional activity, indicating that the controls for transcript generation and transcription are distinct, and require novel tools in order to detect changes in specific transcript quantity. Our results demonstrate that the SpliceArray™ design will provide researchers with a robust platform to detect and quantify specific changes not only in overall gene expression, but also at the individual transcript level.
A large portion of the diversity within the transcriptome is generated by alternative splicing, which in some cases, can produce thousands of transcripts from a single gene or locus [1, 2]. This has important implications in biology and pathophysiology where extensive alterations in transcripts resulting from alternative splicing produce structurally different products and impacts the function of genes in biology, disease [3–6], as well as processes such as evolution [7, 8]. The fine granularity of the transcriptome has not been determined with clarity, and new commercial tools are required in order to begin to identify with certainty the diverse content of the transcriptome. Traditional microarray designs and analytical methods are not robust enough to detect this transcript diversity. SpliceArray™ microarrays were developed to experimentally define the composite of transcripts that are present within biological samples and have the ability to detect subtle differential changes in gene expression for different alternatively spliced isoforms. The performance parameters of the human Genome-Wide SpliceArray™ (GWSA) which monitors over 280,000 potential splicing events (known and predicted) were assessed here, guided by the recent publications from the MicroArray Quality Control (MAQC) consortium.
The MAQC project  is a FDA sponsored consortium founded to address concerns of microarray reproducibility of expression profiling experiments. The purpose was to generate quality control tools for the microarray community in order to avoid procedural failures and to develop guidelines for microarray data analysis by providing the public with large reference datasets along with readily accessible reference RNA samples. The study found, that overall, the platforms perform similarly  and were validated with alternative quantitative gene expression platforms . However, all the platforms tested contain a similar bias where probes were designed to monitor the overall level of the gene, and do not give any expression information toward the isoform diversity produced from each gene through alternative splicing. Here we present a novel microarray platform and analysis for the efficient detection of isoform diversity on a genome wide scale in human. Our analysis of this new microarray design is in accordance with the approach outlined by the MAQC consortium and demonstrates that the SpliceArray™ products are highly reproducible, quantify transcripts, and are sensitive in detecting subtle changes in transcript ratios.
Distribution of splice events from an analysis of the human genome.
Number of events
Total putative alternative splicing events with cDNA supporting evidence:
▪ novel exon
▪ novel exons
▪ exon skipped
▪ exons skipped
▪ alternative splice donor (ASD)
▪ alternative splice acceptor (ASA)
▪ intron retention
▪ novel intron
Predictions of single exon skips with no supporting cDNA evidence
Intron/exon boundary regions with structural probes for prediction of ASA, ASD and intron retentions
To assess performance of the human GWSA, samples were selected in accordance with the MAQC project, which included human universal reference RNA (Sample A), human brain RNA (Sample B) and two titration samples (C and D; see methods; refer to fig. 1 in ). As with the MAQC project , we assumed a linear response for each probe across the four titration samples (A-D). The frequency distribution of the expression values for all 12 arrays (samples A-D run in triplicate) showed a normal distribution of signal as expected for the platform (Additional file 2-A). The reproducibility of the arrays was found to be highly concordant and produced low coefficients of variation (CV%) for each sample analyzed. The mean CV% were found to be slightly higher than the median values (5.5 versus 4.23, respectively), indicating a slight bias towards higher values. On a gross level, principal component analysis (Additional file 2-B) showed close grouping of the replicates.
Correlation of gene level analysis between the human GWSA and the Affymetrix HG-U133 Plus2.0 array.
# of Genes represented
1.5 Fold Change (FC)
Event analysis of genes grouped by overall gene expression.
Gene Expression Level
(median of F and T)
Fold Change Set
Subset of events with ≥3 Fold Change
x ≥ 3 or ≤ -3
-1.2 ≤ x ≤ 1.2
Analysis of gene expression has provided researchers with an important resource to identify genes involved in different biological processes, and has been used to generate profiles where the expression level of a predefined set of genes can help to identify and predict a variety of pathological states and prognoses [14, 15]. However, many of these studies have ignored the large diversity of transcripts that are generated from each locus by alternative splicing and miss the rich source of diverse transcripts. Researchers have treated each gene as a single entity which is inaccurate considering that potentially over 90% of genes undergo alternative splicing as evidenced by our analysis and recent reports [16, 17]. In an attempt to identify different transcripts, studies have been performed where probes have been reordered into new probe sets to define transcripts more accurately [18–20]. However these studies suffer from the same flaw, that the original design of the microarray was not focused on detecting alternative transcripts per se, but to provide an accurate measurement of the overall expression of each gene, not transcript. In contrast, the human Genome-Wide SpliceArray™ (GWSA) is designed to detect alternatively spliced events through the use of exon body and junction probes.
Since Johnson and co-workers  published the first human genome-wide microarray that utilized exon-exon junction probes, several groups have simultaneously produced microarrays to monitor splice events using similar probe designs [11, 22–24]. The GWSA microarray is the most comprehensive design commercially available with over 6 million probes. The array design consists of a maximum of 6 probe types targeted to the exon body, exon-exon junctions, and exon-intron/intron-exon junctions allowing more complete detection of transcripts, in contrast to those commercially available arrays that incorporate only exon body probes  or only junction probes . The SpliceArray™ probe design simultaneously monitors both isoforms for each splice event and is able to detect alternative exon, intron retention, alternative splice acceptor (ASA), and alternative splice donor (ASD) events. Additionally, the design has the ability to monitor putative novel skipped exons and unidentified ASA and ASD events. Although the design is theoretically capable of detecting all types of events, it does have the limitation where very short sequence differences (less than 8 bases) are not captured and these events have been filtered out of the collection. In addition, the microarray platform itself has limited space and even at the maximum number of features, only 6.54 million probes can be included which limits the number of probes per probe set that can be designed for any event whether evidenced or predicted. Furthermore, microarray platforms possess limitations on the sensitivity to detect very small changes in transcript expression. A consequence of manufacturing the GWSA content on the Affymetrix platform is that the cost for updating the array probe content is significant as it would require re-manufacturing a custom array and therefore recent discoveries of novel splice events cannot feasibly be added to the design. However, it should be pointed out that the GWSA monitors ~140,000 predicted exon skip events so although there was no evidence of these exon skip events at the time the GWSA content was generated, it indicates that the array has the potential to monitor newly discovered exon skip events.
Concurrent with the generation of microarray designs able to detect alternatively spliced transcripts, many analytical methods have been developed to determine the extent of alternative splicing and identify the expression level for each transcript within a gene. We used a simple approach to identify cases where the ratio of the inclusive and exclusive isoforms changed dramatically, by calculating the ratio of probes that were specific for the inclusive or long form (probe B) versus probes specific for the exclusive or short form (probe E). This approach was shown to provide very powerful results when assessed against PCR validation of the events. It is a simple and applicable approach that provides strong filtering methods for real positives, unlike other methodologies which require an extensive programming and array fitting .
Performance of the human GWSA product was evaluated in this study and the arrays were shown to be highly reproducible, suggesting that the reproducibility is more a function of the platform and labeling procedures than of the probe design. Analysis of the array data at the gene level proved highly concordant with data produced in the MAQC project on the HG-U133 Plus 2.0 array. Even though the platforms were slightly different (the HG-U133 Plus 2.0 array contains 11 micron features while the GWSA contains 5 micron features), the genes showed similar levels of expression, and importantly, comparable quantified changes between the universal and brain RNA samples. High concordance was observed between these similar platforms even though the probe sets were designed for different applications. Additionally, the GWSA and other SpliceArray™ products also provide splice event information available from the same experiment, exemplified by several recent reports [3, 4, 26–29].
Validation of specific splice events from the GWSA hybridizations was performed based on the analysis of the ratio of the long isoform to the short isoform. This type of assessment is an important aspect to determining the extent of alternative splicing as in many cases the ratio of isoforms will determine the biological response [30–33]. By assessing the statistical power of the ratio, we were able to validate a high rate of events; overall, 81% of statistically selected events were validated by RT-PCR. The high rate of validation was found regardless of the overall transcriptional changes identified. Interestingly, almost 60% of the events identified to have gene level, transcriptional alterations of 3 fold or more were found to have an event fold change of 3 fold or more. This indicates that not only is there a change in overall gene expression, but the transcripts produced are different as generated by alternative splicing. Surprisingly, genes that showed no change in their expression levels represented a rich source of alternatively spliced transcripts. Almost 10% of the events among these genes contained evidence of splicing alterations by changes in the ratio of the different isoforms. This is an important finding as it brings to point that genes which have been ignored because of equivocal expression between samples actually have important changes that occur in producing different isoforms. This confirms earlier evidence  that illustrates transcriptional activity is independent of splicing, even though the two processes function simultaneously and emphasizes the need to measure both overall gene expression and alternatively spliced transcripts for greater understanding of biological processes. The above findings should encourage researchers to look more closely at the genes which show no variation in overall gene expression for important clues into the mechanisms in pathophysiology and biology.
Alternative splicing affects not only the structure of the mRNA but ultimately the structure of the protein produced. Many exons encode protein domains that can be removed by an exon skip event . Such an event can produce proteins that lack a functional domain, dominant negative proteins, constitutively active isoforms, soluble homologues, or can lead to the regulation of the protein's overall activity by altering its global expression. The two examples, erythrocyte membrane protein band 4.1-like 3 (EPB41L3), and Lim domain 7 (LMO7) depicted here show how changes at the transcript level can be observed with or without gene expression changes, respectively. Multiple alternative splice variants have been previously described for LMO7  and the variant detected here results in deletion of the Lim domain, a protein-protein interaction domain found in many key regulators of development. EPB41L3 is potentially a critical growth regulator in meningioma pathogenesis  and is normally expressed at high levels in brain, with lower levels in kidney, intestine, and testis. The function of the EPB41L3 variant is unknown, yet contains some well described protein domains, including Ferm domains found in many cytoskeletal proteins, the N domain found in ubiquitin-like structural domain, and the C domain found in tyrosine phosphatases . Different isoforms certainly will affect the function of these different domains and potentially the protein, and may play a role in pathological states.
The identification of expressed transcripts is at the heart of expression analysis and these examples, as well as recently identified novel spliced isoforms demonstrating diverse activity [4, 6] illustrate the importance of correct determination of the expression of each transcript. Array platforms specifically designed to detect the differences in transcripts along with other novel technologies  will allow researchers to more fully explore the transcriptome under different physiological and pathological conditions. In short, we have provided a unique platform to detect isoform diversity in the human transcriptome by utilizing algorithmically designed novel exon body and splice junction probes to detect every possible isoform event. This approach on a genome-wide scale has been demonstrated to be highly reproducible and quantitative.
Total RNA was purchased from Stratagene (Universal Human Reference RNA, # 740000; sample A) (LaJolla, CA, USA) and Ambion (Human Brain Total RNA, # AM7962; sample B) (Austin, TX, USA). Total RNA quality and concentration were verified using the Agilent 2100 bioanalyzer (Agilent, Santa Clara, CA, USA). Four titration samples (A, B, C = 75%A+25%B, and D = 25%A+75%B) were generated as previously described  and each sample was analyzed in triplicate.
The content for the human Genome-Wide SpliceArray™ (GWSA) was designed using Build 35 of the NCBI Human (genome) reference sequence produced by the International Human Genome Sequencing Consortium and sequence data from the NCBI GenBank sequence database. Additional information was used from the NCBI Entrez Gene database in the representation of the genes. For each gene, a RefSeq was selected as the reference form, the selection considered which RefSeq (when more than one was available) covered the largest genomic area to maximize sequence inclusion (based on mapping against a genomic range overlapping the selected representative). Sequences used in the analysis were aligned to the human genome and data from this alignment was used to create distinct exon structures. Significant differences from the reference sequence were detected and probes were designed to monitor each difference or splice event as previously described . All splice events included on the GWSA were subject to manual review to remove those events that appeared to be indicated by sequence or alignment artifact. Additionally, splice events were removed with the following characteristics: those that resulted in very small changes (of a few nucleotides) to the transcript; where the required probes would not conform to Affymetrix manufacturing standards; and that would not be detected by the SpliceArray™ probe configuration. The human GWSA custom microarrays were manufactured by Affymetrix (Santa Clara, CA, USA) as a 49 format, 5 micron array containing over 6 million probes which monitor approximately 229,000 exons and 181,000 junctions. GC probe sets for background correction and positive and negative control probes were included on the array. The probes on the array are designed in the sense orientation and for use with the NuGEN WT-Ovation™ RNA Amplification Systems and FL-Ovation™ Biotin Module Products. For additional information on SpliceArray products and probe design, refer to http://www.splicearray.com.
MicroArray processing: Transcript amplification and labeling
Amplified cDNA was prepared using the WT-Ovation™ Pico RNA Amplification System (NuGen product #3300; Santa Clara, CA, USA) starting with 50 ng of each total RNA sample, as described by the manufacturer. Briefly, the RNA is synthesized into double-stranded cDNA using a SPIA™ DNA/RNA primer. The DNA portion of the primer binds poly (A) sequence or randomly across the transcript, and the RNA portion of the primer is incorporated into an unique DNA/RNA heteroduplex at one end of the cDNA. RNase H degrades the RNA portion of the DNA/RNA heteroduplex allowing for additional SPIA™ primer to bind and initiate replication by DNA polymerase. The process of SPIA™ DNA/RNA primer binding, DNA replication, strand displacement and RNA cleavage is repeated resulting in isothermal linear amplification of the cDNA. On average, the cDNA yield from 50 ng of high quality total RNA ranges from 6 to 10 micrograms. Fragmentation and biotin-labeling of the amplified cDNA was performed using the FL-Ovation™ cDNA Biotin Module V2 (NuGen product #4200) as per manufacturer's instructions.
Array hybridization, scanning, and data extraction
Prepared target for each sample was hybridized to the Affymetrix-formatted SpliceArray™ product using standard hybridization methods recommended by the manufacturer for the 49-format (Affymetrix - Expression analysis technical manual). The arrays were stained and washed using the Affymetrix FS450-0001 fluidics protocol prior to scanning with the Affymetrix GeneChip® Scanner 3000 7G. DAT and CEL images were visually inspected for anomalies and accurate grid placement. All data are available through the Gene Expression Omnibus (GEO) database as series accession number GSE17258.
The analysis described here was generated using Partek® software (http://www.partek.com/). For the comparative analysis, data for the HG-U133 Plus 2.0 GeneChip was taken from the MAQC data set publicly available at http://edkb.fda.gov/MAQC/MainStudy/upload/. All array data was pre-processed which included probe level RMA background correction, quantile normalization across all arrays, and Log2 transformation followed by median polish to summarize probes to obtain the overall score for each probe set. The data were filtered based on the expression values of each probe set within the triplicate set for each sample; if the expression value of a probe set was below 3.5 (log 2 value) the probe set was removed from the analysis. In order to identify probe sets that were significantly different between samples A and B, a one-way ANOVA statistical test was performed on normalized and filtered probe set level intensity values between each group to generate p-value and fold change data.
To identify genes that were significantly different between samples A and B, a list of non-redundant evidenced probe sets were imported into Partek® software. Non-logged median values for F and T evidenced probe sets with the same Entrez Gene ID were calculated to generate an overall gene score for each gene. After removing genes which were expressed at a low level, a one-way ANOVA statistical test was then performed on the gene level intensity values between sample groups A and B to generate p-values and fold changes for each gene.
In order to estimate gene level correlation between the human GWSA and the HG-U133 Plus 2.0 array data, both data sets were imported separately and subjected to the same pre-processing method. For the GWSA data, a gene score for each sample was generated as described above. After separately filtering the low-expressing genes in both data sets, a total of 7,147 common genes were identified. The log value of the ratio of sample A/sample B of gene values was calculated for each gene and platform, and a linear correlation coefficient was calculated between the GWSA and HG-U133 Plus 2.0.
A one-way ANOVA statistical test was subsequently performed on the log 2 based ratios between each of the sample groups to generate p-values and fold change of the B/E ratio between samples A and B. Similar methods were used to calculate C/E and D/E ratio, which provided similar data.
Using the human GWSA gene level intensity data, two lists were generated, those genes that displayed no differential expression change (-1.2 ≤ X ≤ 1.2) between samples A and B and a second group showing significant gene expression change, greater than 3 fold change. ANOVA analysis identified events within these two separate groups having significant B/E ratios between samples A and B. Top events for RT-PCR validation were selected by choosing events with the highest fold changes, p-values < 0.001, and that titrated all 4 samples correctly.
First strand cDNAs were prepared using a random priming protocol starting with 5 μg of the individual RNA samples using the High Capacity cDNA Archive Kit (Applied Biosystems; Foster City, CA, USA). cDNA was prepared from both the human normal brain and universal human reference RNA samples and the quality and consistency were assessed by determining the levels of ribosomal protein P0 by RT-PCR. Additionally, samples were normalized by the level of ribosomal P0 for validation of each splice event.
Lasergene/Primer Select (DNAstar) software was used for primer design. Primer sequences will be provided upon request. RT-PCR amplification was performed using a cDNA dilution with the equivalent of 17 ng of total RNA for each reaction. Promega 2× PCR Master Mix (Madison, WI, USA) was used for each 50 ul reaction with forward and reverse primer at a final concentration of 0.3 uM. A "touchdown" cycling program was used to amplify products (annealing temp 65°C-60°C, 5 cycles, and 60°C, 35 cycles). 12 μl of PCR products were analyzed using either 2% Agarose or 3% Metaphor gels.
Partek® Genomic Suites was used for all analysis. Copyright Partek and all other Partek Inc. product or service names are registered trademarks or trademarks of Partek Inc., St. Louis, MO, USA.
- Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J, Devon K, Dewar K, Doyle M, FitzHugh W, Funke R, Gage D, Harris K, Heaford A, Howland J, Kann L, Lehoczky J, LeVine R, McEwan P, McKernan K, Meldrim J, Mesirov JP, Miranda C, Morris W, Naylor J, Raymond C, Rosetti M, Santos R, Sheridan A, Sougnez C, Stange-Thomann N, Stojanovic N, Subramanian A, Wyman D, Rogers J, Sulston J, Ainscough R, Beck S, Bentley D, Burton J, Clee C, Carter N, Coulson A, Deadman R, Deloukas P, Dunham A, Dunham I, Durbin R, French L, Grafham D, Gregory S, Hubbard T, Humphray S, Hunt A, Jones M, Lloyd C, McMurray A, Matthews L, Mercer S, Milne S, Mullikin JC, Mungall A, Plumb R, Ross M, Shownkeen R, Sims S, Waterston RH, Wilson RK, Hillier LW, McPherson JD, Marra MA, Mardis ER, Fulton LA, Chinwalla AT, Pepin KH, Gish WR, Chissoe SL, Wendl MC, Delehaunty KD, Miner TL, Delehaunty A, Kramer JB, Cook LL, Fulton RS, Johnson DL, Minx PJ, Clifton SW, Hawkins T, Branscomb E, Predki P, Richardson P, Wenning S, Slezak T, Doggett N, Cheng JF, Olsen A, Lucas S, Elkin C, Uberbacher E, Frazier M, Gibbs RA, Muzny DM, Scherer SE, Bouck JB, Sodergren EJ, Worley KC, Rives CM, Gorrell JH, Metzker ML, Naylor SL, Kucherlapati RS, Nelson DL, Weinstock GM, Sakaki Y, Fujiyama A, Hattori M, Yada T, Toyoda A, Itoh T, Kawagoe C, Watanabe H, Totoki Y, Taylor T, Weissenbach J, Heilig R, Saurin W, Artiguenave F, Brottier P, Bruls T, Pelletier E, Robert C, Wincker P, Smith DR, Doucette-Stamm L, Rubenfield M, Weinstock K, Lee HM, Dubois J, Rosenthal A, Platzer M, Nyakatura G, Taudien S, Rump A, Yang H, Yu J, Wang J, Huang G, Gu J, Hood L, Rowen L, Madan A, Qin S, Davis RW, Federspiel NA, Abola AP, Proctor MJ, Myers RM, Schmutz J, Dickson M, Grimwood J, Cox DR, Olson MV, Kaul R, Raymond C, Shimizu N, Kawasaki K, Minoshima S, Evans GA, Athanasiou M, Schultz R, Roe BA, Chen F, Pan H, Ramser J, Lehrach H, Reinhardt R, McCombie WR, de la Bastide M, Dedhia N, Blöcker H, Hornischer K, Nordsiek G, Agarwala R, Aravind L, Bailey JA, Bateman A, Batzoglou S, Birney E, Bork P, Brown DG, Burge CB, Cerutti L, Chen HC, Church D, Clamp M, Copley RR, Doerks T, Eddy SR, Eichler EE, Furey TS, Galagan J, Gilbert JG, Harmon C, Hayashizaki Y, Haussler D, Hermjakob H, Hokamp K, Jang W, Johnson LS, Jones TA, Kasif S, Kaspryzk A, Kennedy S, Kent WJ, Kitts P, Koonin EV, Korf I, Kulp D, Lancet D, Lowe TM, McLysaght A, Mikkelsen T, Moran JV, Mulder N, Pollara VJ, Ponting CP, Schuler G, Schultz J, Slater G, Smit AF, Stupka E, Szustakowski J, Thierry-Mieg D, Thierry-Mieg J, Wagner L, Wallis J, Wheeler R, Williams A, Wolf YI, Wolfe KH, Yang SP, Yeh RF, Collins F, Guyer MS, Peterson J, Felsenfeld A, Wetterstrand KA, Patrinos A, Morgan MJ, de Jong P, Catanese JJ, Osoegawa K, Shizuya H, Choi S, Chen YJ, Szustakowki J: Initial sequencing and analysis of the human genome. Nature. 2001, 409: 860-921. 10.1038/35057062.View ArticlePubMedGoogle Scholar
- Black DL: Mechanisms of alternative pre-messenger RNA splicing. Annu Rev Biochem. 2003, 72: 291-336. 10.1146/annurev.biochem.72.121801.161720.View ArticlePubMedGoogle Scholar
- Heinzen EL, Yoon W, Weale ME, Sen A, Wood NW, Burke JR, Welsh-Bohmer KA, Hulette CM, Sisodiya SM, Goldstein DB: Alternative ion channel splicing in mesial temporal lobe epilepsy and Alzheimer's disease. Genome Biol. 2007, 8: R32-10.1186/gb-2007-8-3-r32.PubMed CentralView ArticlePubMedGoogle Scholar
- Einstein R, Jordan H, Zhou W, Brenner M, Moses EG, Liggett SB: Alternative splicing of the G protein-coupled receptor superfamily in human airway smooth muscle diversifies the complement of receptors. Proc Natl Acad Sci USA. 2008, 105: 5230-5. 10.1073/pnas.0801319105.PubMed CentralView ArticlePubMedGoogle Scholar
- Kozyrev SV, Abelson A, Wojcik J, Zaghlool A, Linga Reddy MVP, Sanchez E, Gunnarsson I, Svenungsson E, Sturfelt G, Jönsen A, Truedsson L, Pons-Estel BA, Witte T, D'Alfonso S, Barizzone N, Barrizzone N, Danieli MG, Gutierrez C, Suarez A, Junker P, Laustrup H, González-Escribano MF, Martin J, Abderrahim H, Alarcón-Riquelme ME: Functional variants in the B-cell gene BANK1 are associated with systemic lupus erythematosus. Nat Genet. 2008, 40: 211-6. 10.1038/ng.79.View ArticlePubMedGoogle Scholar
- Christofk HR, Heiden Vander MG, Harris MH, Ramanathan A, Gerszten RE, Wei R, Fleming MD, Schreiber SL, Cantley LC: The M2 splice isoform of pyruvate kinase is important for cancer metabolism and tumour growth. Nature. 2008, 452: 230-3. 10.1038/nature06734.View ArticlePubMedGoogle Scholar
- Boue S, Letunic I, Bork P: Alternative splicing and evolution. Bioessays. 2003, 25: 1031-4. 10.1002/bies.10371.View ArticlePubMedGoogle Scholar
- Calarco JA, Xing Y, Cáceres M, Calarco JP, Xiao X, Pan Q, Lee C, Preuss TM, Blencowe BJ: Global analysis of alternative splicing differences between humans and chimpanzees. Genes Dev. 2007, 21: 2963-75. 10.1101/gad.1606907.PubMed CentralView ArticlePubMedGoogle Scholar
- Shi L, Reid LH, Jones WD, Shippy R, Warrington JA, Baker SC, Collins PJ, de Longueville F, Kawasaki ES, Lee KY, Luo Y, Sun YA, Willey JC, Setterquist RA, Fischer GM, Tong W, Dragan YP, Dix DJ, Frueh FW, Goodsaid FM, Herman D, Jensen RV, Johnson CD, Lobenhofer EK, Puri RK, Schrf U, Thierry-Mieg J, Wang C, Wilson M, Wolber PK, Zhang L, Amur S, Bao W, Barbacioru CC, Lucas AB, Bertholet V, Boysen C, Bromley B, Brown D, Brunner A, Canales R, Cao XM, Cebula TA, Chen JJ, Cheng J, Chu T, Chudin E, Corson J, Corton JC, Croner LJ, Davies C, Davison TS, Delenstarr G, Deng X, Dorris D, Eklund AC, Fan X, Fang H, Fulmer-Smentek S, Fuscoe JC, Gallagher K, Ge W, Guo L, Guo X, Hager J, Haje PK, Han J, Han T, Harbottle HC, Harris SC, Hatchwell E, Hauser CA, Hester S, Hong H, Hurban P, Jackson SA, Ji H, Knight CR, Kuo WP, LeClerc JE, Levy S, Li Q, Liu C, Liu Y, Lombardi MJ, Ma Y, Magnuson SR, Maqsodi B, McDaniel T, Mei N, Myklebost O, Ning B, Novoradovskaya N, Orr MS, Osborn TW, Papallo A, Patterson TA, Perkins RG, Peters EH, Peterson R, Philips KL, Pine PS, Pusztai L, Qian F, Ren H, Rosen M, Rosenzweig BA, Samaha RR, Schena M, Schroth GP, Shchegrova S, Smith DD, Staedtler F, Su Z, Sun H, Szallasi Z, Tezak Z, Thierry-Mieg D, Thompson KL, Tikhonova I, Turpaz Y, Vallanat B, Van C, Walker SJ, Wang SJ, Wang Y, Wolfinger R, Wong A, Wu J, Xiao C, Xie Q, Xu J, Yang W, Zhang L, Zhong S, Zong Y, Slikker W: The MicroArray Quality Control (MAQC) project shows inter- and intraplatform reproducibility of gene expression measurements. Nat Biotechnol. 2006, 24: 1151-61. 10.1038/nbt1239.View ArticlePubMedGoogle Scholar
- Canales RD, Luo Y, Willey JC, Austermiller B, Barbacioru CC, Boysen C, Hunkapiller K, Jensen RV, Knight CR, Lee KY, Ma Y, Maqsodi B, Papallo A, Peters EH, Poulter K, Ruppel PL, Samaha RR, Shi L, Yang W, Zhang L, Goodsaid FM: Evaluation of DNA microarray results with quantitative gene expression platforms. Nat Biotechnol. 2006, 24: 1115-22. 10.1038/nbt1236.View ArticlePubMedGoogle Scholar
- Fehlbaum P, Guihal C, Bracco L, Cochet O: A microarray configuration to quantify expression levels and relative abundance of splice variants. Nucleic Acids Res. 2005, 33: e47-10.1093/nar/gni047.PubMed CentralView ArticlePubMedGoogle Scholar
- Pando MP, Kotraiah V, McGowan K, Bracco L, Einstein R: Alternative isoform discrimination by the next generation of expression profiling microarrays. Expert Opin Ther Targets. 2006, 10: 613-25. 10.1517/14728188.8.131.523.View ArticlePubMedGoogle Scholar
- Shippy R, Fulmer-Smentek S, Jensen RV, Jones WD, Wolber PK, Johnson CD, Pine PS, Boysen C, Guo X, Chudin E, Sun YA, Willey JC, Thierry-Mieg J, Thierry-Mieg D, Setterquist RA, Wilson M, Lucas AB, Novoradovskaya N, Papallo A, Turpaz Y, Baker SC, Warrington JA, Shi L, Herman D: Using RNA sample titrations to assess microarray platform performance and normalization techniques. Nat Biotechnol. 2006, 24: 1123-31. 10.1038/nbt1241.PubMed CentralView ArticlePubMedGoogle Scholar
- Vijver van de MJ, He YD, van't Veer LJ, Dai H, Hart AAM, Voskuil DW, Schreiber GJ, Peterse JL, Roberts C, Marton MJ, Parrish M, Atsma D, Witteveen A, Glas A, Delahaye L, Velde van der T, Bartelink H, Rodenhuis S, Rutgers ET, Friend SH, Bernards R: A gene-expression signature as a predictor of survival in breast cancer. N Engl J Med. 2002, 347: 1999-2009. 10.1056/NEJMoa021967.View ArticlePubMedGoogle Scholar
- Naderi A, Teschendorff AE, Barbosa-Morais NL, Pinder SE, Green AR, Powe DG, Robertson JFR, Aparicio S, Ellis IO, Brenton JD, Caldas C: A gene-expression signature to predict survival in breast cancer across independent data sets. Oncogene. 2007, 26: 1507-16. 10.1038/sj.onc.1209920.View ArticlePubMedGoogle Scholar
- Pan Q, Shai O, Lee LJ, Frey BJ, Blencowe BJ: Deep surveying of alternative splicing complexity in the human transcriptome by high-throughput sequencing. Nat Genet. 2008, 40: 1413-5. 10.1038/ng.259.View ArticlePubMedGoogle Scholar
- Wang ET, Sandberg R, Luo S, Khrebtukova I, Zhang L, Mayr C, Kingsmore SF, Schroth GP, Burge CB: Alternative isoform regulation in human tissue transcriptomes. Nature. 2008, 456: 470-6. 10.1038/nature07509.PubMed CentralView ArticlePubMedGoogle Scholar
- Hu GK, Madore SJ, Moldover B, Jatkoe T, Balaban D, Thomas J, Wang Y: Predicting splice variant from DNA chip expression data. Genome Res. 2001, 11: 1237-45. 10.1101/gr.165501.PubMed CentralView ArticlePubMedGoogle Scholar
- Fan W, Khalid N, Hallahan AR, Olson JM, Zhao LP: A statistical method for predicting splice variants between two groups of samples using GeneChip expression array data. Theor Biol Med Model. 2006, 3: 19-10.1186/1742-4682-3-19.PubMed CentralView ArticlePubMedGoogle Scholar
- Lu J, Lee JC, Salit ML, Cam MC: Transcript-based redefinition of grouped oligonucleotide probe sets using AceView: high-resolution annotation for microarrays. BMC Bioinformatics. 2007, 8: 108-10.1186/1471-2105-8-108.PubMed CentralView ArticlePubMedGoogle Scholar
- Johnson JM, Castle J, Garrett-Engele P, Kan Z, Loerch PM, Armour CD, Santos R, Schadt EE, Stoughton R, Shoemaker DD: Genome-wide survey of human alternative pre-mRNA splicing with exon junction microarrays. Science. 2003, 302: 2141-4. 10.1126/science.1090100.View ArticlePubMedGoogle Scholar
- Pan Q, Shai O, Misquitta C, Zhang W, Saltzman AL, Mohammad N, Babak T, Siu H, Hughes TR, Morris QD, Frey BJ, Blencowe BJ: Revealing global regulatory features of mammalian alternative splicing using a quantitative microarray platform. Mol Cell. 2004, 16: 929-41. 10.1016/j.molcel.2004.12.004.View ArticlePubMedGoogle Scholar
- Le K, Mitsouras K, Roy M, Wang Q, Xu Q, Nelson SF, Lee C: Detecting tissue-specific regulation of alternative splicing as a qualitative change in microarray data. Nucleic Acids Res. 2004, 32: e18-10.1093/nar/gnh173.View ArticleGoogle Scholar
- Bingham JL, Carrigan PE, Miller LJ, Srinivasan S: Extent and diversity of human alternative splicing established by complementary database annotation and microarray analysis. OMICS. 2008, 12: 83-92. 10.1089/omi.2007.0041.View ArticlePubMedGoogle Scholar
- Bemmo A, Benovoy D, Kwan T, Gaffney D, Jensen R, Majewski J: Gene Expression and Isoform Variation Analysis using Affymetrix Exon Arrays. BMC Genomics. 2008, 9: 529-10.1186/1471-2164-9-529.PubMed CentralView ArticlePubMedGoogle Scholar
- Lee JH, Horak CE, Khanna C, Meng Z, Yu LR, Veenstra TD, Steeg PS: Alterations in Gemin5 expression contribute to alternative mRNA splicing patterns and tumor cell motility. Cancer Res. 2008, 68: 639-44. 10.1158/0008-5472.CAN-07-2632.PubMed CentralView ArticlePubMedGoogle Scholar
- Filippov V, Schmidt EL, Filippova M, Duerksen-Hughes PJ: Splicing and splice factor SRp55 participate in the response to DNA damage by changing isoform ratios of target genes. Gene. 2008, 420: 34-41. 10.1016/j.gene.2008.05.008.PubMed CentralView ArticlePubMedGoogle Scholar
- Hewetson A, Wright-Pastusek AE, Helmer RA, Wesley KA, Chilton BS: Conservation of inter-protein binding sites in RUSH and RFBP, an ATP11B isoform. Mol Cell Endocrinol. 2008, 292: 79-86. 10.1016/j.mce.2008.05.007.PubMed CentralView ArticlePubMedGoogle Scholar
- Novoyatleva T, Heinrich B, Tang Y, Benderska N, Butchbach MER, Lorson CL, Lorson MA, Ben-Dov C, Fehlbaum P, Bracco L, Burghes AHM, Bollen M, Stamm S: Protein phosphatase 1 binds to the RNA recognition motif of several splicing factors and regulates alternative pre-mRNA processing. Hum Mol Genet. 2008, 17: 52-70. 10.1093/hmg/ddm284.View ArticlePubMedGoogle Scholar
- Lord KA, Creasy CL, King AG, King C, Burns BM, Lee JC, Dillon SB: REDK, a novel human regulatory erythroid kinase. Blood. 2000, 95: 2838-46.PubMedGoogle Scholar
- Meng J, Tsai-Morris C, Dufau ML: Human prolactin receptor variants in breast cancer: low ratio of short forms to the long-form human prolactin receptor associated with mammary carcinoma. Cancer Res. 2004, 64: 5677-82. 10.1158/0008-5472.CAN-04-1019.View ArticlePubMedGoogle Scholar
- Frey UH, Nückel H, Dobrev D, Manthey I, Sandalcioglu IE, Eisenhardt A, Worm K, Hauner H, Siffert W: Quantification of G protein Gaalphas subunit splice variants in different human tissues and cells using pyrosequencing. Gene Expr. 2005, 12: 69-81. 10.3727/000000005783992124.View ArticlePubMedGoogle Scholar
- Diernfellner ACR, Schafmeier T, Merrow MW, Brunner M: Molecular mechanism of temperature sensing by the circadian clock of Neurospora crassa. Genes Dev. 2005, 19: 1968-73. 10.1101/gad.345905.PubMed CentralView ArticlePubMedGoogle Scholar
- Ip JY, Tong A, Pan Q, Topp JD, Blencowe BJ, Lynch KW: Global analysis of alternative splicing during T-cell activation. RNA. 2007, 13: 563-72. 10.1261/rna.457207.PubMed CentralView ArticlePubMedGoogle Scholar
- Xing Y, Lee CJ: Protein modularity of alternatively spliced exons is associated with tissue-specific regulation of alternative splicing. PLoS Genet. 2005, 1: e34-10.1371/journal.pgen.0010034.PubMed CentralView ArticlePubMedGoogle Scholar
- Putilina T, Jaworski C, Gentleman S, McDonald B, Kadiri M, Wong P: Analysis of a human cDNA containing a tissue-specific alternatively spliced LIM domain. Biochem Biophys Res Commun. 1998, 252: 433-9. 10.1006/bbrc.1998.9656.View ArticlePubMedGoogle Scholar
- Tran YK, Bögler O, Gorse KM, Wieland I, Green MR, Newsham IF: A novel member of the NF2/ERM/4.1 superfamily with growth suppressing properties in lung cancer. Cancer Res. 1999, 59: 35-43.PubMedGoogle Scholar
- Geer LY, Domrachev M, Lipman DJ, Bryant SH: CDART: protein homology by domain architecture. Genome Res. 2002, 12: 1619-23. 10.1101/gr.278202.PubMed CentralView ArticlePubMedGoogle Scholar
- Mane S, Evans C, Cooper K, Crasta O, Folkerts O, Hutchison S, Harkins T, Thierry-Mieg D, Thierry-Mieg J, Jensen R: Transcriptome sequencing of the Microarray Quality Control (MAQC) RNA reference samples using next generation sequencing. BMC Genomics. 2009, 10: 264-10.1186/1471-2164-10-264.PubMed CentralView ArticlePubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.