Detection for gene-gene co-association via kernel canonical correlation analysis

Yuan, Zhongshang; Gao, Qingsong; He, Yungang; Zhang, Xiaoshuai; Li, Fangyu; Zhao, Jinghua; Xue, Fuzhong

doi:10.1186/1471-2156-13-83

Research article
Open access
Published: 08 October 2012

Detection for gene-gene co-association via kernel canonical correlation analysis

Zhongshang Yuan¹,
Qingsong Gao²,
Yungang He^3,4,
Xiaoshuai Zhang¹,
Fangyu Li¹,
Jinghua Zhao⁵ &
…
Fuzhong Xue¹

BMC Genetics volume 13, Article number: 83 (2012) Cite this article

4461 Accesses
22 Citations
1 Altmetric
Metrics details

Abstract

Background

Currently, most methods for detecting gene-gene interaction (GGI) in genomewide association studies (GWASs) are limited in their use of single nucleotide polymorphism (SNP) as the unit of association. One way to address this drawback is to consider higher level units such as genes or regions in the analysis. Earlier we proposed a statistic based on canonical correlations (CCU) as a gene-based method for detecting gene-gene co-association. However, it can only capture linear relationship and not nonlinear correlation between genes. We therefore proposed a counterpart (KCCU) based on kernel canonical correlation analysis (KCCA).

Results

Through simulation the KCCU statistic was shown to be a valid test and more powerful than CCU statistic with respect to sample size and interaction odds ratio. Analysis of data from regions involving three genes on rheumatoid arthritis (RA) from Genetic Analysis Workshop 16 (GAW16) indicated that only KCCU statistic was able to identify interactions reported earlier.

Conclusions

KCCU statistic is a valid and powerful gene-based method for detecting gene-gene co-association.

Background

Genome-wide association studies (GWASs), which may involve a large number of single nucleotide polymorphisms (SNPs) on many individuals, are widely used to identify genetic variants underlying complex diseases or other types of traits. Although a primary interest in a GWAS is to identify SNPs associated with a trait of interest, it is important to consider the associate genes and their co-association as well. One form of co-association is epistasis, which was introduced approximately 100 years ago and generally defined as interactions among genes [1]. These are linked to gene-gene interactions (GGIs) which are often characterized to be functional, compositional and statistical [2]. The statistical definition was given by Fisher [3] and developed further by Cockerham [4] and Kempthorne [5], whereby the effect of GGIs is treated as deviation from additive genetic effects of single genes [6].

Methods to detect GGIs on the basis of the statistical definition include but are unlimited to logistic regression, multifactor dimensionality reduction [7], linkage disequilibrium (LD)-based [8, 9] and entropy-based statistics [10, 11], together with others implemented in PLINK[12], Tuning ReliefF [13], Random Jungle[14], BEAM[15] and BOOST[16]. However, most of these consider SNP as the unit of association, which has limitations and are insufficient for interpretation of GGI [17] which calls for consideration of higher level units such as genes or regions in the analysis. Gene-based analysis can account for multiple independent functional variants within genes with a potential increase of power to identify GGI. Earlier, Peng et al. [17] proposed a gene-based statistic (CCU statistic) for detecting gene-gene co-association based on canonical correlation analysis (CCA) in a case–control study, which was defined as joint effect of genes contributing to a binary trait and proved to have good performance on detecting gene-gene co-association or GGI. However, CCA can only detect linear correlation, and may be inappropriate for genomic data containing nonlinear structure. Recent years have witnessed considerable work and successes on kernel CCA (KCCA) as a nonlinear generalization of the classical CCA in machine learning, face recognition, data classification [18–20], and notably genomic data analysis by Yamanishi et al. [19]. We here construct a kernel CCU (KCCU) statistic for detecting gene-gene co-association and evaluate its performance via simulations and data analysis.

Methods

CCA

CCA is a classical multivariate method which concerns about linear dependencies between sets of variables. Let X_i, Y_i (i = 1, . . ., m) denote samples of measurements on m objects, We assume the data to be column centred. Let A be any m × n matrix then L(A) = {Aα|α ∈ Rⁿ} will be referred to as the column-space and L(A^T) = {A^Tα|α ∈ R^m} the row-space of A. The aim of canonical correlation analysis is to determine vectors ν_j ∈ L(X^T) and ω_j ∈ L(Y^T) such that a_j = Xν_j and b_j = Yω_j are maximally correlated. $c o r (a_{j}, b_{j}) = \frac{〈a_{j}, b_{j}〉}{||a_{j}|| \cdot ||b_{j}||}$ with 〈〉 indicating inner product. Usually, this is formulated as a constrained optimization problem $\begin{array}{l} arg max {ν_{j}}^{T} X^{T} Y ω_{j} \\ ν_{j} \in L (X^{T}), ω_{j} \in L (Y^{T}) \end{array}$ subject to ${ν_{j}}^{T} X^{T} X ν_{j} = {ω_{j}}^{T} Y^{T} Y ω_{j} = 1$ which yields the first pair of canonical vectors (ν₁, ω₁) and a₁ = Xν₁, b₁ = Yω₁ are the corresponding canonical variates and their correlation is called the maximum canonical coefficient. Pairs of canonical vectors (ν_j, ω_j) can be recursively defined by maximizing similar expression and keeping subsequent variates orthogonal to those previously obtained. CCA can be interpreted as constructing pairs of factors from X and Y, respectively by linear combination of the variables involved, as a way to account for linear dependencies between sets of variables.

KCCA

KCCA generalizes CCA as follows: Objects x_i and y_i are first mapped to some Hilbert spaces H_x and H_y through mapping Φ_x(.) and Φ_y(.), CCA is then performed on images {Φ_x(x_i)}_i = 1^m and {Φ_y(y_i)}_i = 1^m. Let K_x and K_y denote m × m kernel inner product matrices (also known as kernel gram matrices), constructed element-wise as ${(Κ_{x})}_{ij} = 〈Φ_{x} (x_{i}), Φ_{x} (x_{j})〉$ and ${(Κ_{y})}_{ij} = 〈Φ_{y} (y_{i}), Φ_{y} (y_{j})〉$ . Analogous to CCA, the aim of KCCA is to find canonical vectors in terms of expansion coefficients α_j, β_j ∈ R^m as a constrained optimization problem $\begin{array}{l} arg max {α_{j}}^{T} Κ_{X} Κ_{Y} β_{j} \\ α_{j}, β_{j} \in R^{m} \end{array}$ subject to ${α_{j}}^{T} Κ_{X} Κ_{X} α_{j} = {β_{j}}^{T} Κ_{Y} Κ_{Y} β_{j} = 1$ .

Explicit form for the mapping Φ_x(.) and Φ_y(.) are not always required but the kernel K_x and K_y need to be fixed. Common kernel functions include linear, polynomial, radial basis function (RBF), sigmoid [21], identical-by-state and weighted identical-by-state kernels [22]. It is worthwhile to note that these kernel functions generally have similar performance with parameters that are appropriately chosen.

Test statistic

Strategy analogous to CCU statistic was used to construct the KCCU statistic except that the maximum kernel canonical coefficient of the two genes, rather than the maximum canonical coefficient, was taken as a measure of gene-gene co-association in cases and controls. Let genotyped data of case–control study be (X₁^D, X₂^D, …, X_P^D) and (Y₁^D, Y₂^D, …, Y_q^D) for gene A and gene B for cases, and (X₁^C, X₂^C, …, X_P^C) and (Y₁^C, Y₂^C, …, Y_q^C) for controls. The maximum kernel canonical coefficient κr_D between (X₁^D, X₂^D, …, X_P^D) and (Y₁^D, Y₂^D, …, Y_q^D) obtained through KCCA could be considered as a measurement of gene-based gene–gene co-association in cases, and κr_C between (X₁^C, X₂^C, …, X_P^C) and (Y₁^C, Y₂^C, …, Y_q^C) be a measurement of gene–gene co-association in controls. The transformation analogous to Fisher’s simple correlation coefficient transformation was done to κr_D and κr_C, i.e. $κ z_{D} = \frac{1}{2} (log (1 + κ r_{D}) - log (1 - κ r_{D}))$ and $κ z_{C} = \frac{1}{2} (log (1 + κ r_{C}) - log (1 - κ r_{C}))$ .

The KCCU statistic for detecting statistical significance of the difference of gene-based gene-gene co-association between cases and controls can be defined as $U = \frac{κ z_{D} - κ z_{C}}{\sqrt{var (κ z_{D}) + var (κ z_{C})}}$ , which is approximately N(0,1). With the difficulty in obtaining an explicit form for var(κz_D) and var(κz_C), a bootstrap procedure was employed. Seeing that the performance of kernel methods strongly relates to the choice of kernel functions and their parameters, we chose the RBF kernel owing to its flexibility in parameter specification [23]. In general, two approaches are popular: 1. via empirically assigning candidate values for the parameter(s) involved subject to a learning algorithm for the best performance; 2. via some cross-validation procedure. Both are computer intensive [24].

Data simulation

Simulation studies were conducted to assess the performance of KCCU relative to CCU under both the null (H₀) and alternative hypotheses (H₁), which were based on the HapMap data in the following steps:

Step 1. Phased haplotype (Phases 1 & 2 of CEU) data were downloaded from the HapMap web site (http://snp.cshl.org) on two unlinked genome regions for generating the simulated genotypes. The GNPDA2 region is on Chr 4: 44401210.44410098 involving six SNPs while FAIM2 region is on Chr 12: 48571829.48583937 involving seven SNPs. Their LD patterns where shown in Figures 1 and 2 together with pairwise r².

Step 2. Based on data above, large samples with 100, 000 cases and 100, 000 controls were generated using software gs2.0 [25] under a two-locus interaction multiplicative effects model (see Additional file 1), treating the 2^nd SNP of the first region and the SNP of the other as the causal variants and they were removed in the simulation to assess gene-gene co-association. The interaction odds ratio was set as 1.0 under H₀ and 1.1, 1.2, 1.3, 1.4, 1.5 under H₁. The SNPs in the regions were coded according to an additive genetic model. To further investigate the performance on causal SNPs with respect to minor allele frequency and LD, different SNP pairs from the two gene regions were defined as the casual variants.

Step 3. From the remaining SNPs, simulated data were sampled and CCU and KCCU performed under various sample sizes N (N/2 cases and N/2 controls, N = 1000…5000) with R package kernlab (http://cran.r-project.org/web/packages/kernlab/index.html). 500 simulations were repeated each with a significant level of 0.05.

Applications

The proposed KCCU statistic was applied to rheumatoid arthritis (RA) data from GAW16 Problem 1, consisting of 2,062 Illumina 550k SNP chips from 868 RA patients and 1,194 normal controls collected by the North American Rheumatoid Arthritis Consortium [26]. Three genes (C5, ITGAV, and VEGFA) on three different chromosomes were selected to detect gene-gene co-association in this work, involving eight, eight and four SNPs, respectively. Logistic regression test and the CCU statistic were also used. For each pair of genes, the statistic which yielded the minimum p value was recorded from all pairs of SNPs one on each gene. The significance of the statistic was compared to its empirical distribution generated from 1,000 permutations by permuting case–control labels [27] which is relatively easy compared to the “BY” method [28] for multiple testing adjustment.

Results

Simulation

Shown in Table 1 are simulation results under H₀. The KCCU statistic is normally distributed according to the one sample Kolmogorov-Smirnov test with the type I error rates of KCCU statistic being close to given nominal value (α = 0.05) for different sample sizes. This indicates that the proposed statistic performs well under the null hypothesis.

Table 1 Performance of CCU and KCCU under the null hypothesis

Full size table

Results on various interaction odds ratios and a sample size of 3,000 are shown in Figure 3, as with different sample sizes with an interaction odds ratio of 1.4 in Figure 4. It is clear that power of KCCU is a monotonically increasing function of sample size and interaction odds ratio. Figure 5 shows results with different SNP pairs defined as causal SNPs with an interaction odds ratio of 1.3. The power of KCCU statistic was higher than that of CCU statistic. Power as a function of interaction odds ratio for different sample size is provided as Additional file 1.

Application

The performance of logistic regression test, CCU and KCCU statistics on pair-wise gene-gene co-association of three genes is shown in Table 2, which also contains results on the Gaussian RBF kernels with various parameter values (σ=0.05, 0.5, 5 and 50). Through KCCU the three genes were shown to have co-association with each other at significance level 0.05 regardless the parameter value, in contrast to the CCU statistics showing no significant co-association and none of the SNP pairs were significant under logistic regression test with correction for multiple testing.

Table 2 P-values of gene-gene co-association among C5 , ITGAV and VEGFA

Full size table

Discussion

We have extended the CCU statistic to a new statistic KCCU, which can extract nonlinear correlation between two genes. Simulation studies show that both CCU and KCCU statistics performed well under null hypothesis with KCCU being more powerful than CCU with respect to significant level, sample size and relative risk. As results vary with user-defined kernel parameter, various parameters were used (the bandwidth parameter in RBF kernel) to RA data in GAW16 Problem 1, showing that the logistic regression test and CCU statistic failed to detect any interaction but KCCU statistics identified the pair-wise interactions among the three genes under various parameters. The interaction between ITGAV and VEGF genes has been identified by a rank method [29]. As suggested by a reviewer, it is critical to consider time-efficiency in genome-wide association studies to make the proposed methods practical. In our case, the computing time as required for KCCU was about 2.5 times slower than CCU, but nevertheless will still be feasible with the development as well as the extensive applications of multiprocessor and multithreading computational technique.

A reviewer has also suggested us to reiterate the relationship between gene-gene co-association and GGI which is readily available. GGI generally refers to the synergetic or antagonistic effect of two genes in addition to the summation of their independent effects on an outcome. To represent the interaction between two genes A and B in a case–control association study, a product term is customarily added to the logistic regression model Logit(P) = β₀ + β₁A + β₂B + γA × B so that γ reflects both the direction and size of the interaction. This model implicitly assumes that gene A and gene B are independent so as to infer interaction (γ). However, it might well be that genes are correlated with each other in genetic networks to contribute to disease susceptibility, so the independence assumption is rarely ratified. Gene-gene co-association extends the concept of GGI in that it describes the generic joint distribution of two gene effects on disease or trait without assuming either independence or linear relationship. Here the measurement of the co-association between genes is based on the correlation between genes (such as CCU statistic and KCCU statistic), provides a measure of the contribution of two genes. As for two unlinked genes, their relationship can be described as either co-association or interaction. The reviewer has also brought to our attention to earlier work by Song and Nicolae [30] on imposing natural restrictions for the parameter space and discussion on the definition of “no interaction” between two unlinked loci as two loci being independent conditioned on the subject having the disease. In this paper, the null hypothesis of the proposed test is that there is no gene-gene co-association (i.e. GGI for two unlinked genes), the data under the null hypothesis are generated from the gs software with the interaction odds ratio parameter to be one.

Several issues remain to be resolved: the uncertainty to set the kernel function with appropriate parameters for each data, the undesirable performance of both CCU and KCCU with small interaction odds ratio (e.g. 1.1), and the possible failure of maximum kernel canonical correlation coefficient to represent gene-gene co-association.

Conclusions

KCCU statistic is a valid and powerful gene-based method for detecting gene-gene co-association compared to CCU and logistic regression test. Further work is needed to make its use in GWAS more practical.

References

Bateson W: Mendel's principles of heredity. Mendel's principles of heredity. 1909. 1902
Google Scholar
Phillips PC: Epistasis-the essential role of gene interactions in the structure and evolution of genetic systems. Nat Rev Genet. 2008, 9: 855-867. 10.1038/nrg2452.
Article PubMed Central CAS PubMed Google Scholar
Fisher RA: The correlation between relatives on the supposition of Mendelian inheritance. Trans R Soc Edinb. 1918, 52: 399-433.
Article Google Scholar
Cockerham CC: An extension of the concept of partitioning hereditary variance for analysis of covariances among relatives when epistasis is present. Genetics. 1954, 39 (6): 859-
PubMed Central CAS PubMed Google Scholar
Kempthorne O: The correlation between relatives in a random mating population. Proc R Soc Lond B Biol Sci. 1954, 143 (910): 103-10.1098/rspb.1954.0056.
Article Google Scholar
Cordell HJ: Epistasis: what it means, what it doesn't mean, and statistical methods to detect it in humans. Hum Mol Genet. 2002, 11 (20): 2463-10.1093/hmg/11.20.2463.
Article CAS PubMed Google Scholar
Ritchie MD, Hahn LW, Roodi N, Bailey LR, Dupont WD, Parl FF, Moore JH: Multifactor-dimensionality reduction reveals high-order interactions among estrogen-metabolism genes in sporadic breast cancer. Am J Hum Genet. 2001, 69 (1): 138-147. 10.1086/321276.
Article PubMed Central CAS PubMed Google Scholar
Wu X, Jin L, Xiong M: Composite measure of linkage disequilibrium for testing interaction between unlinked loci. Eur J Hum Genet. 2008, 16 (5): 644-651. 10.1038/sj.ejhg.5202004.
Article CAS PubMed Google Scholar
Zhao J, Jin L, Xiong M: Test for interaction between two unlinked loci. Am J Hum Genet. 2006, 79 (5): 831-845. 10.1086/508571.
Article PubMed Central CAS PubMed Google Scholar
Dong C, Chu X, Wang Y, Jin L, Shi T, Huang W, Li Y: Exploration of gene-gene interaction effects using entropy-based methods. Eur J Hum Genet. 2007, 16 (2): 229-235.
Article PubMed Google Scholar
Kang G, Yue W, Zhang J, Cui Y, Zuo Y, Zhang D: An entropy-based approach for testing genetic epistasis underlying complex diseases. J Theor Biol. 2008, 250 (2): 362-374. 10.1016/j.jtbi.2007.10.001.
Article CAS PubMed Google Scholar
Purcell S, Neale B, Todd-Brown K, Thomas L, Ferreira MA, Bender D, Maller J, Sklar P, de Bakker PI, Daly MJ, Sham PC: PLINK: a tool set for wholegenome association and population-based linkage analyses. Am J Hum Genet. 2007, 81: 559-575. 10.1086/519795.
Article PubMed Central CAS PubMed Google Scholar
Moore J, White B: Tuning reliefF for genomewide genetic analysis. Lect Notes Comput Sci. 2007, 4447: 166-175. 10.1007/978-3-540-71783-6_16.
Article Google Scholar
Schwarz D, Ko’ Nig I, Ziegler A: On safari to random jungle: A fast implementation of random forests for high dimensional data. Bioinformatics. 2010, 26: 1752-1758. 10.1093/bioinformatics/btq257.
Article PubMed Central CAS PubMed Google Scholar
Zhang Y, Liu JS: Bayesian inference of epistatic interactions in case–control studies. Nat Genet. 2007, 39: 1167-1173. 10.1038/ng2110.
Article CAS PubMed Google Scholar
Wan X, Yang C, Yang Q, Xue H, Fan X, Tang N, Yu W: BOOST: A Fast Approach to Detecting Gene-Gene Interactions in Genome-wide Case–control Studies. Am J Hum Genet. 2010, 87: 325-340. 10.1016/j.ajhg.2010.07.021.
Article PubMed Central CAS PubMed Google Scholar
Peng Q, Zhao J, Xue F: A gene-based method for detecting gene-gene co-association in a case–control association study. Eur J Hum Genet. 2009, 18 (5): 582-587.
Article PubMed Central PubMed Google Scholar
Zheng W, Zhou X, Zou C, Zhao L: Facial expression recognition using kernel canonical correlation analysis (KCCA). Neural Networks IEEE Trans. 2006, 17 (1): 233-238. 10.1109/TNN.2005.860849.
Article Google Scholar
Yamanishi Y, Vert JP, Nakaya A, Kanehisa M: Extraction of correlated gene clusters from multiple genomic data by generalized kernel canonical correlation analysis. Bioinformatics. 2003, 19 (suppl 1): i323-10.1093/bioinformatics/btg1045.
Article PubMed Google Scholar
Zepeda JAY, Davoine F, Charbit M: Visapp 2007: Proceedings of the Second International Conference on Computer Vision Theory and Applications, Volume Iu/Mtsv. Face tracking using canonical correlation analysis. 2007, 396-402.
Google Scholar
Liu Z, Chen D, Bensmail H: Gene expression data classification with Kernel principal component analysis. J Biomed Biotechnol. 2005, 2: 155-159.
Article Google Scholar
Wu MC, Kraft P, Epstein MP, Taylor DM, Chanock SJ, Hunter DJ, Lin X: Powerful SNP-set analysis for case–control genome-wide association studies. Am J Hum Genet. 2010, 86 (6): 929-942. 10.1016/j.ajhg.2010.05.002.
Article PubMed Central CAS PubMed Google Scholar
Nguyen VH, Golinval JC: Fault detection based on Kernel Principal Component Analysis. Eng Struct. 2010, 32 (11): 3683-3691. 10.1016/j.engstruct.2010.08.012.
Article Google Scholar
Zhang DQ, Zhou ZH, Chen SC: Adaptive kernel principal component analysis with unsupervised learning of kernels. Proceedings of the 6th IEEE International Conference on Data Mining (ICDM). 2006
Google Scholar
Li J, Chen Y: Generating samples for association studies based on HapMap data. BMC Bioinforma. 2008, 9 (1): 44-10.1186/1471-2105-9-44.
Article Google Scholar
Plenge RM, Seielstad M, Padyukov L, Lee AT, Remmers EF, Ding B, Liew A, Khalili H, Chandrasekaran A, Davies LRL, et al: TRAF1-C5 as a risk locus for rheumatoid arthritis - A genomewide study. New Engl J Med. 2007, 357 (12): 1199-1209. 10.1056/NEJMoa073491.
Article PubMed Central CAS PubMed Google Scholar
Westfall PH, Young SS: Resampling-based multiple testing. 1993, New York: Wiley
Google Scholar
Benjamini Y, Yekutieli D: The control of the false discovery rate in multiple testing under dependency. Ann Stat. 2001, 29: 1165-1188. 10.1214/aos/1013699998.
Article Google Scholar
Huang CH, Cong L, Xie J, Qiao B, Lo SH, Zheng T: Rheumatoid arthritis-associated gene-gene interaction network for rheumatoid arthritis candidate genes. BMC Proc. 2009, 3 (Suppl 7): S75-10.1186/1753-6561-3-s7-s75.
Article PubMed Central PubMed Google Scholar
Song M, Nicolae D: Restricted parameter space models for testing gene-gene interaction. Genet Epidemiol. 2009, 33: 386-393. 10.1002/gepi.20392.
Article PubMed Central PubMed Google Scholar

Download references

Acknowledgements

This work was supported by the grant from National Natural Science Foundation of China (30871392) and Young Talents Innovation Foundation of School of Public Health, Shandong University. We thank GAW16 and the North American Rheumatoid Arthritis Consortium for the RA data and two anonymous reviewers for suggestions which led to substantial improvement and clarification of the paper.

Author information

Authors and Affiliations

Department of Epidemiology and Health Statistics, School of Public Health, Shandong University, Jinan, 250012, China
Zhongshang Yuan, Xiaoshuai Zhang, Fangyu Li & Fuzhong Xue
Berlin Institute for Medical Systems Biology, Max-Delbrück-Center for Molecular Medicine, 13125, Berlin, Germany
Qingsong Gao
CAS-MPG Partner Institute for Computational Biology, Shanghai Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai, 200031, China
Yungang He
Key Laboratory of Computational Biology, CAS-MPG Partner Institute for Computational Biology, Chinese Academy of Sciences, Shanghai, 200031, China
Yungang He
MRC Epidemiology Unit, Institute of Metabolic Science, Addenbrooke’s Hospital, Cambridge, UK
Jinghua Zhao

Authors

Zhongshang Yuan
View author publications
You can also search for this author in PubMed Google Scholar
Qingsong Gao
View author publications
You can also search for this author in PubMed Google Scholar
Yungang He
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoshuai Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Fangyu Li
View author publications
You can also search for this author in PubMed Google Scholar
Jinghua Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Fuzhong Xue
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Fuzhong Xue.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors’ contributions

ZSY, QSG, YGH, XSZ, FYL, JHZ and FZX conceptualized the study, acquired and analyzed the data and prepared for the manuscript. All authors approved the final manuscript.

Zhongshang Yuan, Qingsong Gao contributed equally to this work.

Electronic supplementary material

Additional file 1: Two-locus interaction multiplicative effects model.(DOCX 1 MB)

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Authors’ original file for figure 4

Authors’ original file for figure 5

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Yuan, Z., Gao, Q., He, Y. et al. Detection for gene-gene co-association via kernel canonical correlation analysis. BMC Genet 13, 83 (2012). https://doi.org/10.1186/1471-2156-13-83

Download citation

Received: 18 March 2012
Accepted: 28 September 2012
Published: 08 October 2012
DOI: https://doi.org/10.1186/1471-2156-13-83

Detection for gene-gene co-association via kernel canonical correlation analysis