Kernel Generalized Canonical Correlation Analysis

Abstract : There is a growing need to analyze datasets characterized by several sets of variables observed on a single set of observations. Such complex but structured dataset are known as multiblock dataset, and their analysis requires the development of new and flexible tools. For this purpose, Kernel Generalized Canonical Correlation Analysis (KGCCA) is proposed and offers a general framework for multiblock data analysis taking into account an a priori graph of connections between blocks. It appears that KGCCA subsumes, with a single monotonically convergent algorithm, a remarkably large number of well-known and new methods as particular cases. KGCCA is applied to a simulated 33-block dataset and a real molecular biology dataset that combines Gene Expression data, Comparative Genomic Hybridization data and a qualitative phenotype measured for a set of 5353 children with glioma. KGCCA is available on CRAN as part of the RGCCA package.
Complete list of metadatas

https://hal-centralesupelec.archives-ouvertes.fr/hal-01238943
Contributor : Alexandra Siebert <>
Submitted on : Monday, December 7, 2015 - 12:25:54 PM
Last modification on : Friday, March 8, 2019 - 1:20:13 AM

Links full text

Identifiers

Citation

Arthur Tenenhaus, Cathy Philippe, Vincent Frouin. Kernel Generalized Canonical Correlation Analysis. Computational Statistics and Data Analysis, Elsevier, 2015, 90 (C), pp.114-131. ⟨10.1016/j.csda.2015.04.004⟩. ⟨hal-01238943⟩

Share

Metrics

Record views

463