Kernel Generalized Canonical Correlation Analysis - CentraleSupélec
Journal Article, Computational Statistics and Data Analysis, Year: 2015

Kernel Generalized Canonical Correlation Analysis

Arthur Tenenhaus
Cathy Philippe
Vincent Frouin

Abstract

There is a growing need to analyze datasets characterized by several sets of variables observed on a single set of observations. Such complex but structured datasets are known as multiblock datasets, and their analysis requires the development of new and flexible tools. For this purpose, Kernel Generalized Canonical Correlation Analysis (KGCCA) is proposed; it offers a general framework for multiblock data analysis that takes into account an a priori graph of connections between blocks. It appears that KGCCA subsumes, with a single monotonically convergent algorithm, a remarkably large number of well-known and new methods as particular cases. KGCCA is applied to a simulated 3-block dataset and to a real molecular biology dataset that combines Gene Expression data, Comparative Genomic Hybridization data and a qualitative phenotype measured for a set of 53 children with glioma. KGCCA is available on CRAN as part of the RGCCA package.

Dates and versions

hal-01238943, version 1 (07-12-2015)

Identifiers

Cite

Arthur Tenenhaus, Cathy Philippe, Vincent Frouin. Kernel Generalized Canonical Correlation Analysis. Computational Statistics and Data Analysis, 2015, 90 (C), pp.114-131. ⟨10.1016/j.csda.2015.04.004⟩. ⟨hal-01238943⟩