Computational Information Geometry for Binary Classification of High-Dimensional Random Tensors

Gia-Thuy Pham; Remy Boyer; Frank Nielsen

doi:10.3390/e20030203

Journal article, Entropy, Year: 2018


Gia-Thuy Pham

Role: Author
PersonId : 17830
IdHAL : gia-thuy-pham

Laboratoire des signaux et systèmes

Remy Boyer

Role: Author
PersonId : 179288
IdHAL : remyboyer160131
ORCID : 0000-0002-7170-350X
IdRef : 131238620

Laboratoire des signaux et systèmes

Frank Nielsen

Role: Author
PersonId : 953872

Laboratoire d'informatique de l'École polytechnique [Palaiseau]

Abstract

Evaluating the performance of Bayesian classification in a high-dimensional random tensor is a fundamental problem that is usually difficult and under-studied. In this work, we consider two Signal-to-Noise Ratio (SNR)-based binary classification problems of interest. Under the alternative hypothesis, i.e., for a non-zero SNR, the observed signal is either (i) a noisy rank-R tensor admitting a Q-order Canonical Polyadic Decomposition (CPD) with large factors of size N_q × R for 1 ≤ q ≤ Q, where R, N_q → ∞ with R^(1/q)/N_q converging towards a finite constant, or (ii) a noisy tensor admitting a Tucker Decomposition (TKD) of multilinear (M_1, ..., M_Q)-rank with large factors of size N_q × M_q for 1 ≤ q ≤ Q, where N_q, M_q → ∞ with M_q/N_q converging towards a finite constant. The classification of the random entries (coefficients) of the core tensor in the CPD/TKD is hard to study because the exact derivation of the minimal Bayes error probability is mathematically intractable. To circumvent this difficulty, the Chernoff Upper Bound (CUB) for larger SNR and the Fisher information at low SNR are derived and studied, based on information geometry theory. The tightest CUB is reached for the value, denoted by s, that minimizes the error exponent. In general, due to the asymmetry of the s-divergence, the Bhattacharyya Upper Bound (BUB), that is, the Chernoff information calculated at s = 1/2, cannot solve this problem effectively. As a consequence, we rely on a costly numerical optimization strategy to find s. However, thanks to powerful random matrix theory tools, a simple analytical expression of s is provided with respect to the SNR in the two schemes considered. This work shows that the BUB is the tightest bound at low SNRs. For higher SNRs, however, this property no longer holds.
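The relation between the CUB and the BUB described in the abstract can be illustrated numerically with a minimal sketch. It does not reproduce the CPD/TKD models of the paper. It assumes a simplified test with equal priors between two zero-mean Gaussians, H0: y ~ N(0, I) and H1: y ~ N(0, I + snr · Φ Φᵀ), where Φ is a hypothetical N × R random matrix. The names `chernoff_exponent`, `error_bounds`, `Phi`, `N` and `R` are illustrative choices, and a plain grid search stands in for the numerical optimization over s. The exponent is written here as μ(s) = −log ∫ p0^(1−s) p1^s, so the tightest bound corresponds to the largest μ(s), which is the same as minimizing the log of the Chernoff coefficient.

```python
import numpy as np


def chernoff_exponent(s, snr, eigvals):
    """mu(s) = -log int p0^(1-s) p1^s for p0 = N(0, I), p1 = N(0, I + snr * Phi Phi^T).

    eigvals holds the non-zero eigenvalues of Phi Phi^T; zero eigenvalues
    contribute nothing to the sum and can be omitted.
    """
    x = snr * eigvals
    return 0.5 * np.sum(np.log1p((1.0 - s) * x) - (1.0 - s) * np.log1p(x))


def error_bounds(snr, eigvals, grid=np.linspace(0.0, 1.0, 1001)):
    """Grid search for s, returning (s_star, Chernoff bound, Bhattacharyya bound).

    Equal priors are assumed, so each bound is 0.5 * exp(-mu(s)).
    """
    mu = np.array([chernoff_exponent(s, snr, eigvals) for s in grid])
    k = int(np.argmax(mu))  # largest exponent gives the tightest bound
    cub = 0.5 * np.exp(-mu[k])
    bub = 0.5 * np.exp(-chernoff_exponent(0.5, snr, eigvals))
    return float(grid[k]), float(cub), float(bub)


# Hypothetical sensing matrix with i.i.d. N(0, 1/N) entries (an assumption).
rng = np.random.default_rng(0)
N, R = 40, 20
Phi = rng.standard_normal((N, R)) / np.sqrt(N)
eigvals = np.linalg.svd(Phi, compute_uv=False) ** 2

for snr in (1e-3, 1.0, 100.0):
    s_star, cub, bub = error_bounds(snr, eigvals)
    print(f"snr={snr:g}  s*={s_star:.3f}  CUB={cub:.4g}  BUB={bub:.4g}")
```

In this toy model the maximizing s stays close to 1/2 at low SNR, so the two bounds nearly coincide, and it moves away from 1/2 as the SNR grows, which is the qualitative behaviour the abstract reports for the tensor models.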

Keywords

optimal Bayesian detection; information geometry; minimal error probability; Chernoff/Bhattacharyya upper bound; large random tensor; Fisher information; large random sensing matrix

Domains

Other [stat.ML]

Main file

entropy-20-00203.pdf (519.77 KB)

Origin: Publisher files allowed on an open archive

https://centralesupelec.hal.science/hal-01758637

Submitted on: Wednesday, April 4, 2018, 16:19:00

Last modified on: Sunday, March 17, 2024, 11:46:04

Dates and versions

hal-01758637 , version 1 (04-04-2018)

Identifiers

HAL Id : hal-01758637 , version 1
DOI : 10.3390/e20030203

Cite

Gia-Thuy Pham, Remy Boyer, Frank Nielsen. Computational Information Geometry for Binary Classification of High-Dimensional Random Tensors. Entropy, 2018, 20 (3), pp.203. ⟨10.3390/e20030203⟩. ⟨hal-01758637⟩

Collections

X CNRS LIX SUP_LSS X-LIX X-DEP-INFO SUP_SIGNAUX CENTRALESUPELEC CRISTAL CRISTAL-SIGMA UNIV-PARIS-SACLAY GS-ENGINEERING GS-COMPUTER-SCIENCE

175 views

192 downloads
