Learning Anonymized Representations with Adversarial Neural Networks

Abstract : Statistical methods protecting sensitive information or the identity of the data owner have become critical to ensure privacy of individuals as well as of organizations. This paper investigates anonymization methods based on representation learning and deep neural networks, and motivated by novel information theoretical bounds. We introduce a novel training objective for simultaneously training a predictor over target variables of interest (the regular labels) while preventing an intermediate representation to be predictive of the private labels. The architecture is based on three sub-networks: one going from input to representation, one from representation to predicted regular labels, and one from representation to predicted private labels. The training procedure aims at learning representations that preserve the relevant part of the information (about regular labels) while dismissing information about the private labels which correspond to the identity of a person. We demonstrate the success of this approach for two distinct classification versus anonymization tasks (handwritten digits and sentiment analysis).
Complete list of metadatas

https://hal-centralesupelec.archives-ouvertes.fr/hal-01742447
Contributor : Pablo Piantanida <>
Submitted on : Saturday, March 24, 2018 - 11:58:46 PM
Last modification on : Monday, June 24, 2019 - 2:36:08 PM

Links full text

Identifiers

  • HAL Id : hal-01742447, version 1
  • ARXIV : 1802.09386

Citation

Clément Feutry, Pablo Piantanida, Yoshua Bengio, Pierre Duhamel. Learning Anonymized Representations with Adversarial Neural Networks. 2018. ⟨hal-01742447⟩

Share

Metrics

Record views

218