Managing Uncertainty within the KTD Framework

Matthieu Geist; Olivier Pietquin

Communication Dans Un Congrès Année : 2011

Managing Uncertainty within the KTD Framework

(1) , (1, 2)

1
2

Matthieu Geist

Fonction : Auteur
PersonId : 6945
IdHAL : matthieu-geist

SUPELEC-Campus Metz

Olivier Pietquin

Fonction : Auteur
PersonId : 4024
IdHAL : olivier-pietquin
ORCID : 0000-0002-5386-465X
IdRef : 142821861

SUPELEC-Campus Metz

Georgia Tech Lorraine [Metz]

Résumé

The dilemma between exploration and exploitation is an important topic in reinforcement learning (RL). Most successful approaches in addressing this problem tend to use some uncertainty information about values estimated during learning. On another hand, scalability is known as being a lack of RL algorithms and value function approximation has become a major topic of research. Both problems arise in real-world applications, however few approaches allow approximating the value function while maintaining uncertainty information about estimates. Even fewer use this information in the purpose of addressing the exploration/exploitation dilemma. In this paper, we show how such an uncertainty information can be derived from a Kalman-based Temporal Differences (KTD) framework and how it can be used.

Mots clés

Value function approximation active learning exploration/exploitation dilemma

Sébastien Van Luchene : Connectez-vous pour contacter le contributeur

https://centralesupelec.hal.science/hal-00599636

Soumis le : vendredi 10 juin 2011-14:21:05

Dernière modification le : jeudi 13 avril 2023-09:26:12

Dates et versions

hal-00599636 , version 1 (10-06-2011)

Identifiants

HAL Id : hal-00599636 , version 1

Citer

Matthieu Geist, Olivier Pietquin. Managing Uncertainty within the KTD Framework. Active Learning and Experimental Design workshop in conjunction with AISTATS 2010, May 2010, Sardinia, Italy. pp.157-168. ⟨hal-00599636⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

SUPELEC CNRS UNIV-FCOMTE CENTRALESUPELEC UMI-GTL

84 Consultations

0 Téléchargements

Managing Uncertainty within the KTD Framework

Résumé

Mots clés

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager