Managing Uncertainty within the KTD Framework - CentraleSupélec
Conference Papers, Year: 2011

Managing Uncertainty within the KTD Framework

Matthieu Geist, Olivier Pietquin

Abstract

The dilemma between exploration and exploitation is an important topic in reinforcement learning (RL). Most successful approaches to this problem use some uncertainty information about the values estimated during learning. On the other hand, scalability is a known weakness of RL algorithms, and value function approximation has become a major topic of research. Both problems arise in real-world applications; however, few approaches allow approximating the value function while maintaining uncertainty information about the estimates. Even fewer use this information to address the exploration/exploitation dilemma. In this paper, we show how such uncertainty information can be derived from a Kalman-based Temporal Differences (KTD) framework and how it can be used.
Dates and versions

hal-00599636, version 1 (10-06-2011)

Identifiers

  • HAL Id: hal-00599636, version 1

Cite

Matthieu Geist, Olivier Pietquin. Managing Uncertainty within the KTD Framework. Active Learning and Experimental Design workshop in conjunction with AISTATS 2010, May 2010, Sardinia, Italy. pp.157-168. ⟨hal-00599636⟩