Learning Optimal Control Strategies from Interactions with a PADAS

Fabio Tango; Raghav Aras; Olivier Pietquin

doi:10.1007/978-88-470-1821-1_12

Conference paper, Year: 2011

Fabio Tango

Role: Author

Centro Ricerche Fiat

Raghav Aras

Role: Author
PersonId: 830439

SUPELEC-Campus Metz

Olivier Pietquin

Role: Author
PersonId: 4024
IdHAL: olivier-pietquin
ORCID: 0000-0002-5386-465X
IdRef: 142821861

SUPELEC-Campus Metz

Georgia Tech Lorraine [Metz]

Abstract

This paper addresses the problem of finding an optimal warning and intervention strategy (WIS) for a partially autonomous driver assistance system (PADAS). An optimal WIS is defined here as one that minimizes the probability of collision with a leading vehicle while keeping the number of warnings and interventions as low as possible, so as not to distract the driver. The paper proposes a novel approach to this problem: finding the optimal WIS is treated as a sequential decision-making problem. The adopted point of view comes from machine learning, where the framework for optimal sequential decision making is the Reinforcement Learning (RL) paradigm.
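To make the sequential decision-making framing concrete, the following is a minimal sketch of tabular Q-learning on a toy warning problem. It is not the authors' model: the discretized gap levels, the transition probabilities in `P_CLOSE`, the cost values, and all function names are hypothetical choices made only for illustration. The reward penalizes collisions heavily and warnings and interventions lightly, mirroring the trade-off described in the abstract.

```python
import random

# Hypothetical toy model: every number below is an illustrative assumption,
# not a value taken from the paper.
NONE, WARN, INTERVENE = 0, 1, 2
ACTIONS = (NONE, WARN, INTERVENE)
N_GAPS = 4  # gap 0 = collision (terminal), gap 3 = safe following distance
P_CLOSE = {NONE: 0.6, WARN: 0.3, INTERVENE: 0.05}     # chance the gap shrinks one level
ACTION_COST = {NONE: 0.0, WARN: 1.0, INTERVENE: 3.0}  # penalty for disturbing the driver
COLLISION_COST = 100.0


def step(gap, action, rng):
    """Sample the next gap level and the reward for one decision step."""
    if rng.random() < P_CLOSE[action]:
        next_gap = gap - 1
    else:
        next_gap = min(gap + 1, N_GAPS - 1)
    reward = -ACTION_COST[action]
    if next_gap == 0:
        reward -= COLLISION_COST
    return next_gap, reward


def q_learning(episodes=5000, max_steps=50, alpha=0.1, gamma=0.95,
               epsilon=0.1, seed=0):
    """Learn a table of action values from simulated interactions."""
    rng = random.Random(seed)
    q = [[0.0] * len(ACTIONS) for _ in range(N_GAPS)]
    for _episode in range(episodes):
        gap = N_GAPS - 1
        for _t in range(max_steps):
            # Epsilon-greedy exploration.
            if rng.random() < epsilon:
                action = rng.choice(ACTIONS)
            else:
                action = max(ACTIONS, key=lambda a: q[gap][a])
            next_gap, reward = step(gap, action, rng)
            # A collision is terminal, so no future value is bootstrapped from it.
            future = 0.0 if next_gap == 0 else max(q[next_gap])
            q[gap][action] += alpha * (reward + gamma * future - q[gap][action])
            if next_gap == 0:
                break
            gap = next_gap
    return q


def greedy_policy(q):
    """Pick the highest-valued action for every gap level."""
    return [max(ACTIONS, key=lambda a: q[g][a]) for g in range(N_GAPS)]


q_table = q_learning()
policy = greedy_policy(q_table)
```

Here `policy[g]` is the learned action for gap level `g` (the entry for the terminal level 0 is never updated and is meaningless). Under these assumed costs, one would expect the learned strategy to stay silent at a safe distance and to act when a collision is imminent; changing `ACTION_COST` relative to `COLLISION_COST` shifts that balance.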

Domains

Machine Learning [cs.LG]

Contributor: Sébastien Van Luchene

https://centralesupelec.hal.science/hal-00618402

Submitted on: Thursday, September 1, 2011, 15:59:16

Last modified on: Thursday, April 13, 2023, 09:26:12

Dates and versions

hal-00618402, version 1 (01-09-2011)

Identifiers

HAL Id: hal-00618402, version 1
DOI: 10.1007/978-88-470-1821-1_12

Cite

Fabio Tango, Raghav Aras, Olivier Pietquin. Learning Optimal Control Strategies from Interactions with a PADAS. HMAT 2010, Jun 2010, Belgirate, Italy. pp.119-127, ⟨10.1007/978-88-470-1821-1_12⟩. ⟨hal-00618402⟩

Collections

SUPELEC CNRS UNIV-FCOMTE CENTRALESUPELEC UMI-GTL
