Safe robot execution in model-based reinforcement learning

Martínez, David; Alenyà, Guillem; Torras, Carme

Please use this identifier to cite or link to this item: http://hdl.handle.net/10261/133222


Title:	Safe robot execution in model-based reinforcement learning
Author:	Martínez, David; Alenyà, Guillem; Torras, Carme
Publication date:	2015
Publisher:	Institute of Electrical and Electronics Engineers
Citation:	IROS 2015
Abstract:	Task learning in robotics requires repeatedly executing the same actions in different states to learn the model of the task. However, in real-world domains, there are usually sequences of actions that, if executed, may produce unrecoverable errors (e.g. breaking an object). Robots should avoid repeating such errors when learning, and thus explore the state space in a more intelligent way. This requires identifying dangerous action effects to avoid including such actions in the generated plans, while at the same time enforcing that the learned models are complete enough for the planner not to fall into dead-ends. We thus propose a new learning method that allows a robot to reason about dead-ends and their causes. Some such causes may be dangerous action effects (i.e., leading to unrecoverable errors if the action were executed in the given state) so that the method allows the robot to skip the exploration of risky actions and guarantees the safety of planned actions. If a plan might lead to a dead-end (e.g., one that includes a dangerous action effect), the robot tries to find an alternative safe plan and, if not found, it actively asks a teacher whether the risky action should be executed. This method permits learning safe policies as well as minimizing unrecoverable errors during the learning process. Experimental validation of the approach is provided in two different scenarios: a robotic task and a simulated problem from the international planning competition. Our approach greatly increases success ratios in problems where previous approaches had high probabilities of failing.
Description:	Paper presented at the International Conference on Intelligent Robots and Systems, held in Hamburg (Germany) from 28 September to 2 October 2015.
Publisher version:	http://dx.doi.org/10.1109/IROS.2015.7354295
URI:	http://hdl.handle.net/10261/133222
DOI:	10.1109/IROS.2015.7354295
Identifiers:	doi: 10.1109/IROS.2015.7354295
Appears in collections:	(IRII) Comunicaciones congresos
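
The plan-selection step outlined in the abstract (prefer a safe plan, otherwise ask a teacher) can be illustrated with a minimal Python sketch. This is not the authors' implementation: the names `choose_plan`, `risky_steps` and `ask_teacher`, the representation of a plan as a list of (state, action) pairs, and the assumption that dangerous pairs are already known as a set are all hypothetical choices made for illustration only.

```python
from typing import Callable, Iterable, List, Optional, Sequence, Set, Tuple

Step = Tuple[str, str]   # (state, action); hypothetical labels, not from the paper
Plan = Sequence[Step]


def risky_steps(plan: Plan, dangerous: Set[Step]) -> List[Step]:
    """Return the steps of `plan` that match a known dangerous (state, action) pair."""
    return [step for step in plan if step in dangerous]


def choose_plan(
    candidates: Iterable[Plan],
    dangerous: Set[Step],
    ask_teacher: Callable[[Step], bool],
) -> Optional[Plan]:
    """Prefer a plan with no dangerous step; otherwise defer to the teacher."""
    candidates = list(candidates)
    # 1. Look for an alternative plan that avoids every known dangerous step.
    for plan in candidates:
        if not risky_steps(plan, dangerous):
            return plan
    # 2. No safe plan exists: ask the teacher about each risky plan in turn.
    for plan in candidates:
        if all(ask_teacher(step) for step in risky_steps(plan, dangerous)):
            return plan
    # 3. The teacher refused every risky action (or there were no plans).
    return None
```

For example, with `dangerous = {("holding_glass", "drop")}`, a candidate plan that drops the glass is skipped in favour of one that places it on the table, and the teacher callback is consulted only when every candidate contains a dangerous step.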

Files in this item:

File	Description	Size	Format
reinforcement learning.pdf		694.99 kB	Unknown


Page view(s): 236 (checked on 23 Apr 2024)
Download(s): 487 (checked on 23 Apr 2024)
