Por favor, use este identificador para citar o enlazar a este item: http://hdl.handle.net/10261/30568
COMPARTIR / EXPORTAR:
logo share SHARE BASE
Visualizar otros formatos: MARC | Dublin Core | RDF | ORE | MODS | METS | DIDL | DATACITE

Invitar a revisión por pares abierta
Título

Point-based value iteration for continuous POMDPs

AutorPorta, Josep M. CSIC ORCID ; Vlassis, Nikos; Spaan, Matthijs T. J.; Poupart, Pascal
Palabras clavePlanning under uncertainty
Continuous state space
Continuous action space
Continuous observation space
Fecha de publicación2006
EditorMassachusetts Institute of Technology
CitaciónJournal of Machine Learning Research 7: 2329-2367 (2006)
ResumenWe propose a novel approach to optimize Partially Observable Markov Decisions Processes (POMDPs) defined on continuous spaces. To date, most algorithms for model-based POMDPs are restricted to discrete states, actions, and observations, but many real-world problems such as, for instance, robot navigation, are naturally defined on continuous spaces. In this work, we demonstrate that the value function for continuous POMDPs is convex in the beliefs over continuous state spaces, and piecewise-linear convex for the particular case of discrete observations and actions but still continuous states. We also demonstrate that continuous Bellman backups are contracting and isotonic ensuring the monotonic convergence of value-iteration algorithms. Relying on those properties, we extend the PERSEUS algorithm, originally developed for discrete POMDPs, to work in continuous state spaces by representing the observation, transition, and reward models using Gaussian mixtures, and the beliefs using Gaussian mixtures or particle sets. With these representations, the integrals that appear in the Bellman backup can be computed in closed form and, therefore, the algorithm is computationally feasible. Finally, we further extend PERSEUS to deal with continuous action and observation sets by designing effective sampling approaches.
Versión del editorhttp://jmlr.org/papers/v7/porta06a.html
URIhttp://hdl.handle.net/10261/30568
ISSN1532-4435
Aparece en las colecciones: (IRII) Artículos




Ficheros en este ítem:
Fichero Descripción Tamaño Formato
Point-based value.pdf515,49 kBAdobe PDFVista previa
Visualizar/Abrir
Mostrar el registro completo

CORE Recommender

Page view(s)

293
checked on 22-abr-2024

Download(s)

109
checked on 22-abr-2024

Google ScholarTM

Check


NOTA: Los ítems de Digital.CSIC están protegidos por copyright, con todos los derechos reservados, a menos que se indique lo contrario.