English   español  
Please use this identifier to cite or link to this item: http://hdl.handle.net/10261/30568
logo share SHARE   Add this article to your Mendeley library MendeleyBASE
Visualizar otros formatos: MARC | Dublin Core | RDF | ORE | MODS | METS | DIDL
Exportar a otros formatos:

DC FieldValueLanguage
dc.contributor.authorPorta, Josep M.-
dc.contributor.authorVlassis, Nikos-
dc.contributor.authorSpaan, Matthijs T. J.-
dc.contributor.authorPoupart, Pascal-
dc.identifier.citationJournal of Machine Learning Research 7: 2329-2367 (2006)-
dc.description.abstractWe propose a novel approach to optimize Partially Observable Markov Decisions Processes (POMDPs) defined on continuous spaces. To date, most algorithms for model-based POMDPs are restricted to discrete states, actions, and observations, but many real-world problems such as, for instance, robot navigation, are naturally defined on continuous spaces. In this work, we demonstrate that the value function for continuous POMDPs is convex in the beliefs over continuous state spaces, and piecewise-linear convex for the particular case of discrete observations and actions but still continuous states. We also demonstrate that continuous Bellman backups are contracting and isotonic ensuring the monotonic convergence of value-iteration algorithms. Relying on those properties, we extend the PERSEUS algorithm, originally developed for discrete POMDPs, to work in continuous state spaces by representing the observation, transition, and reward models using Gaussian mixtures, and the beliefs using Gaussian mixtures or particle sets. With these representations, the integrals that appear in the Bellman backup can be computed in closed form and, therefore, the algorithm is computationally feasible. Finally, we further extend PERSEUS to deal with continuous action and observation sets by designing effective sampling approaches.-
dc.description.sponsorshipThis work was supported by the project 'Perception, action & cognition through learning of object-action complexes.' (4915). Josep M. Porta has been partially supported by a Ramón y Cajal contract from the Spanish government and by the EU PACO-PLUS Project FP6-2004-IST-4-27657. Nikos Vlassis and Matthijs Spaan are supported by PROGRESS, the embedded systems research program of the Dutch organization for Scientific Research NWO, the Dutch Ministry of Economic Affairs and the Technology Foundation STW, project AES5414. Pascal Poupart is supported by the Canada’s National Science and Engineering Research Council.-
dc.publisherMassachusetts Institute of Technology-
dc.relation.isversionofPublisher's version-
dc.subjectPlanning under uncertainty-
dc.subjectContinuous state space-
dc.subjectContinuous action space-
dc.subjectContinuous observation space-
dc.titlePoint-based value iteration for continuous POMDPs-
dc.description.peerreviewedPeer Reviewed-
dc.contributor.funderEuropean Commission-
Appears in Collections:(IRII) Artículos
Files in This Item:
File Description SizeFormat 
Point-based value.pdf515,49 kBAdobe PDFThumbnail
Show simple item record

WARNING: Items in Digital.CSIC are protected by copyright, with all rights reserved, unless otherwise indicated.