Exploration in POMDPs

Authors	C. Dimitrakakis
Publication date	2008
Series	IAS technical reports, IAS-UVA-08-01
Number of pages	8
Publisher	Amsterdam: Informatics Institute
Organisations	Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract	In recent work, Bayesian methods for exploration in Markov decision processes (MDPs) and for solving known partially-observable Markov decision processes (POMDPs) have been proposed. In this paper we review the similarities and differences between those two domains and propose methods to deal with them simultaneously. This enables us to attack the Bayes-optimal reinforcement learning problem in POMDPs.
Document type	Report
Published at	http://www.science.uva.nl/research/isla/pub/IAS-UVA-08-01.pdf
Downloads	279433.pdf
Permalink to this page

Back

UvA-DARE