Solving Transition-Independent Multi-agent MDPs with Sparse Interactions

J. Scharpff; D.M. Roijers; F.A. Oliehoek; M.T.J. Spaan; M.M. de Weerdt

Solving Transition-Independent Multi-agent MDPs with Sparse Interactions

Authors	J. Scharpff D.M. Roijers F.A. Oliehoek M.T.J. Spaan M.M. de Weerdt
Publication date	2016
Book title	Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence and the Twenty-Eighth Innovative Applications of Artificial Intelligence Conference
Book subtitle	12-17 February 2016, Phoenix, Arizona, USA
ISBN	9781577357605 9781577357643
Event	30th AAAI Conference on Artificial Intelligence
Volume \| Issue number	4
Pages (from-to)	3174-3180
Number of pages	7
Publisher	Palo Alto, California: AAAI Press
Organisations	Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract	In cooperative multi-agent sequential decision making under uncertainty, agents must coordinate to find an optimal joint policy that maximises joint value. Typical algorithms exploit additive structure in the value function, but in the fully- observable multi-agent MDP (MMDP) setting such structure is not present. We propose a new optimal solver for transition-independent MMDPs, in which agents can only affect their own state but their reward depends on joint transitions. We represent these dependencies compactly in conditional return graphs (CRGs). Using CRGs the value of a joint policy and the bounds on partially specified joint policies can be efficiently computed. We propose CoRe, a novel branch-and-bound policy search algorithm building on CRGs. CoRe typically requires less runtime than the available alternatives and finds solutions to previously unsolvable problems.
Document type	Conference contribution
Note	Extended version available on arXiv.org.
Language	English
Published at	https://ojs.aaai.org/index.php/AAAI/article/view/10405 https://arxiv.org/abs/1511.09047
Downloads	1511.09047 (Other version)
Permalink to this page

Back

UvA-DARE

Digital Academic Repository

Solving Transition-Independent Multi-agent MDPs with Sparse Interactions