Solving Transition-Independent Multi-agent MDPs with Sparse Interactions

Open Access
Authors
  • M.M. de Weerdt
Publication date 2016
Book title Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence and the Twenty-Eighth Innovative Applications of Artificial Intelligence Conference
Book subtitle 12-17 February 2016, Phoenix, Arizona, USA
ISBN
  • 9781577357605
  • 9781577357643
Event 30th AAAI Conference on Artificial Intelligence
Volume | Issue number 4
Pages (from-to) 3174-3180
Number of pages 7
Publisher Palo Alto, California: AAAI Press
Organisations
  • Faculty of Science (FNWI) - Informatics Institute (IVI)
Abstract
In cooperative multi-agent sequential decision making under uncertainty, agents must coordinate to find an optimal joint policy that maximises joint value. Typical algorithms exploit additive structure in the value function, but in the fully- observable multi-agent MDP (MMDP) setting such structure is not present. We propose a new optimal solver for transition-independent MMDPs, in which agents can only affect their own state but their reward depends on joint transitions. We represent these dependencies compactly in conditional return graphs (CRGs). Using CRGs the value of a joint policy and the bounds on partially specified joint policies can be efficiently computed. We propose CoRe, a novel branch-and-bound policy search algorithm building on CRGs. CoRe typically requires less runtime than the available alternatives and finds solutions to previously unsolvable problems.
Document type Conference contribution
Note Extended version available on arXiv.org.
Language English
Published at https://ojs.aaai.org/index.php/AAAI/article/view/10405 https://arxiv.org/abs/1511.09047
Downloads
1511.09047 (Other version)
Permalink to this page
Back