Publication Type
Journal Article
Version
publishedVersion
Publication Date
6-2015
Abstract
Decentralized POMDPs provide an expressive framework for multiagent sequential decision making. However, the complexity of these models---NEXP-Complete even for two agents---has limited their scalability. We present a promising new class of approximation algorithms by developing novel connections between multiagent planning and machine learning. We show how the multiagent planning problem can be reformulated as inference in a mixture of dynamic Bayesian networks (DBNs). This planning-as-inference approach paves the way for the application of efficient inference techniques in DBNs to multiagent decision making. To further improve scalability, we identify certain conditions that are sufficient to extend the approach to multiagent systems with dozens of agents. Specifically, we show that the necessary inference within the expectation-maximization framework can be decomposed into processes that often involve a small subset of agents, thereby facilitating scalability. We further show that a number of existing multiagent planning models satisfy these conditions. Experiments on large planning benchmarks confirm the benefits of our approach in terms of runtime and scalability with respect to existing techniques.
Discipline
Artificial Intelligence and Robotics | Computer Sciences
Research Areas
Intelligent Systems and Optimization
Publication
Journal of Artificial Intelligence Research
Volume
53
First Page
223
Last Page
270
ISSN
1076-9757
Identifier
10.1613/jair.4649
Publisher
AI Access Foundation
Citation
Akshat KUMAR; ZILBERSTEIN, Shlomo; and TOUSSAINT, Marc.
Probabilistic Inference Techniques for Scalable Multiagent Decision Making. (2015). Journal of Artificial Intelligence Research. 53, 223-270.
Available at: https://ink.library.smu.edu.sg/sis_research/3076
Copyright Owner and License
AI Access Foundation
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
https://doi.org/10.1613/jair.4649