Publication Type
Conference Proceeding Article
Version
acceptedVersion
Publication Date
5-2011
Abstract
Decentralized POMDPs provide a rigorous framework for multi-agent decision-theoretic planning. However, their high complexity has limited scalability. In this work, we present a promising new class of algorithms based on probabilistic inference for infinite-horizon ND-POMDPs---a restricted Dec-POMDP model. We first transform the policy optimization problem to that of likelihood maximization in a mixture of dynamic Bayes nets (DBNs). We then develop the Expectation-Maximization (EM) algorithm for maximizing the likelihood in this representation. The EM algorithm for ND-POMDPs lends itself naturally to a simple message-passing paradigm guided by the agent interaction graph. It is thus highly scalable w.r.t. the number of agents, can be easily parallelized, and produces good quality solutions.
Discipline
Artificial Intelligence and Robotics | Operations Research, Systems Engineering and Industrial Engineering
Research Areas
Intelligent Systems and Optimization
Publication
AAMAS '11: The 10th International Conference on Autonomous Agents and Multiagent Systems, Taipei, Taiwan, May 2-6
First Page
1087
Last Page
1088
ISBN
9780982657171
Publisher
IFAAMAS
City or Country
Richland, SC
Citation
KUMAR, Akshat and ZILBERSTEIN, Shlomo.
Message-Passing Algorithms for Large Structured Decentralized POMDPs. (2011). AAMAS '11: The 10th International Conference on Autonomous Agents and Multiagent Systems, Taipei, Taiwan, May 2-6. 1087-1088.
Available at: https://ink.library.smu.edu.sg/sis_research/2207
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
https://dl.acm.org/citation.cfm?id=2034431
Included in
Artificial Intelligence and Robotics Commons, Operations Research, Systems Engineering and Industrial Engineering Commons