Publication Type
Conference Proceeding Article
Version
publishedVersion
Publication Date
2-2017
Abstract
Multiagent sequential decision making has seen rapid progress with formal models such as decentralized MDPs and POMDPs. However, scalability to large multiagent systems and applicability to real world problems remain limited. To address these challenges, we study multiagent planning problems where the collective behavior of a population of agents affects the joint-reward and environment dynamics. Our work exploits recent advances in graphical models for modeling and inference with a population of individuals such as collective graphical models and the notion of finite partial exchangeability in lifted inference. We develop a collective decentralized MDP model where policies can be computed based on counts of agents in different states. As the policy search space over counts is combinatorial, we develop a sampling based framework that can compute open and closed loop policies. Comparisons with previous best approaches on synthetic instances and a real world taxi dataset modeling supply-demand matching show that our approach significantly outperforms them w.r.t.solution quality.
Keywords
Artificial intelligence, Decision making, Graphic methods, Taxicabs
Discipline
Artificial Intelligence and Robotics | Computer Sciences | Transportation
Research Areas
Intelligent Systems and Optimization
Publication
Proceedings of the 31st AAAI Conference on Artificial Intelligence AAAI -17: San Francisco, CA, February 4-9
First Page
3036
Last Page
3043
Publisher
AAAI Press
City or Country
Menlo Park, CA
Citation
NGUYEN, Duc Thien; Akshat KUMAR; and LAU, Hoong Chuin.
Collective multiagent sequential decision making under uncertainty. (2017). Proceedings of the 31st AAAI Conference on Artificial Intelligence AAAI -17: San Francisco, CA, February 4-9. 3036-3043.
Available at: https://ink.library.smu.edu.sg/sis_research/3529
Copyright Owner and License
Publisher
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
https://aaai.org/ocs/index.php/AAAI/AAAI17/paper/view/14891