Research Collection School Of Computing and Information Systems

Distributed Model Shaping for Scaling to Decentralized POMDPs with hundreds of agents

Prasanna VELAGAPUDI, Carnegie Mellon University
Pradeep Reddy VARAKANTHAM, Singapore Management UniversityFollow
Katia Sycara, Carnegie Mellon University
Paul Scerri, Carnegie Mellon University

Publication Type

Conference Proceeding Article

Version

submittedVersion

Publication Date

5-2011

Abstract

The use of distributed POMDPs for cooperative teams has been severely limited by the incredibly large joint policy- space that results from combining the policy-spaces of the individual agents. However, much of the computational cost of exploring the entire joint policy space can be avoided by observing that in many domains important interactions between agents occur in a relatively small set of scenarios, previously defined as coordination locales (CLs) [11]. Moreover, even when numerous interactions might occur, given a set of individual policies there are relatively few actual interactions. Exploiting this observation and building on an existing model shaping algorithm, this paper presents D-TREMOR, an algorithm in which cooperative agents iteratively generate individual policies, identify and communicate possible interactions between their policies, shape their models based on this information and generate new policies. D-TREMOR has three properties that jointly distinguish it from previous DEC-POMDP work: (1) it is completely distributed; (2) it is scalable (allowing 100 agents to compute a \good" joint policy in under 6 hours) and (3) it has low communication overhead. D-TREMOR complements these traits with the following key contributions, which ensure improved scalability and solution quality: (a) techniques to ensure convergence; (b) faster approaches to detect and evaluate CLs; (c) heuristics to capture dependencies between CLs; and (d) novel shaping heuristics to aggregate effects of CLs. While the resulting policies are not globally optimal, empirical results show that agents have policies that effectively manage uncertainty and the joint policy is better than policies generated by independent solvers.

Keywords

DEC-POMDP, Uncertainty, Multi-agent systems

Discipline

Artificial Intelligence and Robotics | Operations Research, Systems Engineering and Industrial Engineering

Publication

AAMAS '11: The 10th International Conference on Autonomous Agents and Multiagent Systems: May 2-6, Taipei, Taiwan

First Page

955

Last Page

962

Publisher

IFAAMAS

City or Country

Ann Arbor, MI

Citation

VELAGAPUDI, Prasanna; VARAKANTHAM, Pradeep Reddy; Sycara, Katia; and Scerri, Paul. Distributed Model Shaping for Scaling to Decentralized POMDPs with hundreds of agents. (2011). AAMAS '11: The 10th International Conference on Autonomous Agents and Multiagent Systems: May 2-6, Taipei, Taiwan. 955-962.
Available at: https://ink.library.smu.edu.sg/sis_research/1342

Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.

Download

Included in

Artificial Intelligence and Robotics Commons, Operations Research, Systems Engineering and Industrial Engineering Commons

COinS

Research Collection School Of Computing and Information Systems

Distributed Model Shaping for Scaling to Decentralized POMDPs with hundreds of agents

Publication Type

Version

Publication Date

Abstract

Keywords

Discipline

Publication

First Page

Last Page

Publisher

City or Country

Citation

Creative Commons License

Included in

Search

Links

Browse

Links

Research Collection School Of Computing and Information Systems

Distributed Model Shaping for Scaling to Decentralized POMDPs with hundreds of agents

Author

Publication Type

Version

Publication Date

Abstract

Keywords

Discipline

Publication

First Page

Last Page

Publisher

City or Country

Citation

Creative Commons License

Included in

Share

Search

Links

Browse

Links