Publication Type

Conference Proceeding Article

Version

acceptedVersion

Publication Date

1-2007

Abstract

While POMDPs (partially observable Markov decision processes) are a popular computational model with wide-ranging applications, the computational cost for optimal policy generation is prohibitive. Researchers are investigating ever more efficient algorithms, yet many applications demand that such algorithms bound any loss in policy quality when chasing efficiency. To address this challenge, we present two new techniques. The first approximates in the value space to obtain solutions efficiently for a pre-specified error bound. Unlike existing techniques, our technique guarantees the resulting policy will meet this bound. Furthermore, it does not require costly computations to determine the quality loss of the policy. Our second technique prunes large tracts of belief space that are unreachable, allowing faster policy computation without any sacrifice in optimality. The combination of the two techniques, which are complementary to existing optimal policy generation algorithms, provides solutions with tight error bounds efficiently in domains where competing algorithms fail to provide such tight bounds.
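The two ideas in the abstract can be illustrated with a small sketch. The Python below is not the paper's algorithm; the toy two-state "tiger"-style POMDP, the belief-grid discretization, and all names (belief_update, grid_key, GRID, eps) are assumptions made for illustration only. It shows (a) pruning belief space by enumerating only beliefs reachable from the initial belief, and (b) stopping value iteration once the Bellman residual certifies a pre-specified error bound eps on the resulting finite, discretized belief MDP.

    import itertools
    from collections import deque

    import numpy as np

    # Toy POMDP spec (assumed for illustration; a 2-state "tiger"-style problem).
    S, A, O = 2, 3, 2                       # #states, #actions, #observations
    gamma = 0.95
    T = np.zeros((A, S, S))                 # T[a, s, s'] = P(s' | s, a)
    T[0] = T[1] = np.eye(S)                 # listening leaves the state unchanged
    T[2] = np.full((S, S), 0.5)             # opening a door resets the state
    Z = np.zeros((A, S, O))                 # Z[a, s', o] = P(o | s', a)
    Z[0] = Z[1] = np.array([[0.85, 0.15],
                            [0.15, 0.85]])  # listening is informative
    Z[2] = np.full((S, O), 0.5)             # opening yields no information
    R = np.array([[-1.0, -1.0],
                  [-1.0, -1.0],
                  [10.0, -100.0]])          # R[a, s]

    def belief_update(b, a, o):
        """Bayes filter: b'(s') is proportional to Z[a, s', o] * sum_s T[a, s, s'] b(s)."""
        nb = Z[a, :, o] * (b @ T[a])
        norm = nb.sum()
        return None if norm < 1e-12 else nb / norm  # None: o impossible under (b, a)

    GRID = 20
    def grid_key(b):
        """Integer grid coordinates; keeps the explored belief set finite.
        (The discretization is this sketch's own approximation, not the paper's.)"""
        return tuple(np.round(np.asarray(b) * GRID).astype(int))

    # Prune unreachable belief space: breadth-first closure from the initial belief.
    b0 = np.array([0.5, 0.5])
    reachable = {grid_key(b0): b0}          # grid key -> representative belief
    frontier = deque([b0])
    while frontier:
        b = frontier.popleft()
        for a, o in itertools.product(range(A), range(O)):
            nb = belief_update(b, a, o)
            if nb is not None and grid_key(nb) not in reachable:
                reachable[grid_key(nb)] = nb
                frontier.append(nb)
    beliefs = list(reachable.values())
    print(f"representative reachable beliefs: {len(beliefs)}")

    # Value iteration over the discretized reachable belief MDP, stopped once the
    # Bellman residual certifies the pre-specified bound eps on that finite MDP:
    # ||V_{k+1} - V_k||_inf <= eps * (1 - gamma) / (2 * gamma)
    # implies the greedy policy is eps-optimal (standard result for finite MDPs).
    eps = 0.1
    idx = {k: i for i, k in enumerate(reachable)}
    V = np.zeros(len(beliefs))
    while True:
        V_new = np.empty_like(V)
        for i, b in enumerate(beliefs):
            best = -np.inf
            for a in range(A):
                val = b @ R[a]
                for o in range(O):
                    nb = belief_update(b, a, o)
                    if nb is None:
                        continue
                    p_o = (Z[a, :, o] * (b @ T[a])).sum()   # P(o | b, a)
                    val += gamma * p_o * V[idx[grid_key(nb)]]
                best = max(best, val)
            V_new[i] = best
        if np.max(np.abs(V_new - V)) <= eps * (1 - gamma) / (2 * gamma):
            break
        V = V_new
    print(f"V(b0) is approximately {V_new[idx[grid_key(b0)]]:.3f}")

Note that the grid rounding is what keeps the reachable set finite in this sketch, and it introduces its own approximation; the error guarantee of the residual test applies to the discretized MDP, whereas the paper's techniques bound the loss without such costly post-hoc quality computations.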

Discipline

Artificial Intelligence and Robotics | Business | Operations Research, Systems Engineering and Industrial Engineering

Publication

IJCAI '07: Proceedings of the 20th International Joint Conference on Artificial Intelligence: Hyderabad, India, 6-12 January 2007

First Page

2638

Last Page

2643

Publisher

AAAI Press

City or Country

Menlo Park, CA

Additional URL

http://portal.acm.org/citation.cfm?id=1625700
