Research Collection School Of Computing and Information Systems

Sampled fictitious play for multi-action stochastic dynamic programs

Archis Ghate
Shih-Fen CHENG, Singapore Management UniversityFollow
Stephen Baumert
Daniel Reaume
Dushyant Sharma
Robert L. Smith

Publication Type

Journal Article

Version

submittedVersion

Publication Date

3-2014

Abstract

We introduce a class of finite-horizon dynamic optimization problems that we call multi-action stochastic dynamic programs (DPs). Their distinguishing feature is that the decision in each state is a multi-dimensional vector. These problems can in principle be solved using Bellman's backward recursion. However, complexity of this procedure grows exponentially in the dimension of the decision vectors. This is called the curse of action-space dimensionality. To overcome this computational challenge, we propose an approximation algorithm rooted in the game theoretic paradigm of Sampled Fictitious Play (SFP). SFP solves a sequence of DPs with a one-dimensional action-space, which are exponentially smaller than the original multi-action stochastic DP. In particular, the computational effort in a fixed number of SFP iterations is linear in the dimension of the decision vectors. We show that the sequence of SFP iterates converges to a local optimum, and present a numerical case study in manufacturing where SFP is able to find solutions with objective values within 1% of the optimal objective value hundreds of times faster than the time taken by backward recursion. In this case study, SFP solutions are also better by a statistically significant margin than those found by a one-step lookahead heuristic.

Keywords

Approximate dynamic programming, game theory, operations management

Discipline

Computer Sciences | Operations Research, Systems Engineering and Industrial Engineering

Research Areas

Intelligent Systems and Optimization

Publication

IIE Transactions

Volume

Issue

First Page

742

Last Page

756

ISSN

0740-817X

Identifier

10.1080/0740817X.2013.857062

Publisher

Taylor and Francis

Citation

Ghate, Archis; CHENG, Shih-Fen; Baumert, Stephen; Reaume, Daniel; Sharma, Dushyant; and Smith, Robert L.. Sampled fictitious play for multi-action stochastic dynamic programs. (2014). IIE Transactions. 46, (7), 742-756.
Available at: https://ink.library.smu.edu.sg/sis_research/1982

Copyright Owner and License

Authors

Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.

Additional URL

https://doi.org/10.1080/0740817X.2013.857062

Download

Find it in your library

Included in

Computer Sciences Commons, Operations Research, Systems Engineering and Industrial Engineering Commons

COinS

Research Collection School Of Computing and Information Systems

Sampled fictitious play for multi-action stochastic dynamic programs

Publication Type

Version

Publication Date

Abstract

Keywords

Discipline

Research Areas

Publication

Volume

Issue

First Page

Last Page

ISSN

Identifier

Publisher

Citation

Copyright Owner and License

Creative Commons License

Additional URL

Included in

Search

Links

Browse

Links

Research Collection School Of Computing and Information Systems

Sampled fictitious play for multi-action stochastic dynamic programs

Author

Publication Type

Version

Publication Date

Abstract

Keywords

Discipline

Research Areas

Publication

Volume

Issue

First Page

Last Page

ISSN

Identifier

Publisher

Citation

Copyright Owner and License

Creative Commons License

Additional URL

Included in

Share

Search

Links

Browse

Links