Publication Type
Journal Article
Version
submittedVersion
Publication Date
3-2014
Abstract
We introduce a class of finite-horizon dynamic optimization problems that we call multi-action stochastic dynamic programs (DPs). Their distinguishing feature is that the decision in each state is a multi-dimensional vector. These problems can in principle be solved using Bellman's backward recursion. However, complexity of this procedure grows exponentially in the dimension of the decision vectors. This is called the curse of action-space dimensionality. To overcome this computational challenge, we propose an approximation algorithm rooted in the game theoretic paradigm of Sampled Fictitious Play (SFP). SFP solves a sequence of DPs with a one-dimensional action-space, which are exponentially smaller than the original multi-action stochastic DP. In particular, the computational effort in a fixed number of SFP iterations is linear in the dimension of the decision vectors. We show that the sequence of SFP iterates converges to a local optimum, and present a numerical case study in manufacturing where SFP is able to find solutions with objective values within 1% of the optimal objective value hundreds of times faster than the time taken by backward recursion. In this case study, SFP solutions are also better by a statistically significant margin than those found by a one-step lookahead heuristic.
Keywords
Approximate dynamic programming, game theory, operations management
Discipline
Computer Sciences | Operations Research, Systems Engineering and Industrial Engineering
Research Areas
Intelligent Systems and Optimization
Publication
IIE Transactions
Volume
46
Issue
7
First Page
742
Last Page
756
ISSN
0740-817X
Identifier
10.1080/0740817X.2013.857062
Publisher
Taylor and Francis
Citation
Ghate, Archis; CHENG, Shih-Fen; Baumert, Stephen; Reaume, Daniel; Sharma, Dushyant; and Smith, Robert L..
Sampled fictitious play for multi-action stochastic dynamic programs. (2014). IIE Transactions. 46, (7), 742-756.
Available at: https://ink.library.smu.edu.sg/sis_research/1982
Copyright Owner and License
Authors
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
https://doi.org/10.1080/0740817X.2013.857062
Included in
Computer Sciences Commons, Operations Research, Systems Engineering and Industrial Engineering Commons