Publication Type

Journal Article

Version

acceptedVersion

Publication Date

9-2025

Abstract

Although most on-demand mission-critical systems are engineered to be reliable to support critical tasks, occasional failures may still occur during missions. To increase system survivability, a common practice is to abort the mission before an imminent failure. We consider optimal mission abort for a system whose deterioration follows a general three-state (normal, defective, failed) semi-Markov chain. The failure is assumed self-revealed, whereas the healthy and defective states have to be inferred from imperfect condition-monitoring data. Because of the non-Markovian process dynamics, optimal mission abort for this partially observable system is an intractable stopping problem. For a tractable solution, we introduce a novel tool of Erlang mixtures to approximate nonexponential sojourn times in the semi-Markov chain. This allows us to approximate the original process by a surrogate continuous-time Markov chain whose optimal control policy can be solved through a partially observable Markov decision process (POMDP). We show that the POMDP optimal policies converge almost surely to the optimal abort decision rules when the Erlang rate parameter diverges. This implies that the expected cost by adopting the POMDP solution converges to the optimal expected cost. Next, we provide comprehensive structural results on the optimal policy of the surrogate POMDP. Based on the results, we develop a modified point-based value iteration algorithm to numerically solve the surrogate POMDP. We further consider mission abort in a multitask setting where a system executes several tasks consecutively before a thorough inspection. Through a case study on an unmanned aerial vehicle, we demonstrate the capability of real-time implementation of our model, even when the condition-monitoring signals are generated with high frequency.

Keywords

Semi-Markov chain; partially observable Markov decision process; control-limit policy; mixture of Erlang distribution, optimal stopping

Discipline

Databases and Information Systems | Operations and Supply Chain Management

Research Areas

Integrative Research Areas

Publication

Operations Research

Volume

Issue

First Page

2396

Last Page

2416

ISSN

0030-364X

Identifier

10.1287/opre.2022.0643

Publisher

Institute for Operations Research and Management Sciences

Citation

SUN, Qiuzhuang; HU, Jiawen; and YE, Zhi-Sheng. Optimal abort policy for mission-critical systems under imperfect condition monitoring. (2025). Operations Research. 73, (5), 2396-2416.
Available at: https://ink.library.smu.edu.sg/cis_research/433

Copyright Owner and License

Authors

Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.

Additional URL

https://doi.org/10.1287/opre.2022.0643

Download

Included in

Databases and Information Systems Commons, Operations and Supply Chain Management Commons

COinS

Research Collection College of Integrative Studies

Optimal abort policy for mission-critical systems under imperfect condition monitoring

Publication Type

Version

Publication Date

Abstract

Keywords

Discipline

Research Areas

Publication

Volume

Issue

First Page

Last Page

ISSN

Identifier

Publisher

Citation

Copyright Owner and License

Creative Commons License

Additional URL

Included in

Search

Links

Browse

Links

Research Collection College of Integrative Studies

Optimal abort policy for mission-critical systems under imperfect condition monitoring

Author

Publication Type

Version

Publication Date

Abstract

Keywords

Discipline

Research Areas

Publication

Volume

Issue

First Page

Last Page

ISSN

Identifier

Publisher

Citation

Copyright Owner and License

Creative Commons License

Additional URL

Included in

Share

Search

Links

Browse

Links