Publication Type
Conference Proceeding Article
Version
publishedVersion
Publication Date
10-2023
Abstract
We consider the problem of learning control policies in discrete-time stochastic systems which guarantee that the system stabilizes within some specified stabilization region with probability 1. Our approach is based on the novel notion of stabilizing ranking supermartingales (sRSMs) that we introduce in this work. Our sRSMs overcome the limitation of methods proposed in previous works whose applicability is restricted to systems in which the stabilizing region cannot be left once entered under any control policy. We present a learning procedure that learns a control policy together with an sRSM that formally certifies probability 1 stability, both learned as neural networks. We show that this procedure can also be adapted to formally verifying that, under a given Lipschitz continuous control policy, the stochastic system stabilizes within some stabilizing region with probability 1. Our experimental evaluation shows that our learning procedure can successfully learn provably stabilizing policies in practice.
Keywords
Learning-based control, Stochastic systems, Martingales, Formal verification, Stabilization
Discipline
Databases and Information Systems
Research Areas
Intelligent Systems and Optimization
Areas of Excellence
Digital transformation
Publication
Proceedings of the 21st International Symposium, ATVA 2023, Singapore, October 24-27
First Page
357
Last Page
379
ISBN
9783031453281
Identifier
10.1007/978-3-031-45329-8_17
Publisher
Springer
City or Country
Cham
Citation
ANSARIPOUR, Matin; CHATTERJEE, Krishnendu; HENZINGER, A. Thomas; LECHNER, Mathias; and ZIKELIC, Dorde.
Learning provably stabilizing neural controllers for discrete-time stochastic systems. (2023). Proceedings of the 21st International Symposium, ATVA 2023, Singapore, October 24-27. 357-379.
Available at: https://ink.library.smu.edu.sg/sis_research/9067
Copyright Owner and License
Authors
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
https://doi.org/10.1007/978-3-031-45329-8_17