Research Collection School Of Computing and Information Systems

Planning and learning for Non-Markovian negative side effects using finite state controllers

Aishwarya SRIVASTAVA
Sandhya Saisubramanian
Praveen Paruchuri
Akshat KUMAR, Singapore Management UniversityFollow
Shlomo Zilberstein

Publication Type

Conference Proceeding Article

Version

publishedVersion

Publication Date

2-2023

Abstract

Autonomous systems are often deployed in the open world where it is hard to obtain complete specifications of objectives and constraints. Operating based on an incomplete model can produce negative side effects (NSEs), which affect the safety and reliability of the system. We focus on mitigating NSEs in environments modeled as Markov decision processes (MDPs). First, we learn a model of NSEs using observed data that contains state-action trajectories and severity of associated NSEs. Unlike previous works that associate NSEs with state-action pairs, our framework associates NSEs with entire trajectories, which is more general and captures non-Markovian dependence on states and actions. Second, we learn finite state controllers (FSCs) that predict NSE severity for a given trajectory and generalize well to unseen data. Finally, we develop a constrained MDP model that uses information from the underlying MDP and the learned FSC for planning while avoiding NSEs. Our empirical evaluation demonstrates the effectiveness of our approach in learning and mitigating Markovian and non-Markovian NSEs.

Keywords

Constrained Markov decision process, Finite-state controllers, Incomplete model, Non-Markovian

Discipline

Artificial Intelligence and Robotics

Research Areas

Intelligent Systems and Optimization

Publication

Proceedings of the 37th AAAI Conference on Artificial Intelligence, AAAI 2023, Washington, February 7-14

Volume

First Page

15144

Last Page

15151

ISBN

9781577358800

Identifier

10.1609/aaai.v37i12.26767

Publisher

AAAI Press

City or Country

Washington

Citation

SRIVASTAVA, Aishwarya; Saisubramanian, Sandhya; Paruchuri, Praveen; KUMAR, Akshat; and Zilberstein, Shlomo. Planning and learning for Non-Markovian negative side effects using finite state controllers. (2023). Proceedings of the 37th AAAI Conference on Artificial Intelligence, AAAI 2023, Washington, February 7-14. 37, 15144-15151.
Available at: https://ink.library.smu.edu.sg/sis_research/8092

Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.

Additional URL

https://doi.org/10.1609/aaai.v37i12.26767

Download

Included in

Artificial Intelligence and Robotics Commons

COinS

Research Collection School Of Computing and Information Systems

Planning and learning for Non-Markovian negative side effects using finite state controllers

Publication Type

Version

Publication Date

Abstract

Keywords

Discipline

Research Areas

Publication

Volume

First Page

Last Page

ISBN

Identifier

Publisher

City or Country

Citation

Creative Commons License

Additional URL

Included in

Search

Links

Browse

Links

Research Collection School Of Computing and Information Systems

Planning and learning for Non-Markovian negative side effects using finite state controllers

Author

Publication Type

Version

Publication Date

Abstract

Keywords

Discipline

Research Areas

Publication

Volume

First Page

Last Page

ISBN

Identifier

Publisher

City or Country

Citation

Creative Commons License

Additional URL

Included in

Share

Search

Links

Browse

Links