Publication Type
Conference Proceeding Article
Version
publishedVersion
Publication Date
8-2021
Abstract
Reinforcement learning (RL), a promising framework for data-driven decision making in an uncertain environment, has successfully been applied in many real-world operation and control problems. However, the application of RL in a large-scale decentralized multi-agent environment remains a challenging problem due to the partial observability and limited communications between agents. In this paper, we develop a model-based kernel RL approach and a model-free deep RL approach for learning a decentralized, shared policy among homogeneous agents. By leveraging the strengths of both these methods, we further propose a novel deep ensemble multi-agent reinforcement learning (MARL) method that efficiently learns to arbitrate between the decisions of the local kernel-based RL model and the wider-reaching deep RL model. We validate the proposed deep ensemble method on a highly challenging real-world air traffic control problem, where the goal is to provide effective guidance to aircraft to avoid air traffic congestion, conflicting situations, and to improve arrival timeliness, by dynamically recommending adjustments of aircraft speeds in real-time. Extensive empirical results from an open-source air traffic management simulation model, developed by Eurocontrol and built on a realworld data set including thousands of aircrafts, demonstrate that our proposed deep ensemble MARL method significantly outperforms three state-of-the-art benchmark approaches.
Discipline
Numerical Analysis and Computation | Transportation
Research Areas
Intelligent Systems and Optimization
Areas of Excellence
Digital transformation
Publication
Proceedings of the Thirty-First International Conference on Automated Planning and Scheduling, Guangzhou, China, 2021 August 2-13
First Page
468
Last Page
476
ISBN
9781713832317
Identifier
10.1609/icaps.v31i1.15993
Publisher
AAAI
City or Country
Guangzhou, Virtual
Citation
GHOSH, Supriyo; LAGUNA, Sean; WYNTER, Laura; WYNTER, Laura; and POONAWALA, Hasan.
A deep ensemble method for multi-agent reinforcement learning: A case study on air traffic control. (2021). Proceedings of the Thirty-First International Conference on Automated Planning and Scheduling, Guangzhou, China, 2021 August 2-13. 468-476.
Available at: https://ink.library.smu.edu.sg/sis_research/10267
Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
https://doi.org/10.1609/icaps.v31i1.15993