Publication Type
Journal Article
Version
acceptedVersion
Publication Date
1-2024
Abstract
This paper studies the problem in transportation networks and introduces a novel reinforcement learning-based algorithm, namely. Different from almost all canonical sota solutions, which are usually computationally expensive and lack generalizability to unforeseen destination nodes, segac offers the following appealing characteristics. segac updates the ego vehicle’s navigation policy in a sample efficient manner, reduces the variance of both value network and policy network during training, and is automatically adaptive to new destinations. Furthermore, the pre-trained segac policy network enables its real-time decision-making ability within seconds, outperforming state-of-the-art sota algorithms in simulations across various transportation networks. We also successfully deploy segac to two real metropolitan transportation networks, namely Chengdu and Beijing, using real traffic data, with satisfying results.
Keywords
Gaussian distribution, Generalized actor critic, Navigation, Optimization, Real-time systems, Reliability, Routing, sample efficiency, stochastic on-time arrival (SOTA), Transportation, variance reduction
Discipline
Operations Research, Systems Engineering and Industrial Engineering | Theory and Algorithms | Transportation
Research Areas
Intelligent Systems and Optimization
Publication
IEEE Transactions on Intelligent Transportation Systems
First Page
1
Last Page
16
ISSN
1524-9050
Identifier
10.1109/TITS.2024.3361445
Publisher
Institute of Electrical and Electronics Engineers
Citation
GUO, Honglian; HE, Zhi; SHENG, Wenda; CAO, Zhiguang; ZHOU, Yingjie; and GAO, Weinan.
SEGAC: Sample Efficient Generalized Actor Critic for the Stochastic On-Time Arrival Problem. (2024). IEEE Transactions on Intelligent Transportation Systems. 1-16.
Available at: https://ink.library.smu.edu.sg/sis_research/8704
Copyright Owner and License
Authors
Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.
Additional URL
https://doi.org/10.1109/TITS.2024.3361445
Included in
Operations Research, Systems Engineering and Industrial Engineering Commons, Theory and Algorithms Commons, Transportation Commons