Publication Type

Conference Proceeding Article

Version

publishedVersion

Publication Date

10-2020

Abstract

In real-world urban logistics operations, changes to routes and tasks occur in response to dynamic events. To ensure customers’ demands are met, planners need to make these changes quickly (sometimes instantaneously). This paper formulates a dynamic vehicle routing problem with time windows and both known and stochastic customers as a route-based Markov Decision Process. We propose a solution approach, called DRLSA, that combines Deep Reinforcement Learning (specifically, neural-network-based Temporal-Difference learning with experience replay) to approximate the value function with a routing heuristic based on Simulated Annealing. Our approach enables optimized re-routing decisions to be generated almost instantaneously. Furthermore, to exploit the structure of this problem, we propose a state representation based on the total cost of the remaining routes of the vehicles. We show that the cost of the remaining routes can serve as a proxy for the route sequences and time window requirements. DRLSA is evaluated against the commonly used Approximate Value Iteration (AVI) and Multiple Scenario Approach (MSA). Our experimental results show that DRLSA achieves, on average, a 10% improvement over the myopic policy, outperforming AVI and MSA even with few training episodes on problems with a degree of dynamism above 0.5.
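
The sketch below illustrates one way the value-function component described in the abstract could look: a small neural network trained with Temporal-Difference learning and experience replay, taking the cost of each vehicle's remaining route as the state features. It is a minimal, hypothetical example, not the authors' implementation; the network size, fleet size, learning rate, and buffer contents are all assumptions, and the Simulated Annealing re-routing heuristic that produces candidate decisions is omitted.

```python
# Illustrative sketch only (not the paper's code): a neural value function
# trained with TD(0) and experience replay, where the state feature vector is
# the total cost of each vehicle's remaining route. All names and
# hyperparameters below are assumptions made for illustration.
import random
from collections import deque

import torch
import torch.nn as nn

NUM_VEHICLES = 5   # assumed fleet size
GAMMA = 1.0        # undiscounted finite-horizon routing cost

# V(s): maps the vector of remaining-route costs to an estimated cost-to-go.
value_net = nn.Sequential(
    nn.Linear(NUM_VEHICLES, 64), nn.ReLU(),
    nn.Linear(64, 64), nn.ReLU(),
    nn.Linear(64, 1),
)
optimizer = torch.optim.Adam(value_net.parameters(), lr=1e-3)
replay = deque(maxlen=10_000)  # experience replay buffer of (s, cost, s') tuples


def td_update(batch_size: int = 32) -> None:
    """One TD(0) update on a minibatch sampled from the replay buffer."""
    if len(replay) < batch_size:
        return
    batch = random.sample(replay, batch_size)
    s = torch.tensor([b[0] for b in batch], dtype=torch.float32)
    c = torch.tensor([b[1] for b in batch], dtype=torch.float32).unsqueeze(1)
    s_next = torch.tensor([b[2] for b in batch], dtype=torch.float32)

    with torch.no_grad():  # bootstrap target: immediate cost + gamma * V(s')
        target = c + GAMMA * value_net(s_next)
    loss = nn.functional.mse_loss(value_net(s), target)

    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
```

In such a setup, the re-routing heuristic would propose candidate plans at each decision epoch, and the learned value of the resulting post-decision state (summarized by the remaining-route costs) would be used to choose among them.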

Keywords

Vehicle routing, Automatic vehicle, Reinforcement learning

Discipline

Numerical Analysis and Scientific Computing | Operations Research, Systems Engineering and Industrial Engineering | Transportation

Research Areas

Intelligent Systems and Optimization

Publication

Proceedings of the 30th International Conference on Automated Planning and Scheduling, Nancy, France, October 26-30

First Page

394

Last Page

402

Publisher

AAAI Press

City or Country

Menlo Park, CA

Copyright Owner and License

Publisher
