Publication Type
Journal Article
Version
acceptedVersion
Publication Date
11-2025
Abstract
With the growing influence of the internet and information technology, Electrical and Electronic Equipment (EEE) has become a gateway to technological innovations. However, discarded devices, also called e-waste, pose a significant threat to the environment and human health if not properly treated, disposed of, or recycled. In this study, we extend a novel model for e-waste collection in an urban context: the Heterogeneous VRP with Multiple Time Windows and Stochastic Travel Times (HVRP-MTWSTT). We propose a solution method that employs deep reinforcement learning to guide local search heuristics (DRL-LSH). The contributions of this paper are as follows: (1) HVRP-MTWSTT represents the first stochastic VRP in the context of the e-waste collection problem, incorporating complex constraints such as multiple time windows across a multi-period horizon with a heterogeneous vehicle fleet; and (2) the DRL-LSH model uses deep reinforcement learning to provide an online adaptive operator selection layer that selects the appropriate heuristic based on the search state. The computational experiments demonstrate that DRL-LSH outperforms the state-of-the-art hyperheuristic method by 24.26% on large-scale benchmark instances, with the performance gap increasing as the problem size grows. Additionally, to demonstrate the capability of DRL-LSH in addressing real-world problems, we tested it against reference metaheuristic and hyperheuristic algorithms on a real-world e-waste collection case study in Singapore. The results showed that DRL-LSH significantly outperformed the reference algorithms on this instance in terms of operating profit.
Keywords
Deep reinforcement learning, E-waste, Adaptive operator selection
Discipline
Artificial Intelligence and Robotics
Areas of Excellence
Sustainability
Publication
European Journal of Operational Research
Volume
327
Issue
1
First Page
309
Last Page
325
ISSN
0377-2217
Identifier
10.1016/j.ejor.2025.04.033
Publisher
Elsevier
Citation
NGUYEN, Dang Viet Anh; GUNAWAN, Aldy; MISIR, Mustafa; LIM, Kwan Hui; and VANSTEENWEGEN, Pieter.
Deep reinforcement learning for solving the stochastic e-waste collection problem. (2025). European Journal of Operational Research. 327, (1), 309-325.
Available at: https://ink.library.smu.edu.sg/sis_research/10293
Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Additional URL
https://doi.org/10.1016/j.ejor.2025.04.033