Publication Type
Conference Proceeding Article
Version
submittedVersion
Publication Date
12-2025
Abstract
Existing neural methods for multi-task vehicle routing problems (VRPs) typically learn unified solvers to handle multiple constraints simultaneously. However, they often underutilize the compositional structure of VRP variants, each derivable from a common set of basis VRP variants. This critical oversight causes unified solvers to miss out the potential benefits of basis solvers, each specialized for a basis VRP variant. To overcome this limitation, we propose a framework that enables unified solvers to perceive the shared-component nature across VRP variants by proactively reusing basis solvers, while mitigating the exponential growth of trained neural solvers. Specifically, we introduce a State-Decomposable MDP (SDMDP) that reformulates VRPs by expressing the state space as the Cartesian product of basis state spaces associated with basis VRP variants. More crucially, this formulation inherently yields the optimal basis policy for each basis VRP variant. Furthermore, a Latent Space-based SDMDP extension is developed by incorporating both the optimal basis policies and a learnable mixture function to enable the policy reuse in the latent space. Under mild assumptions, this extension provably recovers the optimal unified policy of SDMDP through the mixture function that computes the state embedding as a mapping from the basis state embeddings generated by optimal basis policies. For practical implementation, we introduce the Mixture-of-Specialized-Experts Solver (MoSES), which realizes basis policies through specialized Low-Rank Adaptation (LoRA) experts, and implements the mixture function via an adaptive gating mechanism. Extensive experiments conducted across VRP variants showcase the superiority of MoSES over prior methods.
Keywords
multi-task vehicle routing, state-decomposable MDP, mixture of experts, basis solvers, latent-space policy reuse, low-rank adaptation, neural combinatorial optimization, reinforcement learning, unified VRP solver, policy composition
Discipline
Artificial Intelligence and Robotics
Research Areas
Intelligent Systems and Optimization
Publication
Proceedings of the 39th Conference on Neural Information Processing
City or Country
San Diego, US
Citation
PAN, Yuxin; CAO, Zhiguang; GU, Chengyang; LIU, Liu; ZHAO, Peilin; CHEN, Yize; and LIN, Fangzhen.
Multi-task vehicle routing solver via mixture of specialized experts under state-decomposable MDP. (2025). Proceedings of the 39th Conference on Neural Information Processing.
Available at: https://ink.library.smu.edu.sg/sis_research/10567
Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.