Research Collection School Of Computing and Information Systems

Multi-task vehicle routing solver via mixture of specialized experts under state-decomposable MDP

Publication Type

Conference Proceeding Article

Version

submittedVersion

Publication Date

12-2025

Abstract

Existing neural methods for multi-task vehicle routing problems (VRPs) typically learn unified solvers to handle multiple constraints simultaneously. However, they often underutilize the compositional structure of VRP variants, each derivable from a common set of basis VRP variants. This critical oversight causes unified solvers to miss out the potential benefits of basis solvers, each specialized for a basis VRP variant. To overcome this limitation, we propose a framework that enables unified solvers to perceive the shared-component nature across VRP variants by proactively reusing basis solvers, while mitigating the exponential growth of trained neural solvers. Specifically, we introduce a State-Decomposable MDP (SDMDP) that reformulates VRPs by expressing the state space as the Cartesian product of basis state spaces associated with basis VRP variants. More crucially, this formulation inherently yields the optimal basis policy for each basis VRP variant. Furthermore, a Latent Space-based SDMDP extension is developed by incorporating both the optimal basis policies and a learnable mixture function to enable the policy reuse in the latent space. Under mild assumptions, this extension provably recovers the optimal unified policy of SDMDP through the mixture function that computes the state embedding as a mapping from the basis state embeddings generated by optimal basis policies. For practical implementation, we introduce the Mixture-of-Specialized-Experts Solver (MoSES), which realizes basis policies through specialized Low-Rank Adaptation (LoRA) experts, and implements the mixture function via an adaptive gating mechanism. Extensive experiments conducted across VRP variants showcase the superiority of MoSES over prior methods.

Keywords

multi-task vehicle routing, state-decomposable MDP, mixture of experts, basis solvers, latent-space policy reuse, low-rank adaptation, neural combinatorial optimization, reinforcement learning, unified VRP solver, policy composition

Discipline

Artificial Intelligence and Robotics

Research Areas

Intelligent Systems and Optimization

Publication

Proceedings of the 39th Conference on Neural Information Processing

City or Country

San Diego, US

Citation

PAN, Yuxin; CAO, Zhiguang; GU, Chengyang; LIU, Liu; ZHAO, Peilin; CHEN, Yize; and LIN, Fangzhen. Multi-task vehicle routing solver via mixture of specialized experts under state-decomposable MDP. (2025). Proceedings of the 39th Conference on Neural Information Processing.
Available at: https://ink.library.smu.edu.sg/sis_research/10567

Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.

Download

Included in

Artificial Intelligence and Robotics Commons

COinS

Research Collection School Of Computing and Information Systems

Multi-task vehicle routing solver via mixture of specialized experts under state-decomposable MDP

Publication Type

Version

Publication Date

Abstract

Keywords

Discipline

Research Areas

Publication

City or Country

Citation

Creative Commons License

Included in

Search

Links

Browse

Links

Research Collection School Of Computing and Information Systems

Multi-task vehicle routing solver via mixture of specialized experts under state-decomposable MDP

Author

Publication Type

Version

Publication Date

Abstract

Keywords

Discipline

Research Areas

Publication

City or Country

Citation

Creative Commons License

Included in

Share

Search

Links

Browse

Links