Research Collection School Of Computing and Information Systems

MEASE: Multi-agent Episodic Action Sequence Explanation

Publication Type

Conference Proceeding Article

Version

acceptedVersion

Publication Date

5-2026

Abstract

Multi-agent reinforcement learning (MARL) achieves remarkable performance in complex coordination tasks, yet interpreting the emergent behaviors of trained agents remains a fundamental challenge. Most current explainability methods focus on individual agent decisions, overlooking the critical interplay of joint strategiesand temporal coordination patterns that define successful multi-agent policies. We present MEASE (Multi-agent Episodic Action Sequence Explanation), a novel explainable MARL (XMARL) framework that explains trained MARL policies as human-interpretable emergent cooperative joint behaviors. MEASE employs a cognition-inspired episodic memory model to learn spatio-temporal multi-agent interaction patterns, coupled with abstraction algorithms that identify significant cooperative agent behaviors. We evaluate MEASE on diverse scenarios in the VMAS and MOSMAC environments, demonstrating its generalizability across various tasksand domains. These explanations, which prescribe "when to do what" for multi-agent systems, serve as executable coordination protocols that faithfully capture the learned behaviors. Quantitativevalidation shows that deploying explanations as strategies achieves 93% of the original MARL policy performance. A user study with 31 participants validates the clarity and usefulness of the explanations. These results demonstrate that MEASE effectively extracts explanatory knowledge from complex multi-agent behaviors.

Keywords

Explainable Multi-agent Reinforcement Learning, Multi-agent Reinforcement Learning, Sequential Decision-making

Discipline

Artificial Intelligence and Robotics

Research Areas

Intelligent Systems and Optimization

Publication

Proceedings of the 25th International Conference on Autonomous Agents and Multiagent Systems, Paphos, Cyprus, 2026 May 25-29

First Page

Last Page

Publisher

International Foundation for Autonomous Agents and Multiagent Systems

City or Country

Richland, SC

Citation

KHAING, Phyo Wai; GENG, Minghong; PATERIA, Shubham; SUBAGDJA, Budhitama; and TAN, Ah-hwee. MEASE: Multi-agent Episodic Action Sequence Explanation. (2026). Proceedings of the 25th International Conference on Autonomous Agents and Multiagent Systems, Paphos, Cyprus, 2026 May 25-29. 1-10.
Available at: https://ink.library.smu.edu.sg/sis_research/11081

Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.

Download

Included in

Artificial Intelligence and Robotics Commons

COinS

Research Collection School Of Computing and Information Systems

MEASE: Multi-agent Episodic Action Sequence Explanation

Publication Type

Version

Publication Date

Abstract

Keywords

Discipline

Research Areas

Publication

First Page

Last Page

Publisher

City or Country

Citation

Creative Commons License

Included in

Search

Links

Browse

Links

Research Collection School Of Computing and Information Systems

MEASE: Multi-agent Episodic Action Sequence Explanation

Author

Publication Type

Version

Publication Date

Abstract

Keywords

Discipline

Research Areas

Publication

First Page

Last Page

Publisher

City or Country

Citation

Creative Commons License

Included in

Share

Search

Links

Browse

Links