Publication Type

Conference Proceeding Article

Version

acceptedVersion

Publication Date

7-2024

Abstract

Decision-making problems, categorized as single-agent, e.g., Atari, cooperative multi-agent, e.g., Hanabi, competitive multi-agent, e.g., Hold’em poker, and mixed cooperative and competitive, e.g., football, are ubiquitous in the real world. Although various methods have been proposed to address the specific decision-making categories, these methods typically evolve independently and cannot generalize to other categories. Therefore, a fundamental question for decision-making is: Can we develop a single algorithm to tackle ALL categories of decision-making problems? There are several main challenges to address this question: i) different decision-making categories involve different numbers of agents and different relationships between agents, ii) different categories have different solution concepts and evaluation measures, and iii) there lacks a comprehensive benchmark covering all the categories. This work presents a preliminary attempt to address the question with three main contributions. i) We propose the generalized mirror descent (GMD), a generalization of MD variants, which considers multiple historical policies and works with a broader class of Bregman divergences. ii) We propose the configurable mirror descent (CMD) where a meta-controller is introduced to dynamically adjust the hyper-parameters in GMD conditional on the evaluation measures. iii) We construct the GameBench with 15 academic-friendly games across different decision-making categories. Extensive experiments demonstrate that CMD achieves empirically competitive or better outcomes compared to baselines while providing the capability of exploring diverse dimensions of decision making.

Keywords

Decision making categorization, Decision making algorithm, Reinforcement learning, Machine learning

Discipline

Artificial Intelligence and Robotics | Management Information Systems

Research Areas

Data Science and Engineering; Intelligent Systems and Optimization

Publication

Proceedings of the 41st International Conference on Machine Learning (ICML 2024) : Vienna, Austria, July 21-27

Volume

235

First Page

28164

Last Page

28203

Publisher

PMLR

City or Country

Cambridge

Citation

LI, Pengdeng; LI, Shuxin; YANG, Chang; WANG, Xinrun; CHAN, Hau; and AN, Bo. Configurable mirror descent : Towards a unification of decision making. (2024). Proceedings of the 41st International Conference on Machine Learning (ICML 2024) : Vienna, Austria, July 21-27. 235, 28164-28203.
Available at: https://ink.library.smu.edu.sg/sis_research/9829

Creative Commons License

This work is licensed under a Creative Commons Attribution-NonCommercial-No Derivative Works 4.0 International License.

Comments

PDF provided by faculty

Download

Included in

Artificial Intelligence and Robotics Commons, Management Information Systems Commons

COinS

Research Collection School Of Computing and Information Systems

Configurable mirror descent : Towards a unification of decision making

Publication Type

Version

Publication Date

Abstract

Keywords

Discipline

Research Areas

Publication

Volume

First Page

Last Page

Publisher

City or Country

Citation

Creative Commons License

Comments

Included in

Search

Links

Browse

Links

Research Collection School Of Computing and Information Systems

Configurable mirror descent : Towards a unification of decision making

Author

Publication Type

Version

Publication Date

Abstract

Keywords

Discipline

Research Areas

Publication

Volume

First Page

Last Page

Publisher

City or Country

Citation

Creative Commons License

Comments

Included in

Share

Search

Links

Browse

Links