Survey on Solutions and Applications for Mixed-motive Games

doi:10.13328/j.cnki.jos.007212

Journal of Software, 2025, 36(1): 107-151


CSTR: 32375.14.jos.007212
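Fund Project: National Natural Science Foundation of China (62192783, 62106100, 62206133, 62276142); Natural Science Foundation of Jiangsu Province (BK20221441); Jiangsu Provincial Industry Foresight and Key Core Technology Competitive Project (BE2021028); Shenzhen Central-Guided Local Science and Technology Development Fund (2021Szvup056); Program of the State Key Laboratory for Novel Software Technology, Nanjing University (KFKT2022B12)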

Author:

DONG Shao-Kang (董绍康)
State Key Laboratory for Novel Software Technology (Nanjing University), Nanjing 210023, China
LI Chao (李超)
State Key Laboratory for Novel Software Technology (Nanjing University), Nanjing 210023, China
YANG Guang (杨光)
State Key Laboratory for Novel Software Technology (Nanjing University), Nanjing 210023, China
GE Zhen-Xing (葛振兴)
State Key Laboratory for Novel Software Technology (Nanjing University), Nanjing 210023, China
CAO Hong-Ye (曹宏业)
State Key Laboratory for Novel Software Technology (Nanjing University), Nanjing 210023, China
CHEN Wu-Bing (陈武兵)
State Key Laboratory for Novel Software Technology (Nanjing University), Nanjing 210023, China
YANG Shang-Dong (杨尚东)
State Key Laboratory for Novel Software Technology (Nanjing University), Nanjing 210023, China; School of Computer Science, Nanjing University of Posts and Telecommunications, Nanjing 210023, China
CHEN Xing-Guo (陈兴国)
State Key Laboratory for Novel Software Technology (Nanjing University), Nanjing 210023, China; School of Computer Science, Nanjing University of Posts and Telecommunications, Nanjing 210023, China
LI Wen-Bin (李文斌)
State Key Laboratory for Novel Software Technology (Nanjing University), Nanjing 210023, China
GAO Yang (高阳)
State Key Laboratory for Novel Software Technology (Nanjing University), Nanjing 210023, China

Abstract:

In recent years, artificial intelligence has advanced rapidly in sequential decision-making and adversarial games, with major progress in Go, video games, Texas hold'em poker, and Mahjong. Systems such as AlphaGo, OpenAI Five, AlphaStar, DeepStack, Libratus, Pluribus, and Suphx have matched or surpassed human experts in these domains. These applications, however, concentrate on zero-sum games between two players, two teams, or multiple players, whereas research on mixed-motive games has seen little substantive progress. Unlike zero-sum games, mixed-motive games require jointly considering several objectives, including individual returns, collective returns, and equilibrium returns, and they are widely applied in real-world scenarios such as public resource allocation, task scheduling, and autonomous driving. Research on mixed-motive games is therefore crucial. This survey reviews the key concepts and related work in the field and analyzes in depth the state of research in China and abroad as well as future directions. Specifically, it first introduces the definition and classification of mixed-motive games. It then elaborates on solution concepts and solution objectives: the solution concepts include Nash equilibrium, correlated equilibrium, and Pareto optimality, and the objectives include maximizing individual returns, maximizing collective returns, and accounting for fairness. Next, organized by solution objective, it discusses and analyzes game-theoretic methods, reinforcement learning methods, and combinations of the two. Finally, it introduces relevant application scenarios and experimental simulation environments, and concludes with a summary and outlook on future research directions.

Key words: mixed-motive game; game theory; reinforcement learning

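Cite this article

DONG Shao-Kang, LI Chao, YANG Guang, GE Zhen-Xing, CAO Hong-Ye, CHEN Wu-Bing, YANG Shang-Dong, CHEN Xing-Guo, LI Wen-Bin, GAO Yang. Survey on Solutions and Applications for Mixed-motive Games. Journal of Software, 2025, 36(1): 107-151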


History

Received: 2023-08-03
Revised: 2024-01-14
Published online: 2024-06-20
Published: 2025-01-06
