基于多智能体强化学习的协同目标分配
马悦, 吴琳, 许霄
Cooperative targets assignment based on multi-agent reinforcement learning
Yue MA, Lin WU, Xiao XU
图3
基于A2C算法的学习过程
Fig.3
Learning process based on A2C