Provably Efficient Fictitious Play Policy Optimization for
Zero-Sum Markov Games with Structured Transitions
Shuang Qiu 1 Xiaohan Wei 2 Jieping Ye 1 Zhaoran Wang 3 Zhuoran Yang 4
Abstract
While single-agent policy optimization in a fixed

understanding of multi-agent policy optimization, especially the zero-sum Markov game (Littman, 1994) via policy optimization, lags rather behind. Most recent works ...

