Learning Fair Policies in Decentralized Cooperative
Multi-Agent Reinforcement Learning
Matthieu Zimmer * 1 Claire Glanois * 1 Umer Siddique 1 Paul Weng 1 2
Abstract current main focus is on their performance with respect to
the total (or average) of some per-user efficiency measure
We consider the problem of learning fair policies
(e.g., waiting times of cars in traffi ...


雷达卡




京公网安备 11010802022788号







