Abstract: Previous multi-agent deep reinforcement learning (MADRL) algorithms have shown strong performance in symmetric scenarios. However, research on numerically disadvantaged scenarios is limited ...