Allowing different agents to optimize for different objective trade-offs—rather than forcing all agents to use the same preferences—improves both individual performance and team coordination in multi-objective cooperative settings.
This paper tackles multi-objective multi-agent reinforcement learning where teams must balance multiple conflicting goals while coordinating across agents with different roles. The authors propose PCMA, which learns different preference weights for each agent to enable better trade-offs between objectives and improve overall team performance.