Abstract
Potential-based reward shaping has previously been proven
to both be equivalent to Q-table initialisation and guarantee
policy invariance in single-agent reinforcement learning.
The method has since been used in multi-agent reinforcement
learning without consideration of whether the theoretical
equivalence and guarantees hold. This paper extends
the existing proofs to similar results in multi-agent systems,
providing the theoretical background to explain the success
of previous empirical studies. Specically, it is proven
that the equivalence to Q-table initialisation remains and
the Nash Equilibria of the underlying stochastic game are
not modied. Furthermore, we demonstrate empirically that
potential-based reward shaping eects exploration and,
consequentially, can alter the joint policy converged upon.
to both be equivalent to Q-table initialisation and guarantee
policy invariance in single-agent reinforcement learning.
The method has since been used in multi-agent reinforcement
learning without consideration of whether the theoretical
equivalence and guarantees hold. This paper extends
the existing proofs to similar results in multi-agent systems,
providing the theoretical background to explain the success
of previous empirical studies. Specically, it is proven
that the equivalence to Q-table initialisation remains and
the Nash Equilibria of the underlying stochastic game are
not modied. Furthermore, we demonstrate empirically that
potential-based reward shaping eects exploration and,
consequentially, can alter the joint policy converged upon.
Original language | English |
---|---|
Title of host publication | The 10th International Conference on Autonomous Agents and Multiagent Systems |
Publisher | ACM |
Pages | 225-232 |
ISBN (Electronic) | 0-9826571-5-3 |
ISBN (Print) | 978-0-9826571-5-7 |
Publication status | Published - May 2011 |
Event | Tenth International Conference on Autonomous Agents and Multi-Agent Systems - Taipeh, Taiwan Duration: 2 May 2011 → … |
Conference
Conference | Tenth International Conference on Autonomous Agents and Multi-Agent Systems |
---|---|
Country/Territory | Taiwan |
City | Taipeh |
Period | 2/05/11 → … |