Reinforcement learning of coordination in cooperative multi-agent systems

S Kapetanakis, D Kudenko

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

We report on an investigation of reinforcement learning techniques for the learning of coordination in cooperative multi-agent systems. Specifically, we focus on a novel action selection strategy for Q-learning (Watkins 1989). The new technique is applicable to scenarios where mutual observation of actions is not possible.

To date, reinforcement learning approaches for such independent agents did not guarantee convergence to the optimal joint action in scenarios with high miscoordination costs. We improve on previous results (Claus & Boutilier 1998) by, demonstrating empirically that our extension causes the agents to converge almost always to the optimal joint action even in these difficult cases.

Original languageEnglish
Title of host publicationEIGHTEENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE (AAAI-02)/FOURTEENTH INNOVATIVE APPLICATIONS OF ARTIFICIAL INTELLIGENCE CONFERENCE (IAAI-02), PROCEEDINGS
Place of PublicationCAMBRIDGE
PublisherMIT Press
Pages326-331
Number of pages6
ISBN (Print)0-262-51129-0
Publication statusPublished - 2002
EventAAAI-02 - Edmonton, Alberta, Canada
Duration: 28 Jul 20021 Aug 2002

Conference

ConferenceAAAI-02
Country/TerritoryCanada
CityEdmonton, Alberta
Period28/07/021/08/02

Cite this