Göm menyn
Files: Description Format
Fulltext PDF (requires Acrobat Reader)
Fulltext part 1 PostScript (requires a PostScript Reader)
  Fulltext part 2 PostScript (requires a PostScript Reader)
   
Authors: Kevin Irwig and Wayne Wobcke
Article title: Multi-Agent Reinforcement Learning with Vicarious Rewards
Publ. type: Article
Volume: 4
Article No: 34
Language: English
Abstract [en]: Reinforcement learning is the problem faced by an agent that must learn behaviour through trial-and-error interactions with a dynamicenvironment. In a multi-agent setting, the problem is often further complicated by the need to take into account the behaviour of otheragents in order to learn  to  perform effectively. Issues of coordination and cooperation must be addressed; in general, it is not sufficient for each agent to act selfishly in order to arrive at a globally optimal strategy. In this work, we apply the AdaptiveHeuristic Critic (AHC) and Q-learning algorithms to agents in a simple artificial multi-agent domain based on the Tileworld.We experimentally compare the performance of the AHC and Q-learning algorithms to each other as well as to a hand-coded greedy strategy.  The overall result is that AHC agents perform better than the others, particularly when many other agents are present or the world is dynamic.  We also examine the notion of global optimality in this system, and present a simple method of encouraging agents to learn cooperative behaviour, which we call vicarious reinforcement. The main result of this work is that agents that receive additional vicarious reinforcement perform better than selfish agents, even though the task being performed here is not inherently cooperative.
Publisher: Linköping University Electronic Press
Year: 1999
Available: 1999-12-30
No. of pages: 21
Series: Linköping Electronic Articles in Computer and Information Science
ISSN: 1401-9841


Responsible for this page: Peter Berkesand
Last updated: 2017-02-21