Integrating reinforcement learning, bidding and genetic algorithms
AbstractThis paper presents a GA-based multi-agent reinforce- ment learning bidding approach (GMARLB) for perform- ing multi-agent reinforcement learning. GMARLB inte- grates reinforcement learning, bidding and genetic algo- rithms. The general idea of our multi-agent systems is as follows: There are a number of individual agents in a team, each agent of the team has two modules: Q module and CQ module. Each agent can select actions to be performed at each step, which are done by the Q module. While the CQ module determines at each step whether the agent should continue or relinquish control. Once an agent relinquishes its control, a new agent is selected by bidding algorithms. We applied GA-based GMARLB to the Backgammon game. The experimental results show GMARLB can achieve a su- perior level of performance in game-playing, outperforming PubEval, while the system uses zero built-in knowledge.
Similar books and articles
Learning with Neighbours: Emergence of Convention in a Society of Learning Agents.Roland Mühlenbernd - 2011 - Synthese 183 (S1):87-109.
Bottom-Up Skill Learning in Reactive Sequential Decision Tasks.Ron Sun, Todd Peterson & Edward Merrill - unknown
Spontaneous Coordination and Evolutionary Learning Processes in an Agent-Based Model.Pierre Barbaroux & Gilles Enée - 2005 - Mind and Society 4 (2):179-195.
Brave Mobots Use Representation: Emergence of Representation in Fight-or-Flight Learning. [REVIEW]Chris Thornton - 1997 - Minds and Machines 7 (4):475-494.
An Evolutionary Game Theoretic Perspective on Learning in Multi-Agent Systems.Karl Tuyls, Ann Nowe, Tom Lenaerts & Bernard Manderick - 2004 - Synthese 139 (2):297 - 330.
Added to PP
Historical graph of downloads
Citations of this work
No citations found.
References found in this work
Computer Go: An AI Oriented Survey.Bruno Bouzy & Tristan Cazenave - 2001 - Artificial Intelligence 132 (1):39-103.
A Parallel Network That Learns to Play Backgammon.G. Tesauro & T. J. Sejnowski - 1989 - Artificial Intelligence 39 (3):357-390.