Q(st at):= (I — o')Q(st at) + o'(r(st+1)


Straightforward reinforcement learning for multi-agent co-learning settings often results in poor outcomes. Meta-learning processes beyond straightforward reinforcement learning may be necessary to achieve good (or optimal) outcomes. Algorithmic processes of meta-learning, or "manipulation", will be described, which is a cognitively realistic and effective means for learning cooperation. We will discuss various "manipulation" routines that address the issue of improving multi-agent co-learning. We hope to develop better adaptive means of multi-agent cooperation, without requiring a priori knowledge, and advance multi-agent co-learning beyond existing theories and techniques



    Upload a copy of this work     Papers currently archived: 74,509

External links

  • This entry has no external links. Add one.
Setup an account with your affiliations in order to access resources via your University's proxy server

Through your library

  • Only published works are available at libraries.

Similar books and articles

A Model for Updates in a Multi-Agent Setting.John Cantwell - 2007 - Journal of Applied Non-Classical Logics 17 (2):183-196.
Learning to Cooperate: Reciprocity and Self-Control.Peter Danielson - 2002 - Behavioral and Brain Sciences 25 (2):256-257.


Added to PP

6 (#1,095,852)

6 months
1 (#417,896)

Historical graph of downloads
How can I increase my downloads?

Citations of this work

No citations found.

Add more citations

References found in this work

No references found.

Add more references