Extending Environments To Measure Self-Reflection In Reinforcement Learning

Journal of Artificial General Intelligence 13 (1) (2022)

Abstract

We consider an extended notion of reinforcement learning in which the environment can simulate the agent and base its outputs on the agent's hypothetical behavior. Since good performance usually requires paying attention to whatever the environment's outputs depend on, we argue that for an agent to achieve good performance on average across many such extended environments, the agent must self-reflect. Weighted-average performance over the space of all suitably well-behaved extended environments could therefore be considered a measure of how self-reflective an agent is. We give examples of extended environments and introduce a simple transformation that experimentally appears to improve some standard RL agents' performance in a certain type of extended environment.
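
To make the "extended environment" idea concrete, here is a minimal Python sketch. It is illustrative only, not the paper's formalism: the ExtendedEnv and ConstantAgent names, the act(history) interface, and the factory the environment calls to simulate a fresh copy of the agent are all assumptions made for this example.

class ExtendedEnv:
    # Toy extended environment (illustrative): unlike a standard RL
    # environment, it holds a factory for the agent, so it can simulate
    # a fresh copy of the agent and base rewards on the agent's
    # hypothetical behavior.
    def __init__(self, make_agent):
        self.make_agent = make_agent  # callable returning a fresh agent
        self.history = []             # (observation, action) pairs so far

    def step(self, action, observation=0):
        # Ask a simulated copy of the agent what it *would* do on a
        # hypothetical history (here simply the empty history).
        simulated = self.make_agent()
        hypothetical_action = simulated.act(history=[])

        # Reward depends on the agent's hypothetical behavior: the agent
        # is rewarded iff its actual action matches its own hypothetical
        # first move. Two agents emitting identical action sequences can
        # thus receive different rewards.
        reward = 1.0 if action == hypothetical_action else 0.0
        self.history.append((observation, action))
        return observation, reward

class ConstantAgent:
    # Trivially self-consistent agent: always plays action 0.
    def act(self, history):
        return 0

env = ExtendedEnv(ConstantAgent)
obs, reward = env.step(action=0)
print(reward)  # 1.0: the action matches what a simulated copy would do

In a standard environment the reward is a function of the actual interaction history alone; here it also depends on what the agent would do in counterfactual situations, which is why good average performance across many such environments plausibly requires the agent to track its own behavior.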

Links

PhilArchive

Similar books and articles

The growth of learning during non-differential reinforcement. Allen D. Calvin - 1953 - Journal of Experimental Psychology 46 (4):248.
Some determinants of rigidity in discrimination-reversal learning. Arnold H. Buss - 1952 - Journal of Experimental Psychology 44 (3):222.
Determinants of the effects of vicarious reinforcement. Albert R. Marston - 1966 - Journal of Experimental Psychology 71 (4):550.
