This paper presents a GA-based multi-agent reinforcement learning bidding approach (GMARLB). GMARLB integrates reinforcement learning, bidding, and genetic algorithms. The general idea of our multi-agent system is as follows: there are a number of individual agents in a team, and each agent has two modules, a Q module and a CQ module. At each step, an agent selects actions to perform via its Q module, while its CQ module determines whether the agent should continue or relinquish control. Once an agent relinquishes control, a new agent is selected by a bidding algorithm. We applied GMARLB to the game of Backgammon. The experimental results show that GMARLB can achieve a superior level of game-playing performance, outperforming PubEval, while using zero built-in knowledge.
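The control-passing scheme summarized above (a Q module for action selection, a CQ module for deciding whether to keep control, and bidding to pick the next agent) can be sketched minimally as follows. All class, method, and parameter names here are illustrative, not taken from the paper:

```python
import random

class Agent:
    """One team member: a Q module (action values) and a CQ module (control value)."""
    def __init__(self, n_actions, seed=0):
        self.rng = random.Random(seed)
        self.q = [0.0] * n_actions   # Q module: estimated value of each action
        self.cq = 0.0                # CQ module: estimated value of keeping control

    def act(self, eps=0.1):
        """Epsilon-greedy action selection via the Q module."""
        if self.rng.random() < eps:
            return self.rng.randrange(len(self.q))
        return max(range(len(self.q)), key=self.q.__getitem__)

    def keeps_control(self):
        """CQ module decides whether to continue or relinquish control."""
        return self.cq >= 0.0

    def bid(self):
        """An agent's bid: here, its best available action value."""
        return max(self.q)

def select_by_bidding(agents):
    """When control is relinquished, the highest bidder takes over."""
    return max(agents, key=Agent.bid)
```

In this sketch an episode alternates between the controlling agent's `act` calls and `keeps_control` checks, with `select_by_bidding` invoked whenever control is given up.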
I argue for the role of reinforcement learning in the philosophy of mind. To start, I make several assumptions about the nature of reinforcement learning and its instantiation in minds like ours. I then review some of the contributions reinforcement learning methods have made across the so-called 'decision sciences.' Finally, I show how principles from reinforcement learning can shape philosophical debates regarding the nature of perception and characterisations of desire.
Relational database management systems (RDBMS) are the most popular type of database system, and it is important to protect their data from information leakage and corruption. An RDBMS can be attacked by an outsider or an insider; insider attacks are difficult to detect because their patterns are constantly changing and evolving. In this paper, we propose an adaptive database intrusion detection system that is resistant to potential insider misuse, using evolutionary reinforcement learning, which combines reinforcement learning and evolutionary learning. The model consists of two neural networks: an action network, which detects intrusions, and an evaluation network, which provides feedback on the action network's detections. Evolutionary learning is effective for dynamic and atypical patterns, and reinforcement learning enables online learning. Experimental results on virtual query data based on Transaction Processing Performance Council-E scenarios show that detection performance for abnormal queries improves as the proposed model learns intrusions adaptively. The proposed method achieves a peak performance of 94.86%, and we demonstrate its usefulness through 5-fold cross-validation.
Animals routinely adapt to changes in the environment in order to survive. Though reinforcement learning may play a role in such adaptation, it is not clear that it is the only mechanism involved, as it is not well suited to producing rapid, relatively immediate changes in strategy in response to environmental changes. This research proposes counterfactual reasoning as an additional mechanism that facilitates change detection. An experiment was conducted in which a task's state changed over time and participants had to detect the changes in order to perform well and earn monetary rewards. A cognitive model was constructed that combines reinforcement learning with counterfactual reasoning to quickly adjust the utility of task strategies in response to changes. The results show that the model accurately explains the human data and that counterfactual reasoning is key to reproducing the various effects observed in this change-detection paradigm.
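The core mechanism described above, a reinforcement update for the chosen strategy plus counterfactual updates for the strategies not taken, can be sketched as follows. This is a toy illustration: the function name, the single shared learning rate, and the dictionary representation are assumptions, not the paper's model:

```python
def update_utilities(utils, chosen, reward, counterfactuals, alpha=0.3):
    """Update strategy utilities after one trial.

    utils           : dict mapping strategy name -> current utility
    chosen          : strategy actually used; updated from the real reward
    counterfactuals : dict of foregone rewards ("what would I have earned?")
                      for strategies not taken, updated counterfactually
    """
    new = dict(utils)
    # Standard reinforcement update for the chosen strategy
    new[chosen] += alpha * (reward - new[chosen])
    # Counterfactual updates let unchosen strategies gain (or lose) utility
    # immediately, so a change in the environment is detected quickly
    for s, cf_reward in counterfactuals.items():
        if s != chosen:
            new[s] += alpha * (cf_reward - new[s])
    return new
```

After an environmental change, the counterfactual term raises the utility of the now-better strategy even while the agent is still choosing the old one, which is what speeds up switching.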
This paper addresses weighting and partitioning in complex reinforcement learning tasks, with the aim of facilitating learning. It presents some ideas regarding the weighting of multiple agents and extends them to partitioning an input/state space into multiple regions with differential weighting in these regions, exploiting the differing characteristics of regions and of agents to reduce the learning complexity of the agents (and their function approximators) and thus facilitate learning overall. It analyzes different ways of partitioning a reinforcement learning task and using agents selectively based on the partitioning. Based on this analysis, several heuristic methods are described and experimentally tested. We find that some off-line heuristic methods performed best, significantly better than single-agent models.
We investigate a simple stochastic model of social network formation by the process of reinforcement learning with discounting of the past. In the limit, for any value of the discounting parameter, small, stable cliques are formed. However, the time it takes to reach the limiting state in which cliques have formed is very sensitive to the discounting parameter. Depending on this value, the limiting result may or may not be a good predictor for realistic observation times.
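Reinforcement learning with discounting of the past, as in the abstract above, can be sketched as a Roth-Erev-style propensity update: each agent picks partners with probability proportional to accumulated edge weights, decays all weights, and reinforces the realized interaction. This is a minimal sketch of the general mechanism, not the paper's exact specification; all names and the uniform initial weights are illustrative:

```python
import random

def discounted_reinforcement(n_agents, steps, discount, seed=0):
    """Simulate partner choice with reinforcement and discounting.

    Returns the matrix w, where w[i][j] is agent i's propensity to
    interact with agent j after the given number of rounds.
    """
    rng = random.Random(seed)
    w = [[1.0] * n_agents for _ in range(n_agents)]  # uniform initial propensities
    for _ in range(steps):
        for i in range(n_agents):
            others = [j for j in range(n_agents) if j != i]
            weights = [w[i][j] for j in others]
            j = rng.choices(others, weights=weights)[0]  # pick a partner
            for k in range(n_agents):
                w[i][k] *= discount                      # discount the past
            w[i][j] += 1.0                               # reinforce this visit
    return w
```

With strong discounting, recent interactions dominate the propensities, so each agent's choices quickly lock onto a few partners, which is the clique-formation behaviour the model studies.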
A fundamental question in reading research concerns whether attention is allocated strictly serially, supporting lexical processing of one word at a time, or in parallel, supporting concurrent lexical processing of two or more words (Reichle, Liversedge, Pollatsek, & Rayner, 2009). The origins of this debate are reviewed. We then report three simulations to address this question using artificial reading agents (Liu & Reichle, 2010; Reichle & Laurent, 2006) that learn to dynamically allocate attention to 1–4 words to "read" as efficiently as possible. These simulation results indicate that the agents strongly preferred serial word processing, although they occasionally attended to more than one word concurrently. The reason for this preference is discussed, along with implications for the debate about how humans allocate attention during reading.
Smart power grids must constantly improve their efficiency and resilience to provide high-quality electrical power while managing faults and avoiding failures. Achieving this requires high component reliability, adequate maintenance, and a studied failure occurrence. Correct system operation involves those activities and novel methodologies to detect, classify, and isolate faults and failures, and to model and simulate processes with predictive algorithms and analytics. In this paper, we showcase the application of a complex-adaptive, self-organizing modeling method, Probabilistic Boolean Networks (PBNs), as a way toward understanding the dynamics of smart grid devices and modeling and characterizing their behavior. This work demonstrates that PBNs are equivalent to the standard reinforcement learning cycle, in which the agent/model interacts with its environment and receives feedback from it in the form of a reward signal. Different reward structures were created to characterize preferred behavior; this information can be used to guide the PBN away from fault conditions and failures.
A deep reinforcement learning-based computational guidance method is presented for identifying and resolving collision-avoidance conflicts among a variable number of fixed-wing UAVs in limited airspace. The cooperative guidance process for multiple aircraft is first analyzed by formulating flight scenarios as a multiagent Markov game and solving it with a machine learning algorithm. Furthermore, a self-learning framework based on the actor-critic model is established to train collision-avoidance decision-making neural networks. To achieve higher scalability, the neural network incorporates long short-term memory networks, and a coordination strategy is given. Additionally, a simulator suitable for multiagent high-density route scenes is designed for validation, in which every UAV runs the proposed algorithm onboard. Simulation results from several case studies show that the real-time guidance algorithm effectively reduces the collision probability of multiple UAVs in flight, even with a large number of aircraft.
The paper presents an approach for developing multi-agent reinforcement learning systems made up of a coalition of modular agents. We focus on learning to segment sequences (sequential decision tasks) to create modular structures, through a bidding process based on reinforcements received during task execution. The approach segments sequences (and divides them up among agents) to facilitate learning of the overall task. Notably, it does not rely on a priori knowledge or a priori structures. Initial experiments demonstrate the basic promise of the approach. This work shows how bidding and reinforcement learning can be usefully combined, pointing to a new research direction.
Autonomous underwater vehicles (AUVs) are widely used to accomplish various missions in the complex marine environment; the design of a control system for AUVs is particularly difficult due to high nonlinearity, variations in hydrodynamic coefficients, and external forces from ocean currents. In this paper, we propose a controller based on deep reinforcement learning (RL) in a simulation environment for studying the control performance of a vectored-thruster AUV. RL is an important artificial intelligence method that learns behavior through trial-and-error interactions with the environment, so it does not require an accurate AUV control model, which is very hard to establish. The proposed RL algorithm uses only information measurable by sensors inside the AUV as input, and the outputs of the designed controller are continuous control actions, i.e., the commands sent to the vectored thruster. Moreover, a reward function is developed for the deep RL controller that considers the different factors affecting the accuracy of AUV navigation control. To confirm the algorithm's effectiveness, a series of simulations is carried out in the designed simulation environment, which saves time and improves efficiency. Simulation results prove the feasibility of applying the deep RL algorithm to AUV control systems. Furthermore, our work provides an optional method for robot control problems facing rising technology requirements and complicated application environments.
After generalizing the Archimedean property of real numbers in such a way as to make it adaptable to non-numeric structures, we demonstrate that the real numbers cannot be used to accurately measure non-Archimedean structures. We argue that, since an agent with Artificial General Intelligence (AGI) should have no problem engaging in tasks that inherently involve non-Archimedean rewards, and since traditional reinforcement learning rewards are real numbers, traditional reinforcement learning probably will not lead to AGI. We indicate two possible ways traditional reinforcement learning could be altered to remove this roadblock.
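The difficulty with real-valued rewards can be illustrated with lexicographic (hence non-Archimedean) reward pairs, where the first component dominates the second no matter how large the second gets. This is a toy example in the spirit of the paper's argument, not its formal construction:

```python
def lex_better(r1, r2):
    """Lexicographic comparison of reward pairs: (a, b) beats (c, d)
    whenever a > c, regardless of how large d is."""
    return r1 > r2  # Python compares tuples lexicographically

def scalarize(r, M):
    """A candidate real-valued encoding u(a, b) = M*a + b for a fixed
    finite M (an illustrative scalarization, doomed to fail)."""
    return M * r[0] + r[1]
```

For any fixed finite M, a large enough second component defeats the encoding: the scalarized ordering disagrees with the lexicographic one, which is the Archimedean obstacle in miniature.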
Methods to improve reinforcement learning are identified and discussed in some detail. Each demonstrates, to some extent, the advantages of combining RL and symbolic methods. These methods point to the potential and the challenges of this line of research.
Sequential action makes up the bulk of human daily activity, yet much remains unknown about how people learn such actions. In one motor learning paradigm, the serial reaction time (SRT) task, people are taught a consistent sequence of button presses by cueing them with the next target response. However, the SRT task only records keypress response times to a cued target, and thus cannot reveal the full time-course of motion, including predictive movements. This paper describes a mouse movement trajectory SRT task in which the cursor must be moved to a cued location. We replicated keypress SRT results, but also found that predictive movement, before the next cue appears, increased during the experiment. Moreover, trajectory analyses revealed that people developed a centering strategy under uncertainty. In a second experiment, we made prediction explicit, no longer cueing targets. Thus, participants had to explore the response alternatives and learn via reinforcement, receiving rewards and penalties for correct and incorrect actions, respectively. Participants were not told whether the sequence of stimuli was deterministic, whether it would repeat, or how long it was. Given the difficulty of the task, it is unsurprising that some learners performed poorly. However, many learners performed remarkably well, and some acquired the full 10-item sequence within 10 repetitions. Comparing the detailed results of the high- and low-performers in this reinforcement learning task with the first experiment's cued trajectory SRT task, we found similarities between the two tasks, suggesting that the effects in Experiment 1 are due to predictive, rather than reactive, processes. Finally, we found that two standard model-free reinforcement learning models fit the high-performing participants well, while the four low-performing participants were better fit by a simple negative recency bias model.
Rapid and precise air-operation mission planning is a key technology for autonomous combat by unmanned aerial vehicles (UAVs). In this paper, an end-to-end UAV intelligent mission planning method based on deep reinforcement learning (DRL) is proposed to address the shortcomings of traditional intelligent optimization algorithms, such as reliance on simple, static, low-dimensional scenarios and poor scalability. Specifically, the suppression of enemy air defenses (SEAD) mission planning problem is described as a sequential decision-making problem and formalized as a Markov decision process. Then, a SEAD intelligent planning model based on the proximal policy optimization (PPO) algorithm is established, and a general intelligent planning architecture is proposed. Furthermore, three policy-training tricks, i.e., domain randomization, maximizing policy entropy, and underlying network parameter sharing, are introduced to improve the learning performance and generalizability of PPO. Experimental results show that the model is efficient and stable, and can adapt to unknown, continuous, high-dimensional environments. We conclude that the DRL-based UAV intelligent mission planning model has powerful planning performance and provides a new idea for research on UAV autonomy.
We welcome Soltis' use of evolutionary signaling theory, but question his interpretations of colic as a signal of vigor and his explanation of abnormal high-pitched crying as a signal of poor infant quality. Instead, we suggest that these phenomena may be suboptimal by-products of a generally adaptive learning process by which infants adjust their crying levels in relation to parental responsiveness.
…extraction from reinforcement learners. It addresses two approaches towards knowledge extraction: the extraction of explicit symbolic rules from neural reinforcement learners, and the extraction of complete plans from such learners. The advantages of such knowledge extraction include the improvement of learning (especially with the rule-extraction approach) and the improvement of the usability of the results of learning.
Recent years have yielded many discussions of how to endow autonomous agents with the ability to make ethical decisions, and the need for explicit ethical reasoning and transparency is a persistent theme in this literature. We present a modular and transparent approach for equipping autonomous agents with the ability to comply with ethical prescriptions while still enacting pre-learned optimal behaviour. Our approach relies on a normative supervisor module that integrates a theorem prover for defeasible deontic logic within the control loop of a reinforcement learning agent. The supervisor operates as both an event recorder and an on-the-fly compliance checker with respect to an external norm base. We successfully evaluated our approach with several tests using variations of the game Pac-Man, subject to a variety of "ethical" constraints.
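The supervisor's compliance-checking role can be caricatured as an action filter wrapped around the learned policy. The real system uses a theorem prover for defeasible deontic logic; this sketch substitutes plain predicate norms, and all names (including the fallback behaviour when every action is forbidden) are illustrative assumptions:

```python
def compliant_actions(actions, norms, state):
    """Keep the actions that no norm forbids in the current state.
    Each norm is a predicate norm(action, state) -> True if forbidden."""
    return [a for a in actions if not any(forbids(a, state) for forbids in norms)]

def supervised_step(policy, actions, norms, state):
    """Pick the best pre-learned action among the norm-compliant ones.
    policy maps (state, action) pairs to learned values."""
    allowed = compliant_actions(actions, norms, state)
    pool = allowed or actions  # fall back to all actions if none comply
    return max(pool, key=lambda a: policy.get((state, a), 0.0))
```

The point of the modular design is visible even in the toy version: the norm base can change without retraining the policy, and the policy can be retrained without touching the norms.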
Cheap talk has often been thought incapable of supporting the emergence of cooperation, because costless signals, easily faked, are unlikely to be reliable. I show how, in a social network model of cheap talk with reinforcement learning, cheap talk does enable the emergence of cooperation, provided that individuals also temporally discount the past. This establishes one mechanism that suffices to move a population of initially uncooperative individuals to a state of mutually beneficial cooperation, even in the absence of formal institutions.
Reinforcement learning approaches to cognitive modeling represent task acquisition as learning to choose the sequence of steps that accomplishes the task while maximizing a reward. However, an apparently unrecognized problem for modelers is choosing when, what, and how much to reward: when (the moment: end of trial, subtask, or some other interval of task performance), what (the objective function: e.g., performance time or performance accuracy), and how much (the magnitude: binary, categorical, or continuous values). In this article, we explore the problem space of these three parameters in the context of a task whose completion entails some combination of 36 state-action pairs, where all intermediate states (i.e., after the initial state and prior to the end state) represent progressive but partial completion of the task. Different choices produce profoundly different learning paths and outcomes, with the strongest effect for moment. Unfortunately, there is little discussion in the literature of the effect of such choices. This absence is disappointing, as the choice of when, what, and how much must be made by a modeler for every learning model.
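The three design choices above, when (moment), what (objective), and how much (magnitude), can be made explicit as parameters of a reward function. This schematic sketch uses illustrative names and event fields; it is not the article's implementation:

```python
def reward(event, moment="end_of_trial", objective="accuracy", magnitude="binary"):
    """Compute the reward for one event in a learning model.

    event: dict with 'phase' (e.g. 'subtask' or 'trial_end'),
    'correct' (bool), and 'rt' (response time in seconds).
    """
    # When: does this event trigger a reward at all?
    if moment == "end_of_trial" and event["phase"] != "trial_end":
        return 0.0
    # What: which objective function is being rewarded?
    raw = float(event["correct"]) if objective == "accuracy" else -event["rt"]
    # How much: binary vs. continuous magnitude
    if magnitude == "binary":
        return 1.0 if raw > 0 else 0.0
    return raw
```

Making the three parameters explicit like this is exactly what forces a modeler to state, rather than silently assume, the reward design for each model.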