Results for ' Reinforcement learning'

988 found
Order:
See also
  1.  35
    Reinforcement learning and artificial agency.Patrick Butlin - 2024 - Mind and Language 39 (1):22-38.
    There is an apparent connection between reinforcement learning and agency. Artificial entities controlled by reinforcement learning algorithms are standardly referred to as agents, and the mainstream view in the psychology and neuroscience of agency is that humans and other animals are reinforcement learners. This article examines this connection, focusing on artificial reinforcement learning systems and assuming that there are various forms of agency. Artificial reinforcement learning systems satisfy plausible conditions for minimal (...)
    No categories
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark   1 citation  
  2.  7
    Reinforcement Learning for Production‐Based Cognitive Models.Adrian Brasoveanu & Jakub Dotlačil - 2021 - Topics in Cognitive Science 13 (3):467-487.
    We investigate how Reinforcement Learning methods can be used to solve the production selection and production ordering problem in ACT‐R. We focus on four algorithms from the Q learning family, tabular Q and three versions of Deep Q Networks, as well as the ACT‐R utility learning algorithm, which provides a baseline for the Q algorithms. We compare the performance of these five algorithms in a range of lexical decision tasks framed as sequential decision problems.
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  3. Reinforcement learning: A brief guide for philosophers of mind.Julia Haas - 2022 - Philosophy Compass 17 (9):e12865.
    I argue for the role of reinforcement learning in the philosophy of mind. To start, I make several assumptions about the nature of reinforcement learning and its instantiation in minds like ours. I then review some of the contributions of reinforcement learning methods have made across the so-called 'decision sciences.' Finally, I show how principles from reinforcement learning can shape philosophical debates regarding the nature of perception and characterisations of desire.
    Direct download (5 more)  
     
    Export citation  
     
    Bookmark   1 citation  
  4.  15
    Evolutionary Reinforcement Learning for Adaptively Detecting Database Intrusions.Seul-Gi Choi & Sung-Bae Cho - 2020 - Logic Journal of the IGPL 28 (4):449-460.
    Relational database management system is the most popular database system. It is important to maintain data security from information leakage and data corruption. RDBMS can be attacked by an outsider or an insider. It is difficult to detect an insider attack because its patterns are constantly changing and evolving. In this paper, we propose an adaptive database intrusion detection system that can be resistant to potential insider misuse using evolutionary reinforcement learning, which combines reinforcement learning and (...)
    Direct download (3 more)  
     
    Export citation  
     
    Bookmark   1 citation  
  5.  42
    Reconciling reinforcement learning models with behavioral extinction and renewal: Implications for addiction, relapse, and problem gambling.A. David Redish, Steve Jensen, Adam Johnson & Zeb Kurth-Nelson - 2007 - Psychological Review 114 (3):784-805.
    No categories
    Direct download (4 more)  
     
    Export citation  
     
    Bookmark   21 citations  
  6.  44
    Reinforcement Learning and Counterfactual Reasoning Explain Adaptive Behavior in a Changing Environment.Yunfeng Zhang, Jaehyon Paik & Peter Pirolli - 2015 - Topics in Cognitive Science 7 (2):368-381.
    Animals routinely adapt to changes in the environment in order to survive. Though reinforcement learning may play a role in such adaptation, it is not clear that it is the only mechanism involved, as it is not well suited to producing rapid, relatively immediate changes in strategies in response to environmental changes. This research proposes that counterfactual reasoning might be an additional mechanism that facilitates change detection. An experiment is conducted in which a task state changes over time (...)
    Direct download (4 more)  
     
    Export citation  
     
    Bookmark  
  7.  55
    Deep Reinforcement Learning as Foundation for Artificial General Intelligence.Itamar Arel - 2012 - In Pei Wang & Ben Goertzel (eds.), Theoretical Foundations of Artificial General Intelligence. Springer. pp. 89--102.
    No categories
    Direct download  
     
    Export citation  
     
    Bookmark  
  8.  43
    Reinforcement learning: A brief guide for philosophers of mind.Julia Haas - 2022 - Philosophy Compass 17 (9):e12865.
    In this opinionated review, I draw attention to some of the contributions reinforcement learning can make to questions in the philosophy of mind. In particular, I highlight reinforcement learning's foundational emphasis on the role of reward in agent learning, and canvass two ways in which the framework may advance our understanding of perception and motivation.
    No categories
    Direct download (4 more)  
     
    Export citation  
     
    Bookmark   1 citation  
  9.  3
    Reinforcement Learning with Probabilistic Boolean Network Models of Smart Grid Devices.Pedro Juan Rivera Torres, Carlos Gershenson García, María Fernanda Sánchez Puig & Samir Kanaan Izquierdo - 2022 - Complexity 2022:1-15.
    The area of smart power grids needs to constantly improve its efficiency and resilience, to provide high quality electrical power in a resilient grid, while managing faults and avoiding failures. Achieving this requires high component reliability, adequate maintenance, and a studied failure occurrence. Correct system operation involves those activities and novel methodologies to detect, classify, and isolate faults and failures and model and simulate processes with predictive algorithms and analytics. In this paper, we showcase the application of a complex-adaptive, self-organizing (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  10.  21
    Using reinforcement learning to understand the emergence of "intelligent" eye-movement behavior during reading.Erik D. Reichle & Patryk A. Laurent - 2006 - Psychological Review 113 (2):390-408.
    No categories
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark   6 citations  
  11.  25
    Reinforcement learning and higher level cognition: Introduction to special issue.Nathaniel D. Daw & Michael J. Frank - 2009 - Cognition 113 (3):259-261.
  12.  7
    Reinforcement Learning-Based Collision Avoidance Guidance Algorithm for Fixed-Wing UAVs.Yu Zhao, Jifeng Guo, Chengchao Bai & Hongxing Zheng - 2021 - Complexity 2021:1-12.
    A deep reinforcement learning-based computational guidance method is presented, which is used to identify and resolve the problem of collision avoidance for a variable number of fixed-wing UAVs in limited airspace. The cooperative guidance process is first analyzed for multiple aircraft by formulating flight scenarios using multiagent Markov game theory and solving it by machine learning algorithm. Furthermore, a self-learning framework is established by using the actor-critic model, which is proposed to train collision avoidance decision-making neural (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  13. Using Reinforcement Learning to Examine Dynamic Attention Allocation During Reading.Yanping Liu, Erik D. Reichle & Ding-Guo Gao - 2013 - Cognitive Science 37 (8):1507-1540.
    A fundamental question in reading research concerns whether attention is allocated strictly serially, supporting lexical processing of one word at a time, or in parallel, supporting concurrent lexical processing of two or more words (Reichle, Liversedge, Pollatsek, & Rayner, 2009). The origins of this debate are reviewed. We then report three simulations to address this question using artificial reading agents (Liu & Reichle, 2010; Reichle & Laurent, 2006) that learn to dynamically allocate attention to 1–4 words to “read” as efficiently (...)
    Direct download  
     
    Export citation  
     
    Bookmark   1 citation  
  14.  9
    Relational reinforcement learning with guided demonstrations.David Martínez, Guillem Alenyà & Carme Torras - 2017 - Artificial Intelligence 247 (C):295-312.
  15.  13
    Integrating reinforcement learning, equilibrium points, and minimum variance to understand the development of reaching: A computational model.Daniele Caligiore, Domenico Parisi & Gianluca Baldassarre - 2014 - Psychological Review 121 (3):389-421.
    No categories
    Direct download (3 more)  
     
    Export citation  
     
    Bookmark   1 citation  
  16.  17
    Reinforcement learning for Golog programs with first-order state-abstraction.D. Beck & G. Lakemeyer - 2012 - Logic Journal of the IGPL 20 (5):909-942.
  17.  9
    Deep Reinforcement Learning for UAV Intelligent Mission Planning.Longfei Yue, Rennong Yang, Ying Zhang, Lixin Yu & Zhuangzhuang Wang - 2022 - Complexity 2022:1-13.
    Rapid and precise air operation mission planning is a key technology in unmanned aerial vehicles autonomous combat in battles. In this paper, an end-to-end UAV intelligent mission planning method based on deep reinforcement learning is proposed to solve the shortcomings of the traditional intelligent optimization algorithm, such as relying on simple, static, low-dimensional scenarios, and poor scalability. Specifically, the suppression of enemy air defense mission planning is described as a sequential decision-making problem and formalized as a Markov decision (...)
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  18.  25
    Predictive Movements and Human Reinforcement Learning of Sequential Action.Roy de Kleijn, George Kachergis & Bernhard Hommel - 2018 - Cognitive Science 42 (S3):783-808.
    Sequential action makes up the bulk of human daily activity, and yet much remains unknown about how people learn such actions. In one motor learning paradigm, the serial reaction time (SRT) task, people are taught a consistent sequence of button presses by cueing them with the next target response. However, the SRT task only records keypress response times to a cued target, and thus it cannot reveal the full time‐course of motion, including predictive movements. This paper describes a mouse (...)
    No categories
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark   1 citation  
  19. Integrating reinforcement learning, bidding and genetic algorithms.Ron Sun - unknown
    This paper presents a GA-based multi-agent reinforce- ment learning bidding approach (GMARLB) for perform- ing multi-agent reinforcement learning. GMARLB inte- grates reinforcement learning, bidding and genetic algo- rithms. The general idea of our multi-agent systems is as follows: There are a number of individual agents in a team, each agent of the team has two modules: Q module and CQ module. Each agent can select actions to be performed at each step, which are done by (...)
     
    Export citation  
     
    Bookmark  
  20.  14
    Reinforcement Learning in Autism Spectrum Disorder.Manuela Schuetze, Christiane S. Rohr, Deborah Dewey, Adam McCrimmon & Signe Bray - 2017 - Frontiers in Psychology 8.
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark   1 citation  
  21.  24
    Deep Reinforcement Learning for Vectored Thruster Autonomous Underwater Vehicle Control.Tao Liu, Yuli Hu & Hui Xu - 2021 - Complexity 2021:1-25.
    Autonomous underwater vehicles are widely used to accomplish various missions in the complex marine environment; the design of a control system for AUVs is particularly difficult due to the high nonlinearity, variations in hydrodynamic coefficients, and external force from ocean currents. In this paper, we propose a controller based on deep reinforcement learning in a simulation environment for studying the control performance of the vectored thruster AUV. RL is an important method of artificial intelligence that can learn behavior (...)
    No categories
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  22.  6
    Reinforcement learning with limited reinforcement: Using Bayes risk for active learning in POMDPs.Finale Doshi-Velez, Joelle Pineau & Nicholas Roy - 2012 - Artificial Intelligence 187-188 (C):115-132.
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  23.  10
    “Reconciling reinforcement learning models with behavioral extinction and renewal: Implications for addiction, relapse, and problem gambling”: Correction.David A. Redish, Steve Jensen, Adam Johnson & Zeb Kurth-Nelson - 2009 - Psychological Review 116 (3):518-518.
  24.  4
    Reinforcement learning of non-Markov decision processes.Steven D. Whitehead & Long-Ji Lin - 1995 - Artificial Intelligence 73 (1-2):271-306.
  25.  29
    Reinforcement learning of self-regulated β-oscillations for motor restoration in chronic stroke.Georgios Naros & Alireza Gharabaghi - 2015 - Frontiers in Human Neuroscience 9.
  26.  9
    Framing reinforcement learning from human reward: Reward positivity, temporal discounting, episodicity, and performance.W. Bradley Knox & Peter Stone - 2015 - Artificial Intelligence 225 (C):24-50.
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark   1 citation  
  27. Reinforcement learning.Chris Jch Watkins & Peter Dayan - 2003 - In L. Nadel (ed.), Encyclopedia of Cognitive Science. Nature Publishing Group.
  28.  10
    Reinforcement Learning With Parsimonious Computation and a Forgetting Process.Asako Toyama, Kentaro Katahira & Hideki Ohira - 2019 - Frontiers in Human Neuroscience 13.
  29.  7
    Decentralized Reinforcement Learning of Robot Behaviors.David L. Leottau, Javier Ruiz-del-Solar & Robert Babuška - 2018 - Artificial Intelligence 256 (C):130-159.
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  30.  9
    Reinforcement Learning.Oliver Kramer - 2009 - In L. Magnani (ed.), Computational Intelligence. pp. 101--117.
  31.  7
    Reinforcement learning in factories: the auton project.Andrew W. Moore - 1996 - In Garrison W. Cottrell (ed.), Proceedings of the Eighteenth Annual Conference of the Cognitive Science Society. Lawrence Erlbaum. pp. 18--12.
  32.  40
    Can reinforcement learning explain variation in early infant crying?Arnon Lotem & David W. Winkler - 2004 - Behavioral and Brain Sciences 27 (4):468-468.
    We welcome Soltis' use of evolutionary signaling theory, but question his interpretations of colic as a signal of vigor and his explanation of abnormal high-pitched crying as a signal of poor infant quality. Instead, we suggest that these phenomena may be suboptimal by-products of a generally adaptive learning process by which infants adjust their crying levels in relation to parental responsiveness.
    Direct download (7 more)  
     
    Export citation  
     
    Bookmark   1 citation  
  33.  15
    Predictive Movements and Human Reinforcement Learning of Sequential Action.Roy Kleijn, George Kachergis & Bernhard Hommel - 2018 - Cognitive Science 42 (S3):783-808.
    Sequential action makes up the bulk of human daily activity, and yet much remains unknown about how people learn such actions. In one motor learning paradigm, the serial reaction time (SRT) task, people are taught a consistent sequence of button presses by cueing them with the next target response. However, the SRT task only records keypress response times to a cued target, and thus it cannot reveal the full time‐course of motion, including predictive movements. This paper describes a mouse (...)
    No categories
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark   1 citation  
  34. Cheap talk, reinforcement learning, and the emergence of cooperation.J. McKenzie Alexander - 2015 - Philosophy of Science 82 (5):969-982.
    Cheap talk has often been thought incapable of supporting the emergence of cooperation because costless signals, easily faked, are unlikely to be reliable (Zahavi and Zahavi, 1997). I show how, in a social network model of cheap talk with reinforcement learning, cheap talk does enable the emergence of cooperation, provided that individuals also temporally discount the past. This establishes one mechanism that suffices for moving a population of initially uncooperative individuals to a state of mutually beneficial cooperation even (...)
    Direct download (7 more)  
     
    Export citation  
     
    Bookmark  
  35. The Archimedean trap: Why traditional reinforcement learning will probably not yield AGI.Samuel Allen Alexander - 2020 - Journal of Artificial General Intelligence 11 (1):70-85.
    After generalizing the Archimedean property of real numbers in such a way as to make it adaptable to non-numeric structures, we demonstrate that the real numbers cannot be used to accurately measure non-Archimedean structures. We argue that, since an agent with Artificial General Intelligence (AGI) should have no problem engaging in tasks that inherently involve non-Archimedean rewards, and since traditional reinforcement learning rewards are real numbers, therefore traditional reinforcement learning probably will not lead to AGI. We (...)
    Direct download  
     
    Export citation  
     
    Bookmark   1 citation  
  36.  42
    Integrating reinforcement learning with models of representation learning.Matt Jones & Fabián Canas - 2010 - In S. Ohlsson & R. Catrambone (eds.), Proceedings of the 32nd Annual Conference of the Cognitive Science Society. Cognitive Science Society. pp. 1258--1263.
  37. Reinforcement learning with raw image pixels as state input.D. Ernst, R. Marée & L. Wehenkel - 2006 - In O. Stock & M. Schaerf (eds.), Lecture Notes in Computer Science. Springer Verlag. pp. 4153.
  38.  18
    A reinforcement learning approach to gait training improves retention.Christopher J. Hasson, Julia Manczurowsky & Sheng-Che Yen - 2015 - Frontiers in Human Neuroscience 9.
  39.  7
    Certified reinforcement learning with logic guidance.Hosein Hasanbeig, Daniel Kroening & Alessandro Abate - 2023 - Artificial Intelligence 322 (C):103949.
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
  40.  38
    A survey of inverse reinforcement learning: Challenges, methods and progress.Saurabh Arora & Prashant Doshi - 2021 - Artificial Intelligence 297 (C):103500.
  41.  12
    Enforcing ethical goals over reinforcement-learning policies.Guido Governatori, Agata Ciabattoni, Ezio Bartocci & Emery A. Neufeld - 2022 - Ethics and Information Technology 24 (4):1-19.
    Recent years have yielded many discussions on how to endow autonomous agents with the ability to make ethical decisions, and the need for explicit ethical reasoning and transparency is a persistent theme in this literature. We present a modular and transparent approach to equip autonomous agents with the ability to comply with ethical prescriptions, while still enacting pre-learned optimal behaviour. Our approach relies on a normative supervisor module, that integrates a theorem prover for defeasible deontic logic within the control loop (...)
    No categories
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark   1 citation  
  42. Multi-Agent Reinforcement Learning: Weighting and Partitioning.Ron Sun & Todd Peterson - unknown
    This paper addresses weighting and partitioning in complex reinforcement learning tasks, with the aim of facilitating learning. The paper presents some ideas regarding weighting of multiple agents and extends them into partitioning an input/state space into multiple regions with di erential weighting in these regions, to exploit di erential characteristics of regions and di erential characteristics of agents to reduce the learning complexity of agents (and their function approximators) and thus to facilitate the learning overall. (...)
     
    Export citation  
     
    Bookmark   6 citations  
  43.  29
    Network formation by reinforcement learning: The long and medium run.Brian Skyrms - unknown
    We investigate a simple stochastic model of social network formation by the process of reinforcement learning with discounting of the past. In the limit, for any value of the discounting parameter, small, stable cliques are formed. However, the time it takes to reach the limiting state in which cliques have formed is very sensitive to the discounting parameter. Depending on this value, the limiting result may or may not be a good predictor for realistic observation times.
    Direct download  
     
    Export citation  
     
    Bookmark   8 citations  
  44. Attention and reinforcement learning: Constructing representations from indirect feedback.Fabián Canas & Matt Jones - 2010 - In S. Ohlsson & R. Catrambone (eds.), Proceedings of the 32nd Annual Conference of the Cognitive Science Society. Cognitive Science Society.
  45.  20
    Counterfactual state explanations for reinforcement learning agents via generative deep learning.Matthew L. Olson, Roli Khanna, Lawrence Neal, Fuxin Li & Weng-Keen Wong - 2021 - Artificial Intelligence 295 (C):103455.
  46.  33
    On Adaptation, Maximization, and Reinforcement Learning Among Cognitive Strategies.Ido Erev & Greg Barron - 2005 - Psychological Review 112 (4):912-931.
  47.  31
    Developing PFC Representations Using Reinforcement Learning.Jeremy R. Reynolds & Randall C. O’Reilly - 2009 - Cognition 113 (3):281-292.
  48. Supplementing neural reinforcement learning with symbolic methods possibilities and challenges.Ron Sun - unknown
    methods to improve reinforcement learning are identi ed and discussed in some detail Each demonstrates to some extent the advantages of combining RL and symbolic meth ods These methods point to the potentials and the chal lenges of this line of research..
     
    Export citation  
     
    Bookmark  
  49. Can model-free reinforcement learning explain deontological moral judgments?Alisabeth Ayars - 2016 - Cognition 150 (C):232-242.
  50.  5
    Multi-robot inverse reinforcement learning under occlusion with estimation of state transitions.Kenneth Bogert & Prashant Doshi - 2018 - Artificial Intelligence 263 (C):46-73.
    Direct download (2 more)  
     
    Export citation  
     
    Bookmark  
1 — 50 / 988