
Restricted gradient-descent algorithm for value-function approximation in reinforcement learning

André da Motta Salles Barreto & Charles W. Anderson

Artificial Intelligence 172 (4-5):454-482 (2008)

Abstract

This article has no associated abstract.


Categories

Science, Logic, and Mathematics

DOI

10.1016/j.artint.2007.08.001


Similar books and articles

Pessimistic value iteration for multi-task data sharing in Offline Reinforcement Learning. Chenjia Bai, Lingxiao Wang, Jianye Hao, Zhuoran Yang, Bin Zhao, Zhen Wang & Xuelong Li - 2024 - Artificial Intelligence 326 (C):104048.

Averaged Soft Actor-Critic for Deep Reinforcement Learning. Feng Ding, Guanfeng Ma, Zhikui Chen, Jing Gao & Peng Li - 2021 - Complexity 2021:1-16.

An improved approximation algorithm for maximin shares. Jugal Garg & Setareh Taki - 2021 - Artificial Intelligence 300 (C):103547.

A Profit Sharing method that does not cling to experience. Ueno Atsushi Uemura Wataru - 2006 - Transactions of the Japanese Society for Artificial Intelligence 21:81-93.

An optimal approximation algorithm for Bayesian inference. Paul Dagum & Michael Luby - 1997 - Artificial Intelligence 93 (1-2):1-27.

Reinforcement learning with limited reinforcement: Using Bayes risk for active learning in POMDPs. Finale Doshi-Velez, Joelle Pineau & Nicholas Roy - 2012 - Artificial Intelligence 187-188 (C):115-132.

Deep Reinforcement Learning as Foundation for Artificial General Intelligence. Itamar Arel - 2012 - In Pei Wang & Ben Goertzel (eds.), Theoretical Foundations of Artificial General Intelligence. Springer. pp. 89-102.

Learning reward machines: A study in partially observable reinforcement learning. Rodrigo Toro Icarte, Toryn Q. Klassen, Richard Valenzano, Margarita P. Castro, Ethan Waldie & Sheila A. McIlraith - 2023 - Artificial Intelligence 323 (C):103989.

Q-learning with dynamic generation of the search space by a GA. Matsuno Fumitoshi Ito Kazuyuki - 2001 - Transactions of the Japanese Society for Artificial Intelligence 16:510-520.

The delay-of-reinforcement gradient in maze learning. J. P. Seward - 1942 - Journal of Experimental Psychology 30 (6):464.


Citations of this work

No citations found.


References found in this work

Robot shaping: developing autonomous agents through learning. Marco Dorigo & Marco Colombetti - 1994 - Artificial Intelligence 71 (2):321-370.
