Restricted gradient-descent algorithm for value-function approximation in reinforcement learning

Artificial Intelligence 172 (4-5):454-482 (2008)

Abstract

This article has no associated abstract.

Similar books and articles

An improved approximation algorithm for maximin shares. Jugal Garg & Setareh Taki - 2021 - Artificial Intelligence 300 (C):103547.
A Profit Sharing method that does not cling to experience. Ueno Atsushi & Uemura Wataru - 2006 - Transactions of the Japanese Society for Artificial Intelligence 21:81-93.
Q-learning with dynamic generation of the search space by GA. Matsuno Fumitoshi & Ito Kazuyuki - 2001 - Transactions of the Japanese Society for Artificial Intelligence 16:510-520.
The delay-of-reinforcement gradient in maze learning. J. P. Seward - 1942 - Journal of Experimental Psychology 30 (6):464.
