Pessimistic value iteration for multi-task data sharing in Offline Reinforcement Learning

Chenjia Bai, Lingxiao Wang, Jianye Hao, Zhuoran Yang, Bin Zhao, Zhen Wang & Xuelong Li

Artificial Intelligence 326 (C):104048 (2024)

Categories

Science, Logic, and Mathematics

DOI

10.1016/j.artint.2023.104048

Similar books and articles

Self segmentation of sequences. Ron Sun - unknown

Bidding in Reinforcement Learning: A Paradigm for Multi-Agent Systems. Chad Sessions - unknown

Q(st at):= (I - o')Q(st at) + o'(r(st+1). Ron Sun - unknown

Profit Sharing 法における強化関数に関する一考察 [A study of the reinforcement function in the Profit Sharing method]. Tatsumi Shoji Uemura Wataru - 2004 - Transactions of the Japanese Society for Artificial Intelligence 19:197-203.

Number of common elements and consistency of reinforcement in a discrimination learning task. Robert Stanton French - 1953 - Journal of Experimental Psychology 45 (1):25.

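尤度情報に基づく温度分布を用いた強化学習法 [A reinforcement learning method using a temperature distribution based on likelihood information]. 鈴木健嗣小堀訓成 - 2005 - Transactions of the Japanese Society for Artificial Intelligence 20:297-305.

強化学習エージェントへの階層化意志決定法の導入―追跡問題を例に― [Introducing a hierarchical decision-making method into reinforcement learning agents: the pursuit problem as an example]. 輿石尚宏謙吾片山 - 2004 - Transactions of the Japanese Society for Artificial Intelligence 19:279-291.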

Reinforcement learning: A brief guide for philosophers of mind. Julia Haas - 2022 - Philosophy Compass 17 (9):e12865.

SAwSu: An Integrated Model of Associative and Reinforcement Learning. Vladislav D. Veksler, Christopher W. Myers & Kevin A. Gluck - 2014 - Cognitive Science 38 (3):580-598.

Multi-Agent Reinforcement Learning: Weighting and Partitioning. Ron Sun & Todd Peterson - unknown

Automatic Partitioning for Multi-Agent Reinforcement Learning. Ron Sun - unknown

Safe multi-agent reinforcement learning for multi-robot control. Shangding Gu, Jakub Grudzien Kuba, Yuanpei Chen, Yali Du, Long Yang, Alois Knoll & Yaodong Yang - 2023 - Artificial Intelligence 319 (C):103905.

The Role of Basal Ganglia Reinforcement Learning in Lexical Ambiguity Resolution. Jose M. Ceballos, Andrea Stocco & Chantel S. Prat - 2020 - Topics in Cognitive Science 12 (1):402-416.

An evolutionary game theoretic perspective on learning in multi-agent systems. Karl Tuyls, Ann Nowe, Tom Lenaerts & Bernard Manderick - 2004 - Synthese 139 (2):297-330.

経験に固執しない Profit Sharing 法 [A Profit Sharing method that does not cling to experience]. Ueno Atsushi Uemura Wataru - 2006 - Transactions of the Japanese Society for Artificial Intelligence 21:81-93.