Averaged Soft Actor-Critic for Deep Reinforcement Learning

Complexity 2021:1-16 (2021)

Abstract

With the advent of the era of artificial intelligence, deep reinforcement learning (DRL) has achieved unprecedented success in high-dimensional, large-scale tasks. However, the insecurity and instability of DRL algorithms have a significant impact on their performance. The Soft Actor-Critic (SAC) algorithm updates its policy and value networks within the maximum-entropy framework, which alleviates some of these problems, but SAC still suffers from overestimation error. To reduce the error caused by this overestimation, we propose a new variant of SAC called Averaged-SAC. By averaging previously learned state-action value estimates, it mitigates the overestimation problem of soft Q-learning, leading to a more stable training process and improved performance. We evaluate Averaged-SAC on several continuous-control tasks in the MuJoCo environment. The experimental results show that Averaged-SAC effectively improves both the performance of SAC and the stability of its training process.
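
The averaging step described in the abstract can be sketched briefly. Below is a minimal, illustrative Python sketch of one plausible reading of the idea, by analogy with Averaged-DQN: the soft Bellman target is built from the mean of the K most recently learned Q estimates rather than from a single estimate. The function names, hyperparameter values, and snapshot mechanism are assumptions made for illustration, not the paper's implementation.

import numpy as np

GAMMA = 0.99   # discount factor
ALPHA = 0.2    # SAC entropy temperature (assumed fixed here)

def averaged_soft_target(q_snapshots, next_state, next_action, next_log_pi, reward, done):
    # q_snapshots : list of callables, each a previously learned Q(s, a) estimator
    # next_action : action sampled from the current policy at next_state
    # next_log_pi : log-probability of that action under the current policy
    # Averaging over previously learned estimates is what damps overestimation.
    q_avg = np.mean([q(next_state, next_action) for q in q_snapshots], axis=0)
    soft_value = q_avg - ALPHA * next_log_pi           # soft value of the next state
    return reward + GAMMA * (1.0 - done) * soft_value  # soft Bellman backup

# Tiny usage example with dummy linear Q estimators standing in for past networks.
rng = np.random.default_rng(0)
snapshots = [(lambda s, a, w=rng.normal(size=3): float(w @ np.concatenate([s, a])))
             for _ in range(5)]
target = averaged_soft_target(snapshots,
                              next_state=np.array([0.1, -0.2]),
                              next_action=np.array([0.3]),
                              next_log_pi=-1.1, reward=1.0, done=0.0)
print(target)

In a full agent, the snapshots would be copies of the target Q-network saved at earlier training steps, and the averaged target would replace the single-network target in the soft Q-learning update; that detail is an assumption here.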


Analytics

Added to PP
2021-04-02


Author Profiles

Jing Gao
Lanzhou University
Li Peng
Abilene Christian University
