版权说明 操作指南
首页 > 成果 > 详情

Relative Entropy of Correct Proximal Policy Optimization Algorithms with Modified Penalty Factor in Complex Environment

认领
导出
Link by DOI
反馈
分享
QQ微信 微博
成果类型:
期刊论文
作者:
Chen, Weimin;Wong, Kelvin Kian Loong;Long, Sifan;Sun, Zhili
通讯作者:
Wong, KKL
作者机构:
[Chen, Weimin; Wong, Kelvin Kian Loong] Hunan City Univ, Sch Informat & Elect, Yiyang 413000, Peoples R China.
[Long, Sifan] Cent South Univ, Sch Comp Sci & Engn, Changsha 410075, Peoples R China.
[Long, Sifan] Natl Univ Def Technol, Sch Comp Sci, Changsha 410073, Peoples R China.
[Sun, Zhili] Univ Surrey, 5G & 6G Innovat Ctr, Inst Commun Syst, Dept Elect & Elect Engn, Guildford GU2 7XH, Surrey, England.
通讯机构:
[Wong, KKL ] H
Hunan City Univ, Sch Informat & Elect, Yiyang 413000, Peoples R China.
语种:
英文
关键词:
correct proximal policy optimization;approximation theory;reinforcement learning;optimization;policy gradient;entropy
期刊:
Entropy
ISSN:
1099-4300
年:
2022
卷:
24
期:
4
页码:
440-
基金类别:
Data curation, S.L.; Funding acquisition, W.C.; Methodology, K.K.L.W.; Resources, Z.S.; Writing—review & editing, W.C. All authors have read and agreed to the published version of the manuscript. This work was supported in part by the National Natural Science Foundation of China under Grant 61672540; in part by the Project of General University Teaching Reform Research in Hunan Province under Grant [2017]352.
机构署名:
本校为第一且通讯机构
摘要:
In the field of reinforcement learning, we propose a Correct Proximal Policy Optimization (CPPO) algorithm based on the modified penalty factor β and relative entropy in order to solve the robustness and stationarity of traditional algorithms. Firstly, In the process of reinforcement learning, this paper establishes a strategy evaluation mechanism through the policy distribution function. Secondly, the state space function is quantified by introducing entropy, whereby the approximation policy is used to approximate the real policy distribution...

反馈

验证码:
看不清楚,换一个
确定
取消

成果认领

标题:
用户 作者 通讯作者
请选择
请选择
确定
取消

提示

该栏目需要登录且有访问权限才可以访问

如果您有访问权限,请直接 登录访问

如果您没有访问权限,请联系管理员申请开通

管理员联系邮箱:yun@hnwdkj.com