×
验证码:
换一张
忘记密码?
记住我
CORC
首页
科研机构
检索
知识图谱
申请加入
托管服务
登录
注册
在结果中检索
科研机构
自动化研究所 [8]
清华大学 [1]
兰州理工大学 [1]
合肥物质科学研究院 [1]
内容类型
期刊论文 [10]
会议论文 [1]
发表日期
2023 [1]
2022 [2]
2021 [1]
2020 [1]
2019 [1]
2018 [2]
更多...
×
知识图谱
CORC
开始提交
已提交作品
待认领作品
已认领作品
未提交全文
收藏管理
QQ客服
官方微博
反馈留言
浏览/检索结果:
共11条,第1-10条
帮助
已选(
0
)
清除
条数/页:
5
10
15
20
25
30
35
40
45
50
55
60
65
70
75
80
85
90
95
100
排序方式:
请选择
作者升序
作者降序
题名升序
题名降序
发表日期升序
发表日期降序
提交时间升序
提交时间降序
Relay Hindsight Experience Replay: Self-guided continual reinforcement learning for sequential object manipulation tasks with sparse rewards
期刊论文
NEUROCOMPUTING, 2023, 卷号: 557
作者:
Luo, Yongle
;
Wang, Yuxin
;
Dong, Kun
;
Zhang, Qiang
;
Cheng, Erkang
收藏
  |  
浏览/下载:9/0
  |  
提交时间:2023/11/10
Deep reinforcement learning
Robotic manipulation
Continual learning
Hindsight experience replay
Sparse reward
Prioritized Experience Replay based on Multi-armed Bandit
期刊论文
EXPERT SYSTEMS WITH APPLICATIONS, 2022, 卷号: 189
作者:
Liu, Ximing
;
Zhu, Tianqing
;
Jiang, Cuiqing
;
Ye, Dayong
;
Zhao, Fuqing
收藏
  |  
浏览/下载:17/0
  |  
提交时间:2022/03/01
Deep reinforcement learning
Q-learning
Deep Q-network
Experience replay
Multi-armed Bandit
Barrier-Certified Learning-Enabled Safe Control Design for Systems Operating in Uncertain Environments
期刊论文
IEEE/CAA Journal of Automatica Sinica, 2022, 卷号: 9, 期号: 3, 页码: 437-449
作者:
Zahra Marvi
;
Bahare Kiumarsi
收藏
  |  
浏览/下载:39/0
  |  
提交时间:2022/03/09
Control barrier functions (CBFs)
experience replay
learning
safety-critical systems
uncertainty
A Novel Heterogeneous Actor-critic Algorithm with Recent Emphasizing Replay Memory
期刊论文
International Journal of Automation and Computing, 2021, 卷号: 18, 期号: 4, 页码: 619-631
作者:
Bao Xi
收藏
  |  
浏览/下载:15/0
  |  
提交时间:2021/07/20
Reinforcement learning (RL)
actor-critic
experience replay
training efficiency
manipulation skill learning
Path Planning for Intelligent Robots Based on Deep Q-learning With Experience Replay and Heuristic Knowledge
期刊论文
IEEE/CAA Journal of Automatica Sinica, 2020, 卷号: 7, 期号: 4, 页码: 1179-1189
作者:
Lan Jiang
收藏
  |  
浏览/下载:15/0
  |  
提交时间:2021/03/11
Deep Q-learning (DQL)
experience replay (ER)
heuristic knowledge (HK)
path planning
Adaptive cruise control via adaptive dynamic programming with experience replay
期刊论文
SOFT COMPUTING, 2019, 卷号: 23, 期号: 12, 页码: 4131-4144
作者:
Wang, Bin
;
Zhao, Dongbin
;
Cheng, Jin
收藏
  |  
浏览/下载:44/0
  |  
提交时间:2019/07/11
Adaptive cruise control
Adaptive dynamic programming
Experience replay
Reinforcement learning
Neural networks
Adaptive Q-Learning for Data-Based Optimal Output Regulation With Experience Replay
期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2018, 卷号: 48, 期号: 12, 页码: 3337-3348
作者:
Luo, Biao
;
Yang, Yin
;
Liu, Derong
收藏
  |  
浏览/下载:52/0
  |  
提交时间:2019/01/08
Data-based
experience replay
neural networks (NNs)
off-policy
optimal control
Q-learning (QL)
Comprehensive comparison of online ADP algorithms for continuous-time optimal control
期刊论文
ARTIFICIAL INTELLIGENCE REVIEW, 2018, 卷号: 49, 期号: 4, 页码: 531-547
作者:
Zhu, Yuanheng
;
Zhao, Dongbin
收藏
  |  
浏览/下载:15/0
  |  
提交时间:2017/09/13
Adaptive Dynamic Programming
Policy Iteration
Integral Reinforcement Learning
Experience Replay
Off-policy
Deep reinforcement learning with Experience Replay based on SARSA
会议论文
*, 2016-9
作者:
Zhao,Dongbin(赵冬斌)
;
Wang,Haitao
;
Shao,Kun
;
Zhu,Yuanheng
收藏
  |  
浏览/下载:18/0
  |  
提交时间:2018/01/04
Deep Learning
Reinforcement Learning
Experience Replay
q Learning
Sarsa Learning
Experience Replay for Optimal Control of Nonzero-Sum Game Systems With Unknown Dynamics
期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2016, 卷号: 46, 期号: 3, 页码: 854-865
作者:
Zhao, Dongbin
;
Zhang, Qichao
;
Wang, Ding
;
Zhu, Yuanheng
收藏
  |  
浏览/下载:16/0
  |  
提交时间:2016/06/14
Adaptive Dynamic Programming (Adp)
Experience Replay
Nonzero-sum (Nzs) Games
Optimal Control
Unknown Dynamics
©版权所有 ©2017 CSpace - Powered by
CSpace