Browse/search results: 1039 records in total, showing records 1-10
Adaptive bias-variance trade-off in advantage estimator for actor-critic algorithms
Journal article
NEURAL NETWORKS, 2024, Volume: 169, Pages: 764-777
Authors: Chen, Yurou; Zhang, Fengyi; Liu, Zhiyong
Views/Downloads: 2/0 | Submitted: 2024/02/22
Reinforcement Learning
Policy gradient
Actor-critic
Value function
Bias-variance trade-off
Active Disparity Sampling for Stereo Matching With Adjoint Network
Journal article
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, Volume: 33, Pages: 354-365
Authors: Zhang, Chenghao; Meng, Gaofeng; Tian, Kun; Ni, Bolin; Xiang, Shiming
Views/Downloads: 4/0 | Submitted: 2024/02/21
Stereo matching
domain generalization
active sampling
adjoint network
Bayesian optimization
Constrained Multi-Objective Optimization With Deep Reinforcement Learning Assisted Operator Selection
Journal article
IEEE/CAA Journal of Automatica Sinica, 2024, Volume: 11, Issue: 4, Pages: 919-931
Authors: Fei Ming; Wenyin Gong; Ling Wang; Yaochu Jin
Views/Downloads: 0/0 | Submitted: 2024/03/18
Constrained multi-objective optimization
deep Q-learning
deep reinforcement learning (DRL)
evolutionary algorithms
evolutionary operator selection
Adaptive Optimal Discrete-Time Output-Feedback Using an Internal Model Principle and Adaptive Dynamic Programming
Journal article
IEEE/CAA Journal of Automatica Sinica, 2024, Volume: 11, Issue: 1, Pages: 131-140
Authors: Zhongyang Wang; Youqing Wang; Zdzisław Kowalczuk
Views/Downloads: 0/0 | Submitted: 2024/01/02
Adaptive dynamic programming (ADP)
internal model principle (IMP)
output feedback problem
policy iteration (PI)
value iteration (VI)
Peer Incentive Reinforcement Learning for Cooperative Multiagent Games
Journal article
IEEE TRANSACTIONS ON GAMES, 2023, Volume: 15, Issue: 4, Pages: 623-636
Authors: Zhang, Tianle; Liu, Zhen; Pu, Zhiqiang; Yi, Jianqiang
Views/Downloads: 1/0 | Submitted: 2024/02/22
Cooperative multiagent games
intrinsic reward
multiagent reinforcement learning (MARL)
Starcraft II Micromanagement
Multistep Look-Ahead Policy Iteration for Optimal Control of Discrete-Time Nonlinear Systems With Isoperimetric Constraints
Journal article
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, Pages: 13
Authors: Li, Tao; Wei, Qinglai; Wang, Fei-Yue
Views/Downloads: 0/0 | Submitted: 2024/02/22
Performance analysis
Optimal control
Dynamic programming
Iterative algorithms
Upper bound
Measurement
Convergence
Adaptive dynamic programming (ADP)
isoperimetric constraints
nonlinear systems
optimal control
policy iteration
Relay Hindsight Experience Replay: Self-guided continual reinforcement learning for sequential object manipulation tasks with sparse rewards
Journal article
NEUROCOMPUTING, 2023, Volume: 557
Authors: Luo, Yongle; Wang, Yuxin; Dong, Kun; Zhang, Qiang; Cheng, Erkang
Views/Downloads: 9/0 | Submitted: 2023/11/10
Deep reinforcement learning
Robotic manipulation
Continual learning
Hindsight experience replay
Sparse reward
Learning for Depth Control of a Robotic Penguin: A Data-Driven Model Predictive Control Approach
Journal article
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2023, Volume: 70, Issue: 11, Pages: 11422-11432
Authors: Pan, Jie; Zhang, Pengfei; Wang, Jian; Liu, Mingxin; Yu, Junzhi
Views/Downloads: 9/0 | Submitted: 2023/11/17
Data-driven model predictive control (MPC)
depth control
motion control
reinforcement learning (RL)
robotic penguin
Tri-HGNN: Learning triple policies fused hierarchical graph neural networks for pedestrian trajectory prediction
Journal article
PATTERN RECOGNITION, 2023, Volume: 143, Pages: 11
Authors: Zhu, Wenjun; Liu, Yanghong; Wang, Peng; Zhang, Mengyi; Wang, Tian
Views/Downloads: 8/0 | Submitted: 2023/11/17
Trajectory prediction
Hierarchical policy
Graph neural networks
Data-efficient model-based reinforcement learning with trajectory discrimination
Journal article
COMPLEX & INTELLIGENT SYSTEMS, 2023, Pages: 10
Authors: Qu, Tuo; Duan, Fuqing; Zhang, Junge; Zhao, Bo; Huang, Wenzhen
Views/Downloads: 4/0 | Submitted: 2023/11/16
Reinforcement learning
Deep learning
Continuous control task
World model