China Open Research Knowledge Cloud (CORC): Search


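Research institution

Xiamen University [199]

Institute of Automation [151]

Tsinghua University [90]

Peking University [86]

Geographic Sciences and Resources Research... [67]

Shanghai University of Finance and Economics [36]

Content type

Journal articles [702]

Theses [151]

Conference papers [116]

SCI/SSCI... [17]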

Subject

Computer S... [2]

Energy & F... [2]

Environmen... [2]

Computer S... [1]


Browse/search results: 1039 records in total, showing 1-10

Adaptive bias-variance trade-off in advantage estimator for actor-critic algorithms
	Journal article. NEURAL NETWORKS, 2024, vol. 169, pages 764-777
	Authors: Chen, Yurou; Zhang, Fengyi; Liu, Zhiyong
	Views/downloads: 2/0. Submitted: 2024/02/22
	Keywords: Reinforcement Learning Policy gradient Actor-critic Value function Bias-variance trade-off

Active Disparity Sampling for Stereo Matching With Adjoint Network
	Journal article. IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, vol. 33, pages 354-365
	Authors: Zhang, Chenghao; Meng, Gaofeng; Tian, Kun; Ni, Bolin; Xiang, Shiming
	Views/downloads: 4/0. Submitted: 2024/02/21
	Keywords: Stereo matching domain generalization active sampling adjoint network Bayesian optimization

Constrained Multi-Objective Optimization With Deep Reinforcement Learning Assisted Operator Selection
	Journal article. IEEE/CAA Journal of Automatica Sinica, 2024, vol. 11, no. 4, pages 919-931
	Authors: Fei Ming; Wenyin Gong; Ling Wang; Yaochu Jin
	Views/downloads: 0/0. Submitted: 2024/03/18
	Keywords: Constrained multi-objective optimization deep Q-learning deep reinforcement learning (DRL) evolutionary algorithms evolutionary operator selection

Adaptive Optimal Discrete-Time Output-Feedback Using an Internal Model Principle and Adaptive Dynamic Programming
	Journal article. IEEE/CAA Journal of Automatica Sinica, 2024, vol. 11, no. 1, pages 131-140
	Authors: Zhongyang Wang; Youqing Wang; Zdzisław Kowalczuk
	Views/downloads: 0/0. Submitted: 2024/01/02
	Keywords: Adaptive dynamic programming (ADP) internal model principle (IMP) output feedback problem policy iteration (PI) value iteration (VI)

Peer Incentive Reinforcement Learning for Cooperative Multiagent Games
	Journal article. IEEE TRANSACTIONS ON GAMES, 2023, vol. 15, no. 4, pages 623-636
	Authors: Zhang, Tianle; Liu, Zhen; Pu, Zhiqiang; Yi, Jianqiang
	Views/downloads: 1/0. Submitted: 2024/02/22
	Keywords: Cooperative multiagent games intrinsic reward multiagent reinforcement learning (MARL) Starcraft II Micromanagement

Multistep Look-Ahead Policy Iteration for Optimal Control of Discrete-Time Nonlinear Systems With Isoperimetric Constraints
	Journal article. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, pages 13
	Authors: Li, Tao; Wei, Qinglai; Wang, Fei-Yue
	Views/downloads: 0/0. Submitted: 2024/02/22
	Keywords: Performance analysis Optimal control Dynamic programming Iterative algorithms Upper bound Measurement Convergence Adaptive dynamic programming (ADP) isoperimetric constraints nonlinear systems optimal control policy iteration

Relay Hindsight Experience Replay: Self-guided continual reinforcement learning for sequential object manipulation tasks with sparse rewards
	Journal article. NEUROCOMPUTING, 2023, vol. 557
	Authors: Luo, Yongle; Wang, Yuxin; Dong, Kun; Zhang, Qiang; Cheng, Erkang
	Views/downloads: 9/0. Submitted: 2023/11/10
	Keywords: Deep reinforcement learning Robotic manipulation Continual learning Hindsight experience replay Sparse reward

Learning for Depth Control of a Robotic Penguin: A Data-Driven Model Predictive Control Approach
	Journal article. IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2023, vol. 70, no. 11, pages 11422-11432
	Authors: Pan, Jie; Zhang, Pengfei; Wang, Jian; Liu, Mingxin; Yu, Junzhi
	Views/downloads: 9/0. Submitted: 2023/11/17
	Keywords: Data-driven model predictive control (MPC) depth control motion control reinforcement learning (RL) robotic penguin

Tri-HGNN: Learning triple policies fused hierarchical graph neural networks for pedestrian trajectory prediction
	Journal article. PATTERN RECOGNITION, 2023, vol. 143, pages 11
	Authors: Zhu, Wenjun; Liu, Yanghong; Wang, Peng; Zhang, Mengyi; Wang, Tian
	Views/downloads: 8/0. Submitted: 2023/11/17
	Keywords: Trajectory prediction Hierarchical policy Graph neural networks

Data-efficient model-based reinforcement learning with trajectory discrimination
	Journal article. COMPLEX & INTELLIGENT SYSTEMS, 2023, pages 10
	Authors: Qu, Tuo; Duan, Fuqing; Zhang, Junge; Zhao, Bo; Huang, Wenzhen
	Views/downloads: 4/0. Submitted: 2023/11/16
	Keywords: Reinforcement learning Deep learning Continuous control task World model
