×
验证码:
换一张
忘记密码?
记住我
CORC
首页
科研机构
检索
知识图谱
申请加入
托管服务
登录
注册
在结果中检索
科研机构
自动化研究所 [11]
北京大学 [1]
内容类型
期刊论文 [8]
会议论文 [4]
发表日期
2022 [1]
2021 [2]
2018 [1]
2017 [2]
2016 [2]
2015 [2]
更多...
×
知识图谱
CORC
开始提交
已提交作品
待认领作品
已认领作品
未提交全文
收藏管理
QQ客服
官方微博
反馈留言
浏览/检索结果:
共12条,第1-10条
帮助
已选(
0
)
清除
条数/页:
5
10
15
20
25
30
35
40
45
50
55
60
65
70
75
80
85
90
95
100
排序方式:
请选择
作者升序
作者降序
题名升序
题名降序
发表日期升序
发表日期降序
提交时间升序
提交时间降序
Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games
期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 3, 页码: 1228-1241
作者:
Zhu, Yuanheng
;
Zhao, Dongbin
收藏
  |  
浏览/下载:14/0
  |  
提交时间:2022/06/10
Games
Nash equilibrium
Mathematical model
Markov processes
Convergence
Dynamic programming
Training
Deep reinforcement learning (DRL)
generalized policy iteration (GPI)
Markov game (MG)
Nash equilibrium
Q network
zero sum
Generalized Actor-Critic Learning Optimal Control in Smart Home Energy Management
期刊论文
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 卷号: 17, 期号: 10, 页码: 6614-6623
作者:
Wei, Qinglai
;
Liao, Zehua
;
Shi, Guang
收藏
  |  
浏览/下载:39/0
  |  
提交时间:2021/11/02
Optimal control
Process control
Smart homes
Dynamic programming
Numerical models
Iterative methods
Informatics
Actor-critic learning
adaptive critic designs
adaptive dynamic programming (ADP)
approximate dynamic programming
energy management
optimal control
smart grid
Discrete-Time Non-Zero-Sum Games With Completely Unknown Dynamics
期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2021, 卷号: 51, 期号: 6, 页码: 2929-2943
作者:
Song, Ruizhuo
;
Wei, Qinglai
;
Zhang, Huaguang
;
Lewis, Frank L.
收藏
  |  
浏览/下载:45/0
  |  
提交时间:2021/08/15
Adaptive critic designs
adaptive dynamic programming
approximate dynamic programming
discrete-time
nonzero-sum (NZS)
off-policy
reinforcement learning (RL)
Discrete-Time Stable Generalized Self-Learning Optimal Control With Approximation Errors
期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 卷号: 29, 期号: 4, 页码: 1226-1238
作者:
Wei, Qinglai
;
Li, Benkai
;
Song, Ruizhuo
收藏
  |  
浏览/下载:29/0
  |  
提交时间:2017/02/23
Adaptive Critic Designs
Adaptive Dynamic Programming (Adp)
Approximate Dynamic Programming
Generalized Policy Iteration (Gpi)
Neural Networks
Neurodynamic Programming
Nonlinear Systems
Optimal Control
Reinforcement Learning
Optimal control for discrete-time systems with actuator saturation
期刊论文
OPTIMAL CONTROL APPLICATIONS & METHODS, 2017, 卷号: 38, 期号: 6, 页码: 1071-1080
作者:
Lin, Qiao
;
Wei, Qinglai
;
Zhao, Bo
收藏
  |  
浏览/下载:14/0
  |  
提交时间:2017/05/04
Approximate Dynamic Programming
Discrete Time
Generalized Policy Iteration
Optimal Control
Saturating Actuators
Optimal Control for Discrete-Time Nonlinear Systems with Actuator Saturation Based on Generalized Policy Iteration Adaptive Dynamic Programming Algorithm
会议论文
Sapporo, Hokkaido, Japan, 2017-6-21
作者:
Lin Q(林桥)
;
Qinglai Wei
;
Bo Zhao
收藏
  |  
浏览/下载:15/0
  |  
提交时间:2017/05/04
Generalized Policy Iteration Adaptive Dynamic Programming Algorithm for Optimal Tracking Control of a Class of Nonlinear Systems
会议论文
Yinchuan, China, 2016-5-28
作者:
Lin Q(林桥)
;
Qinglai Wei
;
Derong Liu
收藏
  |  
浏览/下载:9/0
  |  
提交时间:2017/05/04
Neuro-optimal tracking control for a class of discrete-time nonlinear systems via generalized value iteration adaptive dynamic programming approach
期刊论文
SOFT COMPUTING, 2016, 卷号: 20, 期号: 2, 页码: 697-706
作者:
Wei, Qinglai
;
Liu, Derong
;
Xu, Yancai
;
Qinglai Wei
收藏
  |  
浏览/下载:10/0
  |  
提交时间:2016/06/14
Adaptive Dynamic Programming
Approximate Dynamic Programming
Adaptive Critic Designs
Optimal Control
Neural Networks
Nonlinear Systems
Reinforcement Learning
Generalized Policy Iteration Adaptive Dynamic Programming for Discrete-Time Nonlinear Systems
期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2015, 卷号: 45, 期号: 12, 页码: 1577-1591
作者:
Liu, Derong
;
Wei, Qinglai
;
Yan, Pengfei
收藏
  |  
浏览/下载:43/0
  |  
提交时间:2016/03/19
Adaptive critic designs
adaptive dynamic programming (ADP)
approximate dynamic programming
generalized policy iteration
neural networks
neuro-dynamic programming
nonlinear systems
optimal control
reinforcement learning
Infinite Horizon Self-Learning Optimal Control of Nonaffine Discrete-Time Nonlinear Systems
期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2015, 卷号: 26, 期号: 4, 页码: 866-879
作者:
Wei, Qinglai
;
Liu, Derong
;
Yang, Xiong
收藏
  |  
浏览/下载:21/0
  |  
提交时间:2015/09/21
Adaptive critic designs
adaptive dynamic programming (ADP)
approximate dynamic programming
generalized policy iteration
neural networks (NNs)
neurodynamic programming
nonlinear systems
optimal control
reinforcement learning
©版权所有 ©2017 CSpace - Powered by
CSpace