Aggregated Multi-Deep Deterministic Policy Gradient for Self-Driving Policy

	Aggregated Multi-Deep Deterministic Policy Gradient for Self-Driving Policy
	Junta Wu; Huiyun Li
	2018
会议日期	2018
会议地点	法国巴黎
英文摘要	Self-driving is a significant application of deep reinforcement learning. We present a deep reinforcement learning algorithm for control policies of self-driving vehicles. This method aggregates multiple sub-policies based on the deep deterministic policy gradient algorithm and centralized experience replays. The aggregated policy converges to the optimal policy by aggregating those sub-optimal sub-policies. It helps reduce the training time largely since each sub-policy is trained with less time. Experimental results on the open racing car simulator platform demonstrates that the proposed algorithm is able to successfully learn control policies, with a good generalization performance. This method outperforms the deep deterministic policy gradient algorithm with 56.7% less training time.
内容类型	会议论文
源URL	[http://ir.siat.ac.cn:8080/handle/172644/13743]
专题	深圳先进技术研究院_集成所
推荐引用方式 GB/T 7714	Junta Wu,Huiyun Li. Aggregated Multi-Deep Deterministic Policy Gradient for Self-Driving Policy[C]. 见:. 法国巴黎. 2018.

个性服务

查看访问统计

相关权益政策

暂无数据

收藏/分享

所有评论 (0)

暂无评论

评注功能仅针对注册用户开放，请您登录

您在知识库使用过程中有什么好的想法或者建议可以反馈给我们。
标题：	*
内容：
Email：	*
验证码：	刷新

相关链接