A loss-balanced multi-task model for simultaneous detection and segmentation
Zhang, Wenwen1,2; Wang, Kunfeng3; Wang, Yutong2,4; Yan, Lan2,5; Wang, Fei-Yue2,6
刊名NEUROCOMPUTING
2021-03-07
卷号428页码:65-78
关键词Object detection Semantic segmentation Multi-task learning Scene understanding
ISSN号0925-2312
DOI10.1016/j.neucom.2020.11.024
通讯作者Wang, Kunfeng(kunfeng.wang@ia.ac.cn)
英文摘要Scene understanding comes in many flavors, two of the most popular being object detection and semantic segmentation, which act as two important aspects for scene understanding, and are applied to many areas, such as autonomous driving and intelligent surveillance. Although much progress has already been made, the two tasks of object detection and semantic segmentation are often investigated independently. In practice, scene understanding is complicated, and comprises many sub-tasks, so that research of learning multiple tasks simultaneously with a single model is feasible. With the interrelated goals of these two tasks, there is a strong motivation to improve the object detection accuracy with the help of semantic segmentation, and vice versa. In this paper, we propose a loss-balanced multi-task model for simultaneous object detection and semantic segmentation. We explore multi-task learning with sharing parameters based on deep learning to realize improved object detection and segmentation, and propose a single-stage deep architecture based on multi-task learning, jointly performing object detection and semantic segmentation to boost each other. With no more computation load in the inference compared with the baselines of SSD and FCN, we show that these two tasks, object detection and semantic segmentation, benefit from each other. Experimental results on Pascal VOC and COCO show that our method improves much in object detection and semantic segmentation compared with the corresponding baselines of both tasks. (c) 2020 Elsevier B.V. All rights reserved.
资助项目National Key R&D Program of China[2020YFC2003900] ; Intel Collaborative Research Institute for Intelligent and Automated Connected Vehicles (ICRI-IACV) ; National Natural Science Foundation of China[62076020] ; Fundamental Research Funds for the Central Universities
WOS关键词SEMANTIC SEGMENTATION ; OBJECT DETECTION
WOS研究方向Computer Science
语种英语
出版者ELSEVIER
WOS记录号WOS:000611057000007
资助机构National Key R&D Program of China ; Intel Collaborative Research Institute for Intelligent and Automated Connected Vehicles (ICRI-IACV) ; National Natural Science Foundation of China ; Fundamental Research Funds for the Central Universities
内容类型期刊论文
源URL[http://ir.ia.ac.cn/handle/173211/43103]  
专题自动化研究所_复杂系统管理与控制国家重点实验室_先进控制与自动化团队
通讯作者Wang, Kunfeng
作者单位1.Xi An Jiao Tong Univ, Sch Software Engn, Xian 710049, Peoples R China
2.Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
3.Beijing Univ Chem Technol, Coll Informat Sci & Technol, Beijing 100029, Peoples R China
4.Univ Chinese Acad Sci, Beijing 100049, Peoples R China
5.Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China
6.Macau Univ Sci & Technol, Inst Syst Engn, Macau 999078, Peoples R China
推荐引用方式
GB/T 7714
Zhang, Wenwen,Wang, Kunfeng,Wang, Yutong,et al. A loss-balanced multi-task model for simultaneous detection and segmentation[J]. NEUROCOMPUTING,2021,428:65-78.
APA Zhang, Wenwen,Wang, Kunfeng,Wang, Yutong,Yan, Lan,&Wang, Fei-Yue.(2021).A loss-balanced multi-task model for simultaneous detection and segmentation.NEUROCOMPUTING,428,65-78.
MLA Zhang, Wenwen,et al."A loss-balanced multi-task model for simultaneous detection and segmentation".NEUROCOMPUTING 428(2021):65-78.
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace