A loss-balanced multi-task model for simultaneous detection and segmentation | |
Zhang, Wenwen1,2; Wang, Kunfeng3; Wang, Yutong2,4; Yan, Lan2,5; Wang, Fei-Yue2,6 | |
刊名 | NEUROCOMPUTING |
2021-03-07 | |
卷号 | 428页码:65-78 |
关键词 | Object detection Semantic segmentation Multi-task learning Scene understanding |
ISSN号 | 0925-2312 |
DOI | 10.1016/j.neucom.2020.11.024 |
通讯作者 | Wang, Kunfeng(kunfeng.wang@ia.ac.cn) |
英文摘要 | Scene understanding comes in many flavors, two of the most popular being object detection and semantic segmentation, which act as two important aspects for scene understanding, and are applied to many areas, such as autonomous driving and intelligent surveillance. Although much progress has already been made, the two tasks of object detection and semantic segmentation are often investigated independently. In practice, scene understanding is complicated, and comprises many sub-tasks, so that research of learning multiple tasks simultaneously with a single model is feasible. With the interrelated goals of these two tasks, there is a strong motivation to improve the object detection accuracy with the help of semantic segmentation, and vice versa. In this paper, we propose a loss-balanced multi-task model for simultaneous object detection and semantic segmentation. We explore multi-task learning with sharing parameters based on deep learning to realize improved object detection and segmentation, and propose a single-stage deep architecture based on multi-task learning, jointly performing object detection and semantic segmentation to boost each other. With no more computation load in the inference compared with the baselines of SSD and FCN, we show that these two tasks, object detection and semantic segmentation, benefit from each other. Experimental results on Pascal VOC and COCO show that our method improves much in object detection and semantic segmentation compared with the corresponding baselines of both tasks. (c) 2020 Elsevier B.V. All rights reserved. |
资助项目 | National Key R&D Program of China[2020YFC2003900] ; Intel Collaborative Research Institute for Intelligent and Automated Connected Vehicles (ICRI-IACV) ; National Natural Science Foundation of China[62076020] ; Fundamental Research Funds for the Central Universities |
WOS关键词 | SEMANTIC SEGMENTATION ; OBJECT DETECTION |
WOS研究方向 | Computer Science |
语种 | 英语 |
出版者 | ELSEVIER |
WOS记录号 | WOS:000611057000007 |
资助机构 | National Key R&D Program of China ; Intel Collaborative Research Institute for Intelligent and Automated Connected Vehicles (ICRI-IACV) ; National Natural Science Foundation of China ; Fundamental Research Funds for the Central Universities |
内容类型 | 期刊论文 |
源URL | [http://ir.ia.ac.cn/handle/173211/43103] |
专题 | 自动化研究所_复杂系统管理与控制国家重点实验室_先进控制与自动化团队 |
通讯作者 | Wang, Kunfeng |
作者单位 | 1.Xi An Jiao Tong Univ, Sch Software Engn, Xian 710049, Peoples R China 2.Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China 3.Beijing Univ Chem Technol, Coll Informat Sci & Technol, Beijing 100029, Peoples R China 4.Univ Chinese Acad Sci, Beijing 100049, Peoples R China 5.Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing 100049, Peoples R China 6.Macau Univ Sci & Technol, Inst Syst Engn, Macau 999078, Peoples R China |
推荐引用方式 GB/T 7714 | Zhang, Wenwen,Wang, Kunfeng,Wang, Yutong,et al. A loss-balanced multi-task model for simultaneous detection and segmentation[J]. NEUROCOMPUTING,2021,428:65-78. |
APA | Zhang, Wenwen,Wang, Kunfeng,Wang, Yutong,Yan, Lan,&Wang, Fei-Yue.(2021).A loss-balanced multi-task model for simultaneous detection and segmentation.NEUROCOMPUTING,428,65-78. |
MLA | Zhang, Wenwen,et al."A loss-balanced multi-task model for simultaneous detection and segmentation".NEUROCOMPUTING 428(2021):65-78. |
个性服务 |
查看访问统计 |
相关权益政策 |
暂无数据 |
收藏/分享 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论