Towards More Flexible and Accurate Object Tracking with Natural Language: Algorithms and Benchmark
Wang, Xiao1; Shu, Xiujun1,6; Zhang, Zhipeng5; Jiang, Bo4; Wang, Yaowei1; Tian, Yonghong1,3; Wu, Feng1,2
2021
会议日期2021-7
会议地点Virtual
英文摘要

Tracking by natural language specification is a new rising research topic that aims at locating the target object in the video sequence based on its language description. Compared with traditional bounding box (BBox) based tracking, this setting guides object tracking with high-level seman- tic information, addresses the ambiguity of BBox, and links local and global search organically together. Those benefits may bring more flexible, robust and accurate tracking performance in practical scenarios. However, existing natural language initialized trackers are developed and compared on benchmark datasets proposed for tracking-by-BBox, which can’t reflect the true power of tracking-by-language. In this work, we propose a new benchmark specifically dedicated to the tracking-by language, including a large scale dataset, strong and diverse baseline methods. Specifically, we collect 2k video sequences (contains a total of 1,244,340 frames, 663 words) and split 1300/700 for the train/testing respectively. We densely annotate one sentence in English and corresponding bounding boxes of the target object for each video. We also introduce two new challenges into TNL2K for the object tracking task, i.e., adversarial samples and modality switch. A strong baseline method based on an adaptive local-global-search scheme is proposed for future works to compare. We believe this benchmark will greatly boost related researches on natural language guided tracking.

语种英语
内容类型会议论文
源URL[http://ir.ia.ac.cn/handle/173211/48572]  
专题自动化研究所_模式识别国家重点实验室_视频内容安全团队
作者单位1.Peng Cheng Laboratory
2.University of Science and Technology of China
3.Department of Computer Science and Technology, Peking University
4.School of Computer Science and Technology, Anhui University
5.National Laboratory of Pattern Recognition, Institute of Automation, Chinese Academy of Sciences
6.School of Electronic and Computer Engineering, Peking University
推荐引用方式
GB/T 7714
Wang, Xiao,Shu, Xiujun,Zhang, Zhipeng,et al. Towards More Flexible and Accurate Object Tracking with Natural Language: Algorithms and Benchmark[C]. 见:. Virtual. 2021-7.
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace