Cross-model retrieval with deep learning for business application
Wang, Yufei3; Wang, Huanting1,2; Yang, Jiating2; Chen, Jianbo1
2021-03-09
会议日期2020-11-14
会议地点Busan, Korea, Republic of
关键词Cross-modal retrieval Audio features Deep hashing Useful information
卷号1802
期号3
DOI10.1088/1742-6596/1802/3/032035
英文摘要

Cross-modal retravel has been used in many fields, such as business and search engines. Most search engines for business are text-based, but text-based search engines are limited by equipment and the strict requirement for knowledge. Text-based search needs keyboards to finish the search process, which requires users to have the knowledge of using keyboards. Compared to the text-based search, audio-based search has advantages. First, it avoids the traditional ways of inputting information. And it gets rid of the gap in time between inputting information for searching and getting useful information. In this paper, we propose a way to use audio to search images for business applications. We use deep learning to implement cross-modal retrieval systems between images and audio. We first extract features from images and audio respectively. And then we implement a neural network with two identical networks to learn the correspondence between images and audio. The first network extracts the features from images and audio further for calculation, and the second network learns whether two features from different modalities are related. This research provides a new way for business applications to search for information more instantly. © Published under licence by IOP Publishing Ltd.

产权排序2
会议录7th International Conference on Computer-Aided Design, Manufacturing, Modeling and Simulation, CDMMS 2020 - 2. Algorithm Design and Computational Science
会议录出版者IOP Publishing Ltd
语种英语
ISSN号17551307;17551315
内容类型会议论文
源URL[http://ir.opt.ac.cn/handle/181661/94577]  
专题西安光学精密机械研究所_光学影像学习与分析中心
通讯作者Yang, Jiating
作者单位1.University of Chinese Academy of Sciences, Beijing; 100049, China
2.Xian Institute of Optics and Precision Mechanics, Chinese Academy of Sciences, Xian, China
3.Simon Fraser University, 8888 University Dr, Bumaby; BC; V5A 1S6, Canada
推荐引用方式
GB/T 7714
Wang, Yufei,Wang, Huanting,Yang, Jiating,et al. Cross-model retrieval with deep learning for business application[C]. 见:. Busan, Korea, Republic of. 2020-11-14.
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace