MFCC-based perceptual hashing for compressed domain of speech content identification

	Zhang, Qiu-Yu 1; Liu, Yang-Wei 1; Di, Yan-Jun 1; Zhang, Qian-Yun 2; Xing, Peng-Fei 1
Journal	Journal of Chemical and Pharmaceutical Research
	2014
Volume	6 Issue: 7 Pages: 379-386
Keywords	Algorithms; Internet; Robustness (control systems); Speech; Speech recognition; Auditory feature; Compressed domain; MDCT coefficients; Mel frequency cepstral coefficient; MFCC features; Perceptual hashing; Speech content; Tampering localizations
Abstract (English)	Current research on speech content identification primarily targets raw wideband speech signals, whereas speech is generally transmitted in a compressed format, so existing methods cannot meet the demand for speech content identification in the compressed domain. This paper proposes a new speech perceptual hashing algorithm for compressed-domain speech content identification based on MFCC (Mel Frequency Cepstral Coefficients), to address real-time speech content identification and the large volume of voice messages on the mobile Internet. The algorithm extracts MFCC features following the raw wideband method. The process begins by extracting the MDCT coefficients, which are intermediate decoding results of speech compressed in MP3 format. These coefficients are converted to MFCC parameters, and binary hash values are then generated from these parameters in combination with human auditory features. Because it operates on highly compressed data, the algorithm identifies speech content quickly. Experimental results show that the proposed algorithm can localize tampering and improves efficiency by 5% compared with raw wideband algorithms, while maintaining robustness and discrimination. © 2014, Journal of Chemical and Pharmaceutical Research. All rights reserved.
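The pipeline summarized in the abstract (MDCT coefficients, then MFCC parameters, then binary hash values) might be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: it assumes the MDCT coefficients have already been obtained from a partial MP3 decode (576 per frame here), and the function names, filter count, cepstral order, and the median-threshold bit rule are all hypothetical choices, since the record does not specify them. The human auditory weighting mentioned in the abstract is not modeled.

```python
import math
import random
import statistics

def hz_to_mel(f):
    return 2595.0 * math.log10(1.0 + f / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (m / 2595.0) - 1.0)

def mel_filterbank(n_filters, n_bins, sample_rate):
    """Triangular mel filters sampled at the MDCT bin centre frequencies."""
    nyquist = sample_rate / 2.0
    top = hz_to_mel(nyquist)
    # n_filters + 2 edge frequencies, equally spaced on the mel scale
    edges = [mel_to_hz(top * i / (n_filters + 1)) for i in range(n_filters + 2)]
    centres = [(k + 0.5) * nyquist / n_bins for k in range(n_bins)]
    bank = []
    for i in range(n_filters):
        lo, mid, hi = edges[i], edges[i + 1], edges[i + 2]
        bank.append([max(0.0, min((f - lo) / (mid - lo), (hi - f) / (hi - mid)))
                     for f in centres])
    return bank

def mfcc_from_mdct(frames, sample_rate=44100, n_filters=24, n_ceps=12):
    """Map each frame of MDCT coefficients to n_ceps cepstral coefficients."""
    bank = mel_filterbank(n_filters, len(frames[0]), sample_rate)
    result = []
    for frame in frames:
        power = [c * c for c in frame]
        # Log energy per mel band; the small offset avoids log(0)
        log_e = [math.log(sum(w * p for w, p in zip(filt, power)) + 1e-10)
                 for filt in bank]
        # DCT-II across the log filterbank energies
        result.append([sum(e * math.cos(math.pi * k * (n + 0.5) / n_filters)
                           for n, e in enumerate(log_e))
                       for k in range(n_ceps)])
    return result

def binary_hash(mfcc):
    """Assumed rule: one bit per frame, set when the frame feature exceeds the median."""
    feature = [sum(coeffs) for coeffs in mfcc]
    median = statistics.median(feature)
    return [1 if f > median else 0 for f in feature]

def bit_error_rate(h1, h2):
    """Fraction of differing bits between two equal-length hashes."""
    return sum(a != b for a, b in zip(h1, h2)) / len(h1)

# Synthetic stand-in for decoded MDCT frames (32 frames of 576 coefficients)
rng = random.Random(0)
frames = [[rng.gauss(0.0, 1.0) for _ in range(576)] for _ in range(32)]
h = binary_hash(mfcc_from_mdct(frames))
print(h, bit_error_rate(h, h))
```

Because each frame contributes its own bit, comparing two hashes position by position indicates which frames differ, which is one plausible way to read the tampering localization claim.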
Language	English
Publisher	Journal of Chemical and Pharmaceutical Research, 3/668 Malviya Nagar, Jaipur, Rajasthan, India
Content type	Journal article
Source URL	[http://ir.lut.edu.cn/handle/2XXMBERH/113617]
Collection	School of Computer and Communication
Author affiliations	1.School of Computer and Communication, Lanzhou University of Technology, Lanzhou, China; 2.School of Communication and Information Engineering, Shanghai University, Shanghai, China
Recommended citation (GB/T 7714)	Zhang, Qiu-Yu,Liu, Yang-Wei,Di, Yan-Jun,et al. MFCC-based perceptual hashing for compressed domain of speech content identification[J]. Journal of Chemical and Pharmaceutical Research,2014,6(7):379-386.
APA	Zhang, Qiu-Yu,Liu, Yang-Wei,Di, Yan-Jun,Zhang, Qian-Yun,&Xing, Peng-Fei.(2014).MFCC-based perceptual hashing for compressed domain of speech content identification.Journal of Chemical and Pharmaceutical Research,6(7),379-386.
MLA	Zhang, Qiu-Yu,et al."MFCC-based perceptual hashing for compressed domain of speech content identification".Journal of Chemical and Pharmaceutical Research 6.7(2014):379-386.