Speech Enhancement Based on Analysis-Synthesis Framework with Improved Parameter Domain Enhancement

doi:10.1007/s11265-015-1025-1

CORC > 自动化研究所 > 中国科学院自动化研究所 > 模式识别国家重点实验室 > 人机语音交互团队

	Speech Enhancement Based on Analysis-Synthesis Framework with Improved Parameter Domain Enhancement
	Liu, Bin1 ; Tao, Jianhua1 ; Wen, Zhengqi1 ; Mo, Fuyuan 2; Bin Liu
刊名	JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY
	2016-02-01
卷号	82 期号:2 页码:141-150
关键词	Analysis-synthesis Framework Multi-band Summary Correlogram Denoising Autoencoder Speech Enhancement Speech Coding
DOI	10.1007/s11265-015-1025-1
文献子类	Article
英文摘要	This paper presents a speech enhancement approach based on analysis-synthesis framework. An improved multi-band summary correlogram (MBSC) algorithm is proposed for pitch estimation and voiced/unvoiced (V/UV) detection. The proposed pitch detection algorithm achieves a lower pitch detection error compared with the reference algorithm. The denoising autoencoder (DAE) is applied to enhance the line spectrum frequencies (LSFs). The reconstruction loss could be decreased compare with the swallow model. The proposed approach is evaluated using the perceptual evaluation of speech quality (PESQ) and the experimental results show that the proposed approach improves the performance of speech enhancement compared with the conventional speech enhancement approach. In addition, it could be applied to parametric speech coding even at low bit rate and low signal-noise ratio (SNR) environments.
WOS关键词	SPECTRAL AMPLITUDE ESTIMATOR ; ERROR ; NOISE ; ALGORITHM ; PRIORS
WOS研究方向	Computer Science ; Engineering
语种	英语
WOS记录号	WOS:000371299700002
资助机构	National High-Tech Research and Development Program of China(863 Program)(2015AA016305) ; National Natural Science Foundation of China (NSFC)(61425017 ; Major Program for the National Social Science Fund of China(13ZD 189) ; Integration and application of basic science data in Chinese information processing field(XXH12504-1-11) ; 61403386 ; 61305003 ; 61332017 ; 61375027 ; 61273288 ; 61233009 ; 61203258)
内容类型	期刊论文
源URL	[http://ir.ia.ac.cn/handle/173211/11357]
专题	自动化研究所_模式识别国家重点实验室_人机语音交互团队
通讯作者	Bin Liu
作者单位	1.Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China 2.Chinese Acad Sci, Inst Acoust, Beijing 100190, Peoples R China
推荐引用方式 GB/T 7714	Liu, Bin,Tao, Jianhua,Wen, Zhengqi,et al. Speech Enhancement Based on Analysis-Synthesis Framework with Improved Parameter Domain Enhancement[J]. JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY,2016,82(2):141-150.
APA	Liu, Bin,Tao, Jianhua,Wen, Zhengqi,Mo, Fuyuan,&Bin Liu.(2016).Speech Enhancement Based on Analysis-Synthesis Framework with Improved Parameter Domain Enhancement.JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY,82(2),141-150.
MLA	Liu, Bin,et al."Speech Enhancement Based on Analysis-Synthesis Framework with Improved Parameter Domain Enhancement".JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY 82.2(2016):141-150.