Speech Enhancement Based on Analysis-Synthesis Framework with Improved Parameter Domain Enhancement
Liu, Bin1; Tao, Jianhua1; Wen, Zhengqi1; Mo, Fuyuan2; Bin Liu
刊名JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY
2016-02-01
卷号82期号:2页码:141-150
关键词Analysis-synthesis Framework Multi-band Summary Correlogram Denoising Autoencoder Speech Enhancement Speech Coding
DOI10.1007/s11265-015-1025-1
文献子类Article
英文摘要This paper presents a speech enhancement approach based on analysis-synthesis framework. An improved multi-band summary correlogram (MBSC) algorithm is proposed for pitch estimation and voiced/unvoiced (V/UV) detection. The proposed pitch detection algorithm achieves a lower pitch detection error compared with the reference algorithm. The denoising autoencoder (DAE) is applied to enhance the line spectrum frequencies (LSFs). The reconstruction loss could be decreased compare with the swallow model. The proposed approach is evaluated using the perceptual evaluation of speech quality (PESQ) and the experimental results show that the proposed approach improves the performance of speech enhancement compared with the conventional speech enhancement approach. In addition, it could be applied to parametric speech coding even at low bit rate and low signal-noise ratio (SNR) environments.
WOS关键词SPECTRAL AMPLITUDE ESTIMATOR ; ERROR ; NOISE ; ALGORITHM ; PRIORS
WOS研究方向Computer Science ; Engineering
语种英语
WOS记录号WOS:000371299700002
资助机构National High-Tech Research and Development Program of China(863 Program)(2015AA016305) ; National Natural Science Foundation of China (NSFC)(61425017 ; Major Program for the National Social Science Fund of China(13ZD 189) ; Integration and application of basic science data in Chinese information processing field(XXH12504-1-11) ; 61403386 ; 61305003 ; 61332017 ; 61375027 ; 61273288 ; 61233009 ; 61203258)
内容类型期刊论文
源URL[http://ir.ia.ac.cn/handle/173211/11357]  
专题自动化研究所_模式识别国家重点实验室_人机语音交互团队
通讯作者Bin Liu
作者单位1.Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
2.Chinese Acad Sci, Inst Acoust, Beijing 100190, Peoples R China
推荐引用方式
GB/T 7714
Liu, Bin,Tao, Jianhua,Wen, Zhengqi,et al. Speech Enhancement Based on Analysis-Synthesis Framework with Improved Parameter Domain Enhancement[J]. JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY,2016,82(2):141-150.
APA Liu, Bin,Tao, Jianhua,Wen, Zhengqi,Mo, Fuyuan,&Bin Liu.(2016).Speech Enhancement Based on Analysis-Synthesis Framework with Improved Parameter Domain Enhancement.JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY,82(2),141-150.
MLA Liu, Bin,et al."Speech Enhancement Based on Analysis-Synthesis Framework with Improved Parameter Domain Enhancement".JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY 82.2(2016):141-150.
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace