Speech Enhancement Based on Analysis-Synthesis Framework with Improved Parameter Domain Enhancement | |
Liu, Bin1; Tao, Jianhua1; Wen, Zhengqi1; Mo, Fuyuan2; Bin Liu | |
Journal | JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY |
2016-02-01 | |
Volume | 82, Issue: 2, Pages: 141-150 |
Keywords | Analysis-synthesis Framework ; Multi-band Summary Correlogram ; Denoising Autoencoder ; Speech Enhancement ; Speech Coding |
DOI | 10.1007/s11265-015-1025-1 |
Document subtype | Article |
English abstract | This paper presents a speech enhancement approach based on an analysis-synthesis framework. An improved multi-band summary correlogram (MBSC) algorithm is proposed for pitch estimation and voiced/unvoiced (V/UV) detection. The proposed pitch detection algorithm achieves a lower pitch detection error than the reference algorithm. A denoising autoencoder (DAE) is applied to enhance the line spectrum frequencies (LSFs); its reconstruction loss can be decreased compared with the shallow model. The proposed approach is evaluated using the perceptual evaluation of speech quality (PESQ), and the experimental results show that it improves speech enhancement performance compared with the conventional speech enhancement approach. In addition, it could be applied to parametric speech coding even at low bit rates and in low signal-to-noise ratio (SNR) environments. |
WOS keywords | SPECTRAL AMPLITUDE ESTIMATOR ; ERROR ; NOISE ; ALGORITHM ; PRIORS |
WOS research areas | Computer Science ; Engineering |
Language | English |
WOS accession number | WOS:000371299700002 |
Funding agencies | National High-Tech Research and Development Program of China (863 Program)(2015AA016305) ; National Natural Science Foundation of China (NSFC)(61425017 ; 61403386 ; 61305003 ; 61332017 ; 61375027 ; 61273288 ; 61233009 ; 61203258) ; Major Program for the National Social Science Fund of China(13ZD 189) ; Integration and application of basic science data in Chinese information processing field(XXH12504-1-11) |
Content type | Journal article |
Source URL | [http://ir.ia.ac.cn/handle/173211/11357] |
Collection | Institute of Automation_National Laboratory of Pattern Recognition_Human-Computer Speech Interaction Team |
Corresponding author | Bin Liu |
Author affiliations | 1. Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China 2. Chinese Acad Sci, Inst Acoust, Beijing 100190, Peoples R China |
Recommended citation GB/T 7714 | Liu, Bin,Tao, Jianhua,Wen, Zhengqi,et al. Speech Enhancement Based on Analysis-Synthesis Framework with Improved Parameter Domain Enhancement[J]. JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY,2016,82(2):141-150. |
APA | Liu, Bin,Tao, Jianhua,Wen, Zhengqi,Mo, Fuyuan,&Bin Liu.(2016).Speech Enhancement Based on Analysis-Synthesis Framework with Improved Parameter Domain Enhancement.JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY,82(2),141-150. |
MLA | Liu, Bin,et al."Speech Enhancement Based on Analysis-Synthesis Framework with Improved Parameter Domain Enhancement".JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY 82.2(2016):141-150. |