CORC  > 清华大学
基于交叉熵顺序统计滤波的语音端点检测算法
钱彦旻 ; 刘加 ; QIAN Yanmin ; LIU Jia
2010-06-09 ; 2010-06-09
关键词语音信号处理 端点检测 子带交叉熵 顺序统计滤波 speech signal processing voice activity detection sub-band cross-entropy order statistics filter TN912.3
其他题名Cross-entropy OSF-based voice activity detection algorithm
中文摘要为提高语音端点检测在强噪声环境下的准确率,提出了一种基于交叉熵顺序统计滤波的语音端点检测算法。该算法以子带交叉熵为语音/非语音的区分特征,首先将每帧语音的频谱划分成若干个子带,估计出每个子带能量与背景噪声之间的交叉熵,然后把相继若干帧的子带能量交叉熵经过一组顺序统计滤波器,最后根据各帧交叉熵的值对输入的语音进行分类。实验结果表明:该算法能够有效地区分语音和非语音。特别是在强噪声环境下依然能够保持很高的检测率,具有鲁棒性。通过实验结果比较,该算法在性能上优于最近提出的基于能量顺序统计滤波和单纯交叉熵判别的两种方法。; Voice activity detection in strong noise environments is improved by an algorithm based on the cross-entropy with an order statistics filter.The algorithm makes use of the sub-band cross-entropy as the speech/non-speech discrimination feature.The analyses first divides the speech spectrum into several sub-bands and then estimates the cross-entropy between the speech signal and the non-speech signal.An order statistics filter is applied to a sequence of the sub-band cross-entropies to obtain the cross-entropy of each frame.The speech and non-speech signal are classified based on the cross-entropy.Tests show that the algorithm effectively distinguishes speech from non-speech,even in high noise environments.Thus the algorithm outperforms two other the recently reported algorithms.; 国家自然科学基金资助项目(60776800); 国家“八六三”高技术项目(2006AA0101012007AA04Z2232008AA02Z414)
语种中文 ; 中文
内容类型期刊论文
源URL[http://hdl.handle.net/123456789/54167]  
专题清华大学
推荐引用方式
GB/T 7714
钱彦旻,刘加,QIAN Yanmin,等. 基于交叉熵顺序统计滤波的语音端点检测算法[J],2010, 2010.
APA 钱彦旻,刘加,QIAN Yanmin,&LIU Jia.(2010).基于交叉熵顺序统计滤波的语音端点检测算法..
MLA 钱彦旻,et al."基于交叉熵顺序统计滤波的语音端点检测算法".(2010).
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace