Improving generation performance of speech emotion recognition by denoising autoencoders
Linlin Chao; Jianhua Tao; Minghao Yang; Ya Li
2014
会议日期2014-9
会议地点Singapore
关键词Speech Emotion Recognition
页码341-344
英文摘要1; A speech emotion recognition algorithm should generalize well when the target person’s speech samples and prior knowledge about their emotional content are not included in the training data. In order to achieve this objective, we utilize denoising autoencoders based approach to solve this task. In this study, a relatively small dataset, which contains close to 1500 persons’ emotion sentences, is introduced. By unsupervised pre-training with this dataset, denoising autoencoders learn features which contain more emotion-specific information than speaker-specific information in data successfully. Experiment results in CASIA dataset show that this denoising autoencoders based approach can improve the generation performance of speech emotion recognition significantly.
会议录The 9th International Symposium on Chinese Spoken Language Processing
语种英语
内容类型会议论文
源URL[http://ir.ia.ac.cn/handle/173211/11847]  
专题自动化研究所_模式识别国家重点实验室_人机语音交互团队
通讯作者Linlin Chao
作者单位Institute of Automation, Chinese Academy of Sciences
推荐引用方式
GB/T 7714
Linlin Chao,Jianhua Tao,Minghao Yang,et al. Improving generation performance of speech emotion recognition by denoising autoencoders[C]. 见:. Singapore. 2014-9.
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace