CORC  > 北京大学  > 计算机科学技术研究所
Audio event recognition based on DBN features from multiple filter-bank representations
Guo, Feng ; Chen, Xiaoou ; Yang, Deshun
2015
英文摘要In the audio event classification or detection research field, the representation of the audio itself is important. Many researchers tried to apply Deep Belief Network (DBN) to learn new representations of the audio. The mel filter-bank feature, which is obtained based on mel scale, is commonly used as the low level representation of the audio in the pre-processing procedure of DBN. However, the mel bands used in mel filter-bank feature may not be sufficient for the comprehensive representation of the diverse audio events in the real world and then it will make it difficult for DBN to learn good audio features. In this paper, two steps are taken to explore and tackle the problem. In the first step, we conduct a comparison of the effects among different arrangements of frequency bands to DBN feature learning in the audio event recognition. Here the arrangements of frequency bands include mel bands, bark bands, linear bands and pyramid bands. In the second step, in order to utilize the different classification capabilities of the DBN features on different audio events, we adopt the Adaboost algorithm to fuse them. We conduct the experiments on real datasets collected from findsound website, and the results verifies that our proposed audio event classification system, which uses diverse features selected by Adaboost from all sets of DBN features, outperforms the one using only one kind of DBN feature set. ? 2015 IEEE.; EI
语种中文
出处17th IEEE International Workshop on Multimedia Signal Processing, MMSP 2015
DOI标识10.1109/MMSP.2015.7340807
内容类型其他
源URL[http://ir.pku.edu.cn/handle/20.500.11897/436456]  
专题计算机科学技术研究所
推荐引用方式
GB/T 7714
Guo, Feng,Chen, Xiaoou,Yang, Deshun. Audio event recognition based on DBN features from multiple filter-bank representations. 2015-01-01.
个性服务
查看访问统计
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。


©版权所有 ©2017 CSpace - Powered by CSpace