Multi topic distribution model for topic discovery in twitter | |
Zheng Lei; Han Kai | |
2013 | |
Conference name | 2013 IEEE 7th International Conference on Semantic Computing, ICSC 2013 |
Conference location | Irvine, CA, United States |
Abstract | Microblogging websites such as Twitter are a new form of social media that is growing increasingly popular. Compared with traditional media, such as the New York Times, tweets are structured data and are shorter in length. Although traditional topic modeling algorithms have been well studied, few algorithms are specifically designed to mine Twitter data according to its own features. In this paper, according to the structure of Twitter data, we introduce the Multi Topic Distribution Model to mine topics. In addition, we have observed that a tweet mostly discusses either public issues or personal life. Previous studies analyze all tweets equally and fail to discover the interests of each individual. By using features of Twitter data and dividing topics into two semantic types, our model not only discovers topics efficiently but can also indicate which topics interest a given user and which topics are hot issues in the Twitter community. Using Gibbs sampling for approximate inference, we conduct experiments on the TREC2011 data set. The experimental results compare our model with Latent Dirichlet Allocation and the Author Topic Model. We also illustrate an example of topics that interest the whole community and several individual users. |
Indexed by | EI |
Language | English |
Content type | Conference paper |
Source URL | [http://ir.siat.ac.cn:8080/handle/172644/5271] |
Collection | Shenzhen Institutes of Advanced Technology_Other |
Author affiliation | 2013 |
Recommended citation (GB/T 7714) | Zheng Lei, Han Kai. Multi topic distribution model for topic discovery in twitter[C]. In: 2013 IEEE 7th International Conference on Semantic Computing, ICSC 2013. Irvine, CA, United States. |