Journal of Shanghai Jiaotong University (Natural Science Edition)

• Automation Technology, Computer Technology •

Topic Evolution Based on LDA and Topic Association

CHU KeMing, LI Fang

  1. (School of Electronic, Information and Electrical Engineering, Shanghai Jiaotong University, Shanghai 200240, China)
  • Received: 2010-05-11 Revised: 1900-01-01 Online: 2010-11-30 Published: 2010-11-30

Abstract: Topic evolution helps people acquire information quickly and follow trends. This paper proposes a method for mining how topics change over time, realizing topic evolution through topic extraction and topic association. Topics are extracted automatically with the LDA model from the document collection of each time period, and the number of topics may vary from one period to the next. Topics are then associated by computing, for every pair of topics in adjacent time periods, the Jensen-Shannon divergence between their distributions and the similarity of their feature vectors. Experimental results show that the method describes not only how the strength of a single topic changes over time, but also the emergence of new topics, the disappearance of old topics, and the evolution of topic content over time.

Keywords: topic detection, topic association, topic evolution, latent Dirichlet allocation
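The association step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract names only Jensen-Shannon divergence and feature similarity, so everything else here is an assumption made for illustration. That includes representing a topic as a word-to-probability dict, building the feature vector from the top `k` words, using cosine similarity, requiring both criteria to pass, the threshold values, and the hypothetical function names `js_divergence`, `top_features`, `cosine_similarity`, and `link_topics`.

```python
import math


def js_divergence(p, q):
    """Jensen-Shannon divergence (log base 2, range [0, 1]) of two word distributions."""
    vocab = set(p) | set(q)
    # Mixture distribution; strictly positive wherever p or q is positive.
    m = {w: 0.5 * (p.get(w, 0.0) + q.get(w, 0.0)) for w in vocab}

    def kl(a):
        return sum(a[w] * math.log2(a[w] / m[w]) for w in a if a[w] > 0)

    return 0.5 * kl(p) + 0.5 * kl(q)


def top_features(topic, k):
    """Assumed feature vector: the k most probable words with their weights."""
    ranked = sorted(topic.items(), key=lambda kv: kv[1], reverse=True)
    return dict(ranked[:k])


def cosine_similarity(a, b):
    """Cosine similarity of two sparse vectors stored as dicts."""
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def link_topics(prev_topics, next_topics, max_jsd=0.5, min_sim=0.5, k=10):
    """Compare every topic pair across two adjacent time periods.

    Returns (links, emerged, vanished). A pair is linked when it passes
    both thresholds (an assumed rule); unlinked topics in the later period
    count as new, unlinked topics in the earlier period as disappeared.
    """
    links = []
    for i, p in enumerate(prev_topics):
        for j, q in enumerate(next_topics):
            jsd = js_divergence(p, q)
            sim = cosine_similarity(top_features(p, k), top_features(q, k))
            if jsd <= max_jsd and sim >= min_sim:
                links.append((i, j, jsd, sim))
    linked_prev = {i for i, _, _, _ in links}
    linked_next = {j for _, j, _, _ in links}
    vanished = [i for i in range(len(prev_topics)) if i not in linked_prev]
    emerged = [j for j in range(len(next_topics)) if j not in linked_next]
    return links, emerged, vanished
```

In practice the per-period topic-word distributions would come from an LDA model fitted separately on each time period's documents, and the thresholds would need tuning on the corpus at hand.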

CLC number: