情报科学 (Information Science), 2022, Vol. 40, Issue (5): 73-83.

• Practice Research •

Research on a Dynamic Identification Model and Response Strategies for Network Mass Incidents Based on Public Opinion Big Data


  

  • Online:2022-05-01 Published:2022-05-30

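Abstract (摘要): [Purpose/Significance] As the Internet's influence on society grows and network mass incidents increasingly disrupt social life, it is necessary to understand how such incidents evolve, determine their categories, extract their characteristics, and propose targeted response measures for each category. [Method/Process] The LDA topic model is combined with the K-means algorithm: the LDA model identifies the latent semantics of the texts, and the SVM algorithm is finally applied in the cluster analysis of network mass incidents, yielding five categories of incidents. [Result/Conclusion] The resulting dynamic identification model, trained on a large volume of text, is readily interpretable when the number of event clusters is 5. It produces an objective classification of network mass incidents into economic, social, cultural, ethnic, and environmental types, providing a basis for category-specific government response strategies. [Innovation/Limitation] Using the LDA topic model together with the K-means algorithm reduces the number of model iterations, determines the optimal number of topics, and improves the accuracy of incident identification. However, the text data collected from the Wisers (慧科) news database are limited in scope, and the event characteristics reflected in the classification results have certain limitations. Future work could further expand the dynamic text database and refine and deepen the classification algorithm.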

Abstract: [Purpose/significance] With the increasing influence of the Internet on society and the impact of network mass incidents on social life, it is necessary to grasp how network mass incidents evolve, determine the type of each incident, refine its characteristics, and put forward targeted countermeasures for the different types. [Method/process] The LDA topic model is combined with the K-means algorithm, with the LDA model used to recognize the latent semantics of the texts. Finally, a support vector machine algorithm is used to perform cluster analysis of network mass incidents, yielding five types of incidents. [Result/conclusion] The dynamic identification model for network mass incidents, built through training on a large number of texts, is well interpretable when the number of clusters is 5. It gives an objective classification of network mass incidents into economic, social, cultural, ethnic, and environmental types, providing a basis for the government's type-specific response strategies. [Innovation/limitation] This paper uses the LDA model and the K-means algorithm to reduce the number of model iterations, determine the best number of topics, and improve the accuracy of incident recognition. However, the text data collected from the Wisers news database are limited in scope, and the event features reflected in the classification results have certain limitations. Further research can expand the dynamic text database and improve and deepen the classification algorithm.
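
The pipeline described in the abstract (LDA topic proportions per document, followed by K-means clustering of those proportions) can be illustrated with a minimal, standard-library-only sketch. Everything here is an assumption made for illustration: the toy English token lists, the hyperparameters `alpha` and `beta`, the iteration counts, and the function names `lda_gibbs` and `kmeans` do not come from the paper. The SVM step is not sketched because the abstract does not say how it is applied, and a real study would more likely use a library implementation on word-segmented Chinese text.

```python
import random

def lda_gibbs(docs, n_topics, n_iter=200, alpha=0.1, beta=0.01, seed=0):
    """Collapsed Gibbs sampling for LDA; returns per-document topic proportions."""
    rng = random.Random(seed)
    vocab = {w: i for i, w in enumerate(sorted({w for d in docs for w in d}))}
    n_words = len(vocab)
    doc_topic = [[0] * n_topics for _ in docs]            # tokens of doc d in topic k
    topic_word = [[0] * n_words for _ in range(n_topics)]  # tokens of word v in topic k
    topic_total = [0] * n_topics
    assign = []
    # Random initial topic assignment for every token.
    for d, doc in enumerate(docs):
        row = []
        for w in doc:
            k = rng.randrange(n_topics)
            row.append(k)
            doc_topic[d][k] += 1
            topic_word[k][vocab[w]] += 1
            topic_total[k] += 1
        assign.append(row)
    # Resample each token's topic from its conditional distribution.
    for _ in range(n_iter):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                k, v = assign[d][i], vocab[w]
                doc_topic[d][k] -= 1
                topic_word[k][v] -= 1
                topic_total[k] -= 1
                weights = [
                    (doc_topic[d][t] + alpha) * (topic_word[t][v] + beta)
                    / (topic_total[t] + n_words * beta)
                    for t in range(n_topics)
                ]
                k = rng.choices(range(n_topics), weights=weights)[0]
                assign[d][i] = k
                doc_topic[d][k] += 1
                topic_word[k][v] += 1
                topic_total[k] += 1
    # Smoothed topic proportions; each row sums to 1.
    return [
        [(doc_topic[d][t] + alpha) / (len(doc) + n_topics * alpha) for t in range(n_topics)]
        for d, doc in enumerate(docs)
    ]

def squared_distance(p, q):
    return sum((a - b) ** 2 for a, b in zip(p, q))

def kmeans(points, k, n_iter=50, seed=0):
    """Plain K-means (Lloyd's algorithm); returns one cluster label per point."""
    rng = random.Random(seed)
    centers = [list(p) for p in rng.sample(points, k)]
    labels = [0] * len(points)
    for _ in range(n_iter):
        labels = [min(range(k), key=lambda c: squared_distance(p, centers[c])) for p in points]
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:  # keep the old center if a cluster becomes empty
                centers[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels

# Invented toy corpus: two short "documents" per hypothetical incident type.
themes = {
    "economic": ["wage", "price", "debt", "market"],
    "social": ["hospital", "school", "housing", "traffic"],
    "cultural": ["film", "celebrity", "heritage", "festival"],
    "ethnic": ["minority", "language", "custom", "region"],
    "environmental": ["pollution", "river", "factory", "waste"],
}
docs = [words * 2 for words in themes.values() for _ in range(2)]

theta = lda_gibbs(docs, n_topics=5)  # document-topic proportions
labels = kmeans(theta, k=5)          # five clusters, mirroring the reported cluster count
for doc, label in zip(docs, labels):
    print(label, doc[:4])
```

Clustering the topic proportions rather than raw word counts keeps the feature space small (one dimension per topic), which is one plausible reading of why the authors pair LDA with K-means.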