情报科学 ›› 2024, Vol. 42 ›› Issue (2): 12-23.

• 专论 • 上一篇    下一篇

融合BERT和主题模型的谣言检测方法

  

  • 出版日期:2024-02-05 发布日期:2024-06-07

  • Online:2024-02-05 Published:2024-06-07

摘要:

【目的/意义】在谣言检测过程中,针对文本的上下文语义特征和主题语义特征没有得到充分挖掘的问题,
提出了一种融合BERT和主题模型的谣言检测方法,提升谣言检测效果。【方法/过程】利用BERT模型挖掘文本动
态上下文语义特征,利用主题模型挖掘文本主题语义特征;同时结合了微博影响力和用户可信度特征,并在微博影
响力特征设计阶段考虑了时效性因素;将以上特征进行充分融合,构建谣言检测模型。【结果/结论】以微博真实数
据进行实证分析,实验结果表明,该方法在进行谣言检测任务时效果较好,准确率最高达到了 93.68%,相较于表现
性能最好的传统机器学习方法、深度学习方法、融合特征方法分别提升了约 9.92%、
3
.29%、
7
.75%,能够实现对谣言
的有效检测。此外,在微博影响力特征设计阶段考虑时效性因素更有助于提升谣言检测效果。【创新/局限】谣言检
测效果仍有一定提升空间,未来可尝试结合用户评论内容特征进一步提升谣言检测效果。

Abstract:

【Purpose/significance】In the process of rumor detection, the contextual semantic features and topic semantic features of the
text have not been sufficiently addressed. This study proposes a rumor detection method that combines BERT and topic models to en⁃
hance the effectiveness of rumor detection.
Method/process】Utilizing the BERT model to extract dynamic contextual semantic fea⁃
tures from text, and employing topic modeling techniques to mine topic-based semantic features, this paper also integrates both Weibo
influence and user credibility features. In the design phase of the Weibo influence feature, time-sensitive factors are taken into consid⁃
eration. The aforementioned features are fully integrated to build a rumor detection model.【Result/conclusion】An empirical analysis
was conducted using real-world data from Weibo, and the experimental results demonstrate that the proposed method exhibits superior
performance in rumor detection tasks, achieving a peak accuracy of 93.68%. This represents improvements of approximately 9.92%,
3.29%, and 7.75% compared to the best-performing traditional machine learning methods, deep learning methods, and feature fusion
methods, respectively, enabling effective rumor detection. Furthermore, considering the timeliness factor during the design phase of
Weibo influence features contributes to enhancing the efficacy of rumor detection.【Innovation/limitation】There is still room for im⁃
provement in rumor detection performance. In the future, it may be worthwhile to explore incorporating user comment content features
to further enhance the effectiveness of rumor detection.