情报科学 ›› 2025, Vol. 43 ›› Issue (5): 43-57.

• 理论研究 • 上一篇    下一篇

ChatGPT大语言模型的评论情感分类预测与主题识别研究

  

  • 出版日期:2025-05-05 发布日期:2025-09-01

  • Online:2025-05-05 Published:2025-09-01

摘要: 【目的/意义】探究大众对于ChatGPT大语言模型的情感和主要关注点,有助于人工智能企业推进AIGC技 术的开发,可为政府制定相应技术监管策略提供建议。【方法/过程】爬取微博数据并进行预处理后,构建Bert模型进 行情感分类,并结合 LDA主题聚类及 ARIMA 时间序列模型,揭示公众对 ChatGPT 大语言模型的关注焦点和态度 倾向,预测用户评论情感走向。【结果/结论】大众对以ChatGPT为代表的大语言模型的态度因人而异,情感分布较 为均衡。用户的主要关注点呈现复杂化的特点,未来以ChatGPT为代表的大语言模型在公众的认可度和情感态度 方面有着良好预期。【创新/局限】对情感分类结果进行细粒度情感评论LDA主题聚类的同时,将粗粒度分类结果与 ARIMA时间序列模型结合,打破单纯依赖静态情感分析的局限性,多角度、多方面地对大众之于ChatGPT大语言 模型的态度、关注点和未来预期进行分析,深入探究ChatGPT大语言模型的发展趋势和潜在风险。

Abstract: 【Purpose/significance】This research aims to explore the public's sentiments and key concerns regarding ChatGPT and large language models, which can assist AI companies in advancing AIGC technology development and provide recommendations for governments in formulating corresponding regulatory strategies.【Method/process】After web-scraping and preprocessing data from Weibo, a BERT model was constructed for sentiment classification. The Latent Dirichlet Allocation (LDA) model was used for topic clustering, and the ARIMA time series model was employed to reveal the public's focal points and sentiment tendencies toward Chat‑ GPT, as well as to predict the emotional trajectory of user comments【. Result/conclusion】The public's attitudes toward large language models, represented by ChatGPT, vary among individuals, with a relatively balanced distribution of sentiments. Users' key concerns are characterized by increasing complexity, and there are positive expectations for public recognition and emotional attitudes toward large language models like ChatGPT in the future.【Innovation/limitation】By integrating fine-grained sentiment classification with LDA topic clustering and combining coarse-grained classification results with the ARIMA time series model, this study overcomes the limitations of relying solely on static sentiment analysis. It provides a multi-perspective and comprehensive analysis of the public's at‑ titudes, concerns, and future expectations regarding ChatGPT, offering in-depth insights into the developmental trends and potential risks of large language models like ChatGPT.