情报科学 ›› 2024, Vol. 42 ›› Issue (4): 79-88.

• 理论研究 • 上一篇    下一篇

嬗变与发展:中国政府数据开放管理央地政策文本的
内容主题挖掘与分析

  

  • 出版日期:2024-04-05 发布日期:2024-06-08

  • Online:2024-04-05 Published:2024-06-08

摘要:

【目的/意义】识别我国政府数据开放管理央地政策文本的内容主题及演化特征。【方法/过程】获取我国央
地政策文本,对政策文本进行筛选和剔除,运用LDA主题模型实现对政府数据开放管理政策文本内容的主题识别、
演化分析。【结果/结论】发现当前所发布的政府数据开放管理央地政策文本具有明显的生命周期特征,目前阶段处
于平缓期,其内容主题可以分为四大类,政府数据开放的数据安全管理是头号热点主题,每个主题随时间推移呈现
不同演化趋势。【创新/局限】采集大样本政策文本作为数据源,每篇只保留与主题高度相关的段落,利用LDA主题
模型进行政策文本的主题识别与分析,直观地揭示了中国政府数据开放管理央地政策文本的现状与趋势。

Abstract:

【Purpose/significance】Identify the content themes and evolutionary features of the central and local policy texts on open
government data management in China.
【Method/process】Obtain the central and local policy texts in China, filter and eliminate the
policy texts, and use LDA topic model to realize the topic identification and evolution analysis of the content of this paper on open gov⁃
ernment data management policy.【Result/conclusion】It is found that the currently released open government data management cen⁃
tral and local policy texts have obvious life cycle characteristics, and the current stage is in a plateau, and their content themes can be
divided into four major categories, with data security management of government data openness being the number one hot topic, and
each topic showing different evolutionary trends over time
.
【Innovation/limitation】A large sample of policy texts is collected as the data
source, and only the passages with highly relevant themes are retained in each text. The LDA theme model is used to identify and ana⁃
lyze the themes of the policy texts, which intuitively reveals the current situation and trends of the central and local policy texts of open
government data management in China.