情报科学 ›› 2023, Vol. 41 ›› Issue (3): 100-108.

• 业务研究 • 上一篇    下一篇

网络文化遗产信息资源知识图谱的构建及其应用研究

  

  • 出版日期:2023-03-01 发布日期:2023-04-10

  • Online:2023-03-01 Published:2023-04-10

摘要: 【目的/意义】面对网络中大量由非结构化数据构成的文化遗产信息资源,如何从中抽取知识构建知识图谱
并进行应用研究,是新媒体时代进行文化遗产知识深度利用的基础。【方法/过程】文章首先根据信息资源的内容与
结构特征按照主题与类型进行分类,随后采用有针对性的关键词抽取方法获取概括信息资源主题的关键词,通过
SPARQL检索在外部知识库中进行文化遗产信息资源的命名实体识别,最后利用词汇相似度算法依托本体进行知
识融合,构建文化遗产信息资源知识图谱。【结果/结论】在实验中进行了网络文化遗产信息资源的知识抽取与知识
图谱构建,利用深度学习进行文化遗产知识推理,开展了知识图谱的应用研究。研究结果表明文章方法能够充分
利用网络中的文化遗产信息资源进行知识图谱构建,满足多种应用场景下分析需求。【创新/局限】由于文化遗产领
域内容庞大,有关研究数据有待进一步扩充以更好的研究文章方法的适用性。

Abstract: 【Purpose/significance】Faced with a large amount of cultural relic information resources on the Internet, how to extract
knowledge from them to build a knowledge graph and conduct application research is the basis for deep utilization of cultural relic
knowledge in the new media era.【Method/process】The article first classifies information resources according to their content and
structural features, then uses keyword extraction methods to obtain keywords that summarize the topics of information resources, per? forms named entity identification of cultural relic information resources in knowledge bases through SPARQL, and finally uses lexical similarity algorithms to rely on ontology for knowledge fusion to build knowledge graph.【Result/conclusion】Knowledge extraction and knowledge graph construction of online cultural relic information resources were carried out in the experiments. The results showed that the article method can make full use of the cultural relic information resources on the Internet for knowledge graph construction and meet the analysis requirements in a variety of application scenarios.【Innovation/limitation】Due to the huge content of cultural relic domain, the relevant research data need to be further expanded to better investigate the applicability of the article method.