情报科学 ›› 2025, Vol. 43 ›› Issue (8): 20-28.

• 理论研究 • 上一篇    下一篇

基于句子生成模型的跨学科学术创新机会发现研究

  

  • 出版日期:2025-08-05 发布日期:2025-12-12

  • Online:2025-08-05 Published:2025-12-12

摘要: 【目的/意义】学术创新机会发现是确定创新方向和引领创新发展的重要途径,基于句子生成模型发现学术 创新机会,能够促进学术创新机会发现的智能化发展,推动科研创新。【方法/过程】以管理学为源学科,情报学为目 标学科,首先对管理学核心期刊(CSSCI)中的文献聚类,以各类团的主题词为句子起点,利用情报学语料微调 GPT-2模型生成句子。设计创新贡献度指标,计算生成句的创新贡献度得分,将创新贡献度得分最高的句子视为 学术创新机会。最后,通过方差分析验证本文方法的稳健性,从国家政策、学术需求两个维度验证了学术创新机会 的有效性。【结果/结论】本文方法准确识别到管理学向情报学输入的学术创新机会,如“加速专利技术转移,提升专 利应用价值”“促进公共数据资源的开放流通与利用”等。分析发现,本文方法也可推广应用于异质领域颠覆性创 新机会的识别研究。【创新/局限】为进一步丰富跨学科创新机会的发现方法,引入句子生成模型,在新句子的基础 上发现学术创新机会。然而,本研究所用语料均为中文,未研究外文文献中的创新机会,所发现的创新机会尚不 完善。

Abstract: 【Purpose/significance】Academic innovation opportunity discovery is an important way to determine the direction of innova⁃ tion and lead the development of innovation. Discovering academic innovation opportunities based on sentence generation models can promote the intelligent development of academic innovation opportunity discovery and promote scientific research innovation【. Method/ process】Taking management as the source discipline and information science as the target discipline, the literature in the core jour⁃ nals of management(CSSCI) is firstly clustered, and the subject words of each cluster are used as the starting point of the sentences, which are generated by using the intelligence corpus fine-tuning GPT-2 model. Design the innovation contribution index, calculate the innovation contribution score of the generated sentences, and regard the sentences with the highest innovation contribution score as academic innovation opportunities.Finally,the robustness of this paper's methodology is verified by ANOVA, and the validity of aca⁃ demic innovation opportunities from the perspectives of national policies and academic needs was verified.【Result/conclusion】 Through empirical research, the methodology of this paper accurately identifies academic innovation opportunities imported from man⁃ agement to information science,such as "accelerating the transfer of patented technology and enhancing the application value of pat⁃ ents" and "promoting the open circulation and utilisation of public data resources", etc. The analysis finds that the methodology of this paper can also be generalised to identify disruptive innovation opportunities in heterogeneous fields. It is found that the methodology of this paper can also be extended to the identification of disruptive innovation opportunities in heterogeneous fields【. Innovation/limita⁃ tion】To further enrich the method of discovering interdisciplinary innovation opportunities, a sentence generation model is introduced to discover academic innovation opportunities on the basis of new sentences. However, the corpus of this study is all in Chinese, and the innovation opportunities in foreign language literature are not studied, and the innovation opportunities discovered are not yet per⁃ fect.