情报科学 ›› 2023, Vol. 41 ›› Issue (3): 127-135.

• 业务研究 • 上一篇    下一篇

新兴技术识别与演化路径分析方法研究 ——以集成电路领域为例

  

  • 出版日期:2023-03-01 发布日期:2023-04-10

  • Online:2023-03-01 Published:2023-04-10

摘要: 【目的/意义】通过综合使用论文和专利数据源,开展新兴技术的识别与演化路径方法研究,期望以此提高
学科领域新兴技术探测研究的准确性和科学性。【方法/过程】首先,针对集成电路这一特定领域使用 Word2Vec语
义相似度与字符串相似度相结合的方法构建词袋,并利用 LDA 主题模型发现并识别集成电路领域隐含的技术主
题,构建新颖度、强度和热度等多维指标对新兴技术进行对比和甄别。其次,划分时间窗,采用余弦相似度算法计
算相邻时间窗内主题间的相似性,以可视化路径的形式将筛选结果进行呈现,以此判断主题演化关系类型。【结果/
结论】研究发现,集成电路领域呈现学界与业界研究成果相互促进的良好态势,结合论文与专利数据的新兴技术识
别方法,可以有效且清晰的发现集成电路研究领域的热点型、增长型、成熟型和潜在型技术主题,并通过新兴技术
演化路径的构建,揭示了领域科学与技术间知识的交互与转移。【创新/局限】本研究创新性采用 Word2Vec语义相
似度与字符串相似度相结合的方法构建词袋,提高了词袋构建质量,为后续基于LDA主题模型识别隐含的技术主
题奠定了基础,但在数据源的多样性、时滞性问题,以及模型阈值设置的客观性上还存在局限性,需要进一步加强
相关研究。

Abstract: 【Purpose/significance】Expecting to improve the accuracy and scientific, both papers and patent data sources are compre?
hensively used in this study to identify emerging technologies and construct evolution paths.【Method/process】Aiming at the specific
area of integrated circuit, this paper combines semantic similarity based on Word2Vec Model and string similarity to build the bag of
words. And LDA topic model is applied to extract the implicit technical themes in this field, after which the paper establishes multidi?
mensional index to compare and identify the emerging technologies topics, including novelty, strength and popularity. Then the paper
divides the time window uses dot product cosine similarity to compute the similarity between topics in adjacent windows to detect the
birth, succession, merge, split and extinction of topic evolution relationship, which are presented visually.【Result/conclusion】Accord?
ing to this study, within the field of integrated circuits, the scientific research and the patent research have formed mutual promotion.
By combining the papers and patent data sources, technology topics could be detected as popular, growing, mature and potential types and their evolution paths could be constructed in a more effective way, which reveals the interaction and transfer of knowledge between science and technology.【Innovation/limitation】In this study, the combination of Word2Vec semantic similarity and string similarity was innovatively used to construct word bags, which improved the quality of word bags and laid the foundation for the subsequent iden? tification of implicit technical topics based on LDA topic model. However, this study still has limitations in the diversity of data sources, time delay, and objectivity of model threshold setting, which need to be further strengthened.