情报科学 ›› 2024, Vol. 42 ›› Issue (11): 101-111.

• 业务研究 • 上一篇    下一篇

数据故事的Freytag金字塔型情节结构的自动生成方法

  

  • 出版日期:2024-11-01 发布日期:2025-04-08

  • Online:2024-11-01 Published:2025-04-08

摘要: 【目的/意义】情节是故事中一系列事件的有序安排,是数据故事的骨架。本研究借鉴经典的 Freytag金字 塔型情节结构,提出数据故事的情节结构的生成方法,对数据故事的工程化研发具有重要意义。【方法/过程】首先, 界定了数据故事情节的概念,并讨论了其与文学故事中情节概念的区别,并明确了本研究所使用的Freytag金字塔 型情节结构。其次,探讨了情节的基本要素,即事件与情节结构,基于此提出情节的自动生成与呈现方法,主要涉 及事件推荐、事件采样、情节可视化和情节映射四项任务。最后,选取 UCI Breast-Cancer 公开数据集,利用 SPLIME、信息熵决策树、SMOTE等算法,实现数据故事情节的生成过程,并通过树状图和 Hype Cycle曲线图可视化 展示生成的数据故事情节。【结果/结论】在数据故事化领域首次明确界定了数据故事情节的类型及设计要素,提出 了Freytag金字塔型情节结构的自动生成方法。【创新/局限】本研究弥补了现阶段在数据故事构成要素方面研究的 缺失,提出了实验可行的数据故事情节生成方法及情节结构映射思路,为后续深入研究数据故事情节提供了一定 的启示。

Abstract: 【Purpose/significance】As a link between events and story, the plot is one of the pillars of data storytelling. It is important to discuss the theoretical research, automatic generation and engineering development of data story plot.【Method/process】Firstly, the con⁃ cept of data story plot and its difference from literary plot are defined, and the Freytag′s Pyramid structure used in this study is clarified. Secondly, the elements of plot design are discussed in terms of the events in the plot and the plot structure. and based on this, an auto⁃ matic plot generation and presentation method is proposed, mainly involving event recommendation, event sampling, plot visualisation and plot mapping. Then, the paper takes Freytag′s Pyramid structure as an example, draws on SP-LIME, information entropy decision tree and SMOTE algorithms to achieve data story plot generation. Finally, the paper shows the plot generation effect through a tree dia⁃ gram and a Hype Cycle graph.【Result/conclusion】For the first time in the field of data storytelling, types of data story plot and design el⁃ ements are proposed, and an automatic generation method for Freytag′s Pyramid structure is proposed.【Innovation/limitation】This study makes up for the lack of research at this stage on the constituent elements of data storytelling, proposes an experimentally feasible plot gen⁃ eration method and plot structure mapping ideas, and provides certain insights for the subsequent in-depth study of data story plot.