Please wait a minute...
档案学研究  2025, Vol. 39 Issue (6): 12-18    DOI: 10.16065/j.cnki.issn1002-1620.2025.06.002
  基础理论研究 本期目录 | 过刊浏览 |
数据归档背景下对来源的再认识——基于档案学与计算机科学的双向考察
文利君
武汉大学信息管理学院 武汉 433072
Rethinking the Concept of Provenance in the Context of Data Archiving:A Bidirectional Examination Based on Archival Science and Computer Science
WEN Lijun
School of Information Management, Wuhan University, Wuhan 433072
全文: HTML    PDF(1275 KB)  
输出: BibTeX | EndNote (RIS)      
摘要: 

数据归档是档案领域重点探索的领域,而数据来源是其归档和管理的重要因素。为优化档案领域对数据来源的认知,本文对档案领域和数据溯源中的来源进行了梳理和比较,研究发现:二者对来源的认知均呈现出对形成者和广泛的形成背景的重视;但在来源的关注重点、呈现方式、信息量等方面存在差异。基于此,提出:在数据归档背景下,档案领域应将对数据来源的理解由相对静态向动态过程扩充;提高对来源的重视;参与对数据的前端控制,并提升数据保存能力。

关键词 来源数据溯源数据归档来源原则    
Abstract

Data archiving is a key area of exploration in the archival field, and the provenance of data is an important factor in its archiving and management. To optimize the archival field's understanding of data provenance, this study sorts out and compares the concept of provenance as interpreted in both archival science and data provenance research. The findings reveal that both domains emphasize the importance of creators and the broader contextual background in which records or data are generated. However, differences exist in terms of their focal concerns, modes of representation, the amount of information regarding provenance. Based on these insights, this study proposes that, in the context of data archiving, archival science should expand its conception of provenance from a relatively static notion to one that encompasses dynamic processes; increase the emphasis placed on provenance; actively engage in the front-end control of data; and strengthen its capacity for long-term data preservation.

Key wordsprovenance    data provenance    data archiving    the principal of provenance
出版日期: 2025-12-28
引用本文:

文利君. 数据归档背景下对来源的再认识——基于档案学与计算机科学的双向考察[J]. 档案学研究, 2025, 39(6): 12-18.
WEN Lijun. Rethinking the Concept of Provenance in the Context of Data Archiving:A Bidirectional Examination Based on Archival Science and Computer Science. Archives Science Study, 2025, 39(6): 12-18.

链接本文:

https://journal12.magtechjournal.com/Jwk_dax/CN/10.16065/j.cnki.issn1002-1620.2025.06.002      或      https://journal12.magtechjournal.com/Jwk_dax/CN/Y2025/V39/I6/12

[1] 金波, 添志鹏, 杨鹏. 大数据时代档案数据治理运行机制建构[J]. 档案学研究, 2023(4):65-73.
[2] [27] 王芳, 赵洪, 马嘉悦, 等. 数据科学视角下数据溯源研究与实践进展[J]. 中国图书馆学报, 2019(5):79-100.
[3] NIU J. Provenance: crossing boundaries[J]. Archives and Manuscripts, 2013(41):105-115.
[4] MOREAU L. The foundations for provenance on the web[J]. Foundations and Trends in Web Science, 2010(2):99-241.
[5] BUNEMAN P, KHANNA S, WANG-CHIEW T. Why and where: a characterization of data provenance[C]// Database Theory—ICDT 2001: 8th International Conference. London: Springer Berlin Heidelberg, 2001: 316-330.
[6] 黄霄羽. 来源原则“重新发现”的发展进程与基本内涵—电子时代来源原则重新定位的思考之一[J]. 北京档案, 2004(10):18-21.
[7] 连志英, 蒋玲, 张晓. 档案来源观的后现代转向[J]. 档案学通讯, 2023(5):4-10.
[8] 祭鸿雁. “新来源观”:实质与意义探析[J]. 档案学通讯, 2003(1):21-25.
[9] 靳颖. 论来源原则的发展趋向[J]. 浙江档案, 2004(3):21-22.
[10] 黄霄羽. 北美档案界对来源原则的“重新发现”[J]. 档案学通讯, 2001(2):73-76.
[11] 特里·库克. 对数字时代来源原则的反思[J]. 李音,译. 档案学研究, 2011(1):82-85.
[12] 徐拥军. 档案后保管范式与知识管理[J]. 档案学通讯, 2008(2):27-31.
[13] 何嘉荪, 楼淑君. 后保管时代基础理论研究之三—新来源观解析[J]. 浙江档案, 2013(3):9-14.
[14] ICA. ISAD(G):General International Standard Archival Description: Second edition[EB/OL].[2025-03-10]. https://www.ica.org/resource/isadg-general-international-standard-archival-description-second-edition/.
[15] The Library of Congress. Encoded Archival Description[EB/OL].[2025-03-10]. https://www.loc.gov/ead/.
[16] MICHETTI G. Provenance: an archival perspective[M]// Building Trust in Information:Perspectives on the Frontiers of Provenance. Cham: Springer International Publishing, 2016: 59-68.
[17] MILES S, WONG S C, FANG W, et al. Provenance-based validation of e-science experiments[J]. Journal of Web Semantics, 2007(5):28-38.
[18] [25] BUNEMAN P, KHANNA S, TAN W C. Data provenance: some basic issues[C]// International Conference on Foundations of Software Technology and Theoretical Computer Science. Berlin:Springer Berlin Heidelberg, 2000: 87-93.
[19] PÉREZ B, RUBIO J, SÁENZ-ADÁN C. A systematic review of provenance systems[J]. Knowledge and Information Systems, 2018(57):495-543.
[20] RAGAN E D, ENDERT A, SANYAL J, et al. Characterizing provenance in visualization and data analysis: an organizational framework of provenance types and purposes[J]. IEEE Transactions on Visualization and Computer Graphics, 2015(22):31-40.
[21] DAI C, LIN D, BERTINO E, et al. An approach to evaluate data trustworthiness based on data provenance[C]// Workshop on Secure Data Management. Berlin:Springer Berlin Heidelberg, 2008: 82-98.
[22] Open Provenance. The OPM Provenance Model(OPM)[EB/OL].[2025-03-10]. https://openprovenance.org/opm/.
[23] MOREAU L, FREIRE J, FUTRELLE J, et al. The open provenance model: an overview[C]// International Provenance and Annotation Workshop. Berlin:Springer Berlin Heidelberg, 2008: 323-326.
[24] MISSIER P, BELHAJJAME K, CHENEY J. The W3C PROV family of specifications for modelling provenance metadata[C]// Proceedings of the 16th International Conference on Extending Database Technology. New York: Association for Computing Machinery, 2013: 773-776.
[26] SIMMHAN Y L, PLALE B, GANNON D. A survey of data provenance in e-science[J]. ACM Sigmod Record, 2005(34):31-36.
[28] GLAVIC B. Big data provenance: challenges and implications for benchmarking[C]// Workshop on Big Data Benchmarks. Berlin:Springer Berlin Heidelberg, 2012: 72-80.
[1] 周祺, 周昊, 张照余. 基于行业经验的三维数模长期保存关键要素与策略探析[J]. 档案学研究, 2025, 39(4): 115-124.
[2] 龙家庆. 澳大利亚文件系列体系的历史演进、思想内核与当代镜鉴[J]. 档案学研究, 2025, 39(4): 132-140.
[3] 洪佳惠. 档案“异质/同质—分离/连续”问题研赜——从马比荣与热尔蒙之争说起[J]. 档案学研究, 2024, 38(6): 39-45.
[4] 陆阳, 葛泽钰. 理论旅行与知识生产:来源原则的中国旅程嬗变考察[J]. 档案学研究, 2024, 38(3): 19-27.
[5] 顾伟. 照片类电子档案元数据真实性研究[J]. 档案学研究, 2022, 36(1): 92-96.
[6] 赵跃, 孙晶琼, 段先娥. 档案化:档案科学介入数据资源管理的理性思考[J]. 档案学研究, 2020, 34(5): 83-91.
[7] 张衍, 黄清晨. 后保管理论与文件连续体理论关系的重新审视[J]. 档案学研究, 2020, 34(1): 25-31.
[8] 孙逊, 于英香, 孙安. 人物档案多级著录应用研究—— 以上海交通大学钱学森图书馆特藏“629袋”为例[J]. 档案学研究, 2019, 33(4): 66-71.
[9] 陈永生, 杨茜茜, 王沐晖, 苏焕宁. 基于互联网政务服务平台的文件归档与管理:记录观[J]. 档案学研究, 2019, 33(3): 16-23.
[10] 归吉官. 试析文件的历史联系(结构)运动—— 兼论来源原则的生命力[J]. 档案学研究, 2019, 33(1): 31-37.
[11] 陶水龙. 三维数据归档策略研究[J]. 档案学研究, 2018, 32(6): 101-104.
[12] 陈永生, 苏焕宁, 杨茜茜, 王沐晖. 基于互联网政务服务平台的文件归档与管理:事由观[J]. 档案学研究, 2018, 32(2): 4-13.
[13] 孙大东. 中国档案学范式尚未形成——基于批判性视域的考量[J]. 档案学研究, 2016, 30(5): 16-20.
[14] 丁海斌, 周晓芸. 简论档案人化自然[J]. 档案学研究, 2014, 28(3): 4-8.
[15] 宋魏巍. 欧洲大陆国家档案鉴定理论与鉴定方法论发展述评[J]. 档案学研究, 2013, 27(3): 81-86.