Please wait a minute...
档案学研究  2022, Vol. 36 Issue (2): 69-76    DOI: 10.16065/j.cnki.issn1002-1620.2022.02.010
  档案资源建设 本期目录 | 过刊浏览 |
国外网页资源归档的协同采集模式及其启示
王璐婷
东南大学档案馆 南京 210096
Collaborative Collection of Web Archiving Abroad and Its Enlightenment
WANG Luting
Southeast University Archives, Nanjing 210096
全文: HTML    PDF(1329 KB)  
输出: BibTeX | EndNote (RIS)      
摘要: 

立足网页资源归档协同采集模式旨在推动我国网页资源归档项目可持续开展。本文通过生命周期模型对人类和机器协同工作环节进行解析,通过国外网页资源归档项目案例分析对多机构协作采集模式的实现展开讨论,最终为我国网页资源归档实践开展提出四点建议。

关键词 网页资源归档生命周期模型协同采集    
Abstract

Research on collaborative collection aims at promoting sustainable development of web archiving in China. Web archiving life cycle model is used to analyze the collaborative work between human and machine, and the realization of multi-organization collaborative collection mode is discussed based on the case study of web archiving projects abroad. Finally, four suggestions are put forward for the development of web archiving in China.

Key wordsweb archiving    life cycle model    collaborative collection
出版日期: 2023-04-18
引用本文:

王璐婷. 国外网页资源归档的协同采集模式及其启示[J]. 档案学研究, 2022, 36(2): 69-76.
WANG Luting. Collaborative Collection of Web Archiving Abroad and Its Enlightenment. Archives Science Study, 2022, 36(2): 69-76.

链接本文:

http://journal12.magtechjournal.com/Jwk_dax/CN/10.16065/j.cnki.issn1002-1620.2022.02.010      或      http://journal12.magtechjournal.com/Jwk_dax/CN/Y2022/V36/I2/69

[1] IIPC Training Session 3 Slides[EB/OL].[2021-01-19]. https://netpreserve.org/download/iipc-training-session-beginners-3-slides/.
[2] Niels Brügger. Digital Humanities in the 21st Century: Digital Material as a Driving Force[J]. Digital Humanities Quarterly, 2016(3):39-53.
[3] [22] Paul Koerbin. Operational Challenges and Innovation for National Web Archiving[EB/OL].[2021-01-19]. https://library.alia.org.au/operational-challenges-and-innovation-national-web-archiving.
[4] A Guide to Web Preservation[EB/OL].[2021-01-19]. https://jiscpowr.jiscinvolve.org/wp/files/2010/06/Guide-2010-final.pdf.
[5] Paul Koerbin. The PANDORA Digital Archiving System (PANDAS):Managing Web Archiving in Australia [EB/OL].[2021-01-19]. http://pandora.nla.gov.au/pan/21336/20080620-0137/www.nla.gov.au/nla/staffpaper/2004/koerbin2.html.
[6] The UK Government Web Archive:Guidance for Digital and Records Management Teams[EB/OL].[2021-01-19]. https://www.nationalarchives.gov.uk/documents/web-archiving-technical-guidance.pdf.
[7] Molly Bragg,Kristine Hanna. The Web Archiving Life Cycle Model[EB/OL].[2021-01-19]. http://ait.blog.archive.org/files/2014/04/archiveit_life_cycle_model.pdf.
[8] Archive-It Crawling Technology[EB/OL].[2021-01-19]. https://support.archive-it.org/hc/en-us/articles/115001081186-Archive-It-Crawling-Technology.
[9] How and When to Use Brozzler[EB/OL].[2021-01-19]. https://support.archive-it.org/hc/en-us/articles/360000351986.
[10] [27] Announcing a New Partnership:California Digital Library,UC Libraries,and Internet Archive's Archive-It Service[EB/OL].[2021-01-19]. https://cdlib.org/cdlinfo/2015/01/14/announcing-a-new-partnership-california-digital-library-uc-libraries-and-internet-archives-archive-it-service/.
[11] Novel Coronavirus Outbreak:Help Us Collect Websites[EB/OL].[2021-01-19]. https://netpreserveblog.wordpress.com/2020/02/13/cdg-collection-novel-coronavirus/.
[12] The URL Nomination Tool[EB/OL].[2021-01-19]. https://digital2.library.unt.edu/nomination/eth2020/add/.
[13] Soumen Chakrabarti, Byron Dom, Prabhakar Raghavan, Sridhar Rajagopalan, David Gibson, Jon Kleinberg. Automatic Resource Compilation by Analyzing Hyperlink Structure and Associated Text[J]. Computer Networks and ISDN Systems, 1998(1):65-74.
[14] [17] End of Term Presidential Harvest 2008[EB/OL].[2021-01-19]. http://eotarchive.cdlib.org/search?browse-all=yes.
[15] Preserving Public Government Information: The 2008 End of Term Crawl Project[EB/OL].[2021-01-19]. https://digital.library.unt.edu/ark:/67531/metadc28366/m2/1/high_res_d/eotproject_CNI_1208-2_mep.pdf.
[16] Bagit Library[EB/OL].[2021-01-19]. https://sourceforge.net/projects/loc-xferutils/.
[18] Murray, Kathleen. Improving Access to Web Archives Through Innovative Analysis of PDF Content [C]. Archiving Conference, 2013:186-192.
[19] Social Feed Manger[EB/OL].[2021-01-19]. https://gwu-libraries.github.io/sfm-ui/.
[20] Kahle Brewster. Preserving the Internet[J]. Scientific American, 1997(3):82-83.
[21] Koehler Wallace. Web Page Change and Persistence-A Four-Year Longitudinal Study[J]. Journal of the American Society for Information Science and Technology, 2002(2):162-171.
[23] 吴振新, 曲云鹏, 李成文, 等. 基于开源软件搭建网络信息资源采集与保存平台[J]. 现代图书情报技术, 2009(Z1):6-10.
[24] Tools & Software[EB/OL].[2021-01-19]. https://netpreserve.org/web-archiving/tools-and-software/.
[25] Umich Web Archiving Course[EB/OL].[2021-01-19]. https://www.si.umich.edu/programs/courses/639.
[26] Digital Preservation Coalition[EB/OL].[2021-01-19]. https://www.dpconline.org/events/past-events/beginners-web-archiving-training.
[1] 李甜. 数字管护(Digital Curation)视域下科研档案管理创新研究[J]. 档案学研究, 2021, 35(3): 113-120.