WARC implementation guidelines (open access)

WARC implementation guidelines

This report gathers advice and best practice to help institutions designing and creating WARC files for collection management, access, preservation, and interoperability with collections from different institutions.
Date: January 27, 2009
Creator: Oury, Clément
System: The UNT Digital Library
Crowdsourcing Workshop & Use Cases (open access)

Crowdsourcing Workshop & Use Cases

This report describes a crowdsourcing workshop at the 2012 International Internet Preservation Consortium General Assembly. It contains a workshop report, the discussion paper "Can Crowdsourcing Play a Role in Archiving the Web?", a workshop schedule, a list of resources, questions to ask of crowdsourcing sites, crowdsourcing use case templates, and the article "The Crowd & the Library: The Agony and Ecstasy of 'Crowdsourcing' Our Cultural Heritage."
Date: May 4, 2012
Creator: Pennock, Maureen E.; Hockx-Yu, Helen & Owens, Trevor
System: The UNT Digital Library
Characterizing Change in Web Archiving (open access)

Characterizing Change in Web Archiving

This report attempts to define the characteristics and dimensions of change in web content.
Date: August 27, 2004
Creator: Boyko, Andrew
System: The UNT Digital Library
Web Archiving within the KB and some preliminary results with JHove and DROID (open access)

Web Archiving within the KB and some preliminary results with JHove and DROID

This report documents web archiving activities within the Koninklijke Bibliotheek using JHove and DROID, and the running times of DROID and JHove tests.
Date: September 2007
Creator: Koninklijke Bibliotheek
System: The UNT Digital Library
Archiving Web Browser Plug-ins (open access)

Archiving Web Browser Plug-ins

This report explores issues related to the archiving of Web Browser Plug-ins.
Date: January 9, 2004
Creator: Bang, Sverre
System: The UNT Digital Library
Test Bed Taxonomy for Crawler (open access)

Test Bed Taxonomy for Crawler

This report contains an annotated taxonomy of challenges that a web crawler may encounter online.
Date: July 2004
Creator: Boyko, Andrew; Anderson, Martha & Jones, Gina
System: The UNT Digital Library
Web Archives: The Future(s) (open access)

Web Archives: The Future(s)

This report aims to stimulate further discussion among web archivists and researchers about the future ways in which web archives can be used by researchers.
Date: June 30, 2011
Creator: Meyer, Eric T.; Thomas, Arthur & Schroeder, Ralph
System: The UNT Digital Library
Prototypes related to IIPC Access Working Group Use Cases (open access)

Prototypes related to IIPC Access Working Group Use Cases

This report provides use cases illustrating that a web archive has many types of users and that several methods for access are needed.
Date: May 2006
Creator: International Internet Preservation Consortium. Access Working Group
System: The UNT Digital Library
Harvesting Practices Report (open access)

Harvesting Practices Report

This report summarizes the results of the International Internet Preservation Consortium (IIPC) Harvesting Practices Survey, developed to understand, analyze, and collate the current Internet archiving processes and experiences amongst IIPC members.
Date: June 10, 2011
Creator: Mayr, Michaela
System: The UNT Digital Library
Information and documentation — Statistics and Quality Indicators for Web Archiving (open access)

Information and documentation — Statistics and Quality Indicators for Web Archiving

This technical report defines statistical terms and quality criteria for Web archiving. It considers the needs and practices across a wide range of heritage and research organisations such as national and research libraries, archives, museums, research centres and heritage foundations.
Date: 2012
Creator: unknown
System: The UNT Digital Library
Web Harvesting Survey (open access)

Web Harvesting Survey

This report contains a survey of the conditions found on web sites that influence the harvesting of content and the quality of an archival crawl.
Date: July 2004
Creator: Marill, Jennifer; Boyko, Andrew; Ashenfelder, Michael & Jones, Gina
System: The UNT Digital Library
Internet Archives Compatibility Initiative (open access)

Internet Archives Compatibility Initiative

This report describes a set of definitions and criteria to ensure practical object compatibility for a web archive storage standard.
Date: January 2016
Creator: Organisation Information Archivierung
System: The UNT Digital Library
Web Archive Profiling Via Sampling Final Report (open access)

Web Archive Profiling Via Sampling Final Report

This report covers the results, deliverables, and ongoing status of the International Internet Preservation Consortium (IIPC) funded project "Web Archive Profiling Via Sampling" with links to code, datasets, presentations, and papers as appropriate.
Date: September 16, 2016
Creator: Alam, Sawood; Nelson, Michael L.; Van de Sompel, Herbert; Balakireva, Lyudmila; Shankar, Harihar; Bornand, Nicolas J. et al.
System: The UNT Digital Library
Evaluating Twittervane: Project Final Report (open access)

Evaluating Twittervane: Project Final Report

This report provides the final update on the Twittervane project, a prototype application capable of collecting and analyzing Twitter feeds and outputting URLs mentioned in the Tweets.
Date: June 16, 2013
Creator: Pitt, Mary & Hockx-Yu, Helen
System: The UNT Digital Library
International Internet Preservation Consortium Quarterly highlight report: November - December 2010 (open access)

International Internet Preservation Consortium Quarterly highlight report: November - December 2010

This report summarizes the activities of the International Internet Preservation Consortium Preservation Working Group from the fourth quarter of 2010.
Date: January 2011
Creator: International Internet Preservation Consortium. Preservation Working Group
System: The UNT Digital Library
Archival data format requirements (open access)

Archival data format requirements

This report describes requirements for an archival storage format suited to preserve collections of internet data.
Date: unknown
Creator: Sloth Christensen, Steen
System: The UNT Digital Library
Preserving Access – Making More Informed Guesses About What Works (open access)

Preserving Access – Making More Informed Guesses About What Works

This report discusses the effect of progressive technological change on current and future access to web archives, using the PANDORA web archive as a primary case study.
Date: November 23, 2009
Creator: Davis, Maxine
System: The UNT Digital Library
IIPC Web Archiving Metadata Set (open access)

IIPC Web Archiving Metadata Set

This report presents a set of metadata elements for web archiving.
Date: November 9, 2004
Creator: Masanès, Julien
System: The UNT Digital Library
Long-Term Preservation of Web Archives - Experimenting With Emulation and Migration Methodologies (open access)

Long-Term Preservation of Web Archives - Experimenting With Emulation and Migration Methodologies

This report describes a project to evaluate emulation and migration as long-term preservation solutions for web archives.
Date: December 10, 2009
Creator: Stawowczyk Long, Andrew
System: The UNT Digital Library
Bit Preservation Specifications (open access)

Bit Preservation Specifications

This report provides specifications for a bit preservation system for the archive storage of the data that represents and is associated with the digital objects over which a library has custodianship.
Date: October 26, 2005
Creator: Hafken, David; Hamidzadeh, Babak; Littman, Justin & Madden, Elizabeth
System: The UNT Digital Library
How to Fit In? Integrating a Web Archiving Program in Your Organization: Workshop Report and Evaluation (open access)

How to Fit In? Integrating a Web Archiving Program in Your Organization: Workshop Report and Evaluation

Report on the 2012 International Internet Preservation Consortium-sponsored workshop "How to fit in? Integrate a web archiving program in your organization." This report summarizes the details and information shared at the workshop.
Date: April 2013
Creator: Stirling, Peter & Kõuts, Jaanus
System: The UNT Digital Library
Live Archiving Proxy Project Closure (open access)

Live Archiving Proxy Project Closure

This report documents the closing of the Live Archiving Proxy Project to build an HTTP proxy that is able to capture the traffic that flows through it and delegate the handling of the captured data to a writer using a simple network protocol. This report summarizes the project and its deliverables, and includes feedback on the project from the British Library, the Internet Memory Foundation, and Netarkivet.dk.
Date: June 19, 2013
Creator: unknown
System: The UNT Digital Library
JHoNas final report: Foster WARC usage in scalable Web Archiving workflows using Jhove2 and NetarchiveSuite (open access)

JHoNas final report: Foster WARC usage in scalable Web Archiving workflows using Jhove2 and NetarchiveSuite

This report documents the work done in connection with the JHoNas project. The overall goal of the JHoNas project is to enhance existing tools in order to ease the adoption of WARC as the preferred archiving format for digital preservation.
Date: 2016
Creator: Clarke, Nicholas
System: The UNT Digital Library
IIPC Preservation Working Group Report to IIPC Steering Committee (open access)

IIPC Preservation Working Group Report to IIPC Steering Committee

This report summarizes the formation of the Preservation Workgroup and the work it conducted between June and December 2007.
Date: December 11, 2007
Creator: International Internet Preservation Consortium. Preservation Working Group
System: The UNT Digital Library