TwitterVane Administrators Guide (open access)

This document serves as the TwitterVane Administrators Guide. It describes how to control, monitor, and maintain Tweet Stream and Tweet processing for the TwitterVane web application.
Date: February 21, 2013
Creator: unknown
Object Type: Text
System: The UNT Digital Library
IIPC Preservation Working Group Table of Threats and Potential Solutions (open access)

This text contains a table outlining threats to long-term preservation efforts, and potential standards, tools, or approaches to address these threats.
Date: February 10, 2007
Creator: International Internet Preservation Consortium. Preservation Working Group
Object Type: Text
System: The UNT Digital Library
WARC implementation guidelines (open access)

This report gathers advice and best practice to help institutions designing and creating WARC files for collection management, access, preservation, and interoperability with collections from different institutions.
Date: January 27, 2009
Creator: Oury, Clément
Object Type: Report
System: The UNT Digital Library
Crowdsourcing Workshop & Use Cases (open access)

This report describes a crowdsourcing workshop held at the 2012 International Internet Preservation Consortium General Assembly. It contains a workshop report, the discussion paper "Can Crowdsourcing Play a Role in Archiving the Web?", the workshop schedule, a list of resources, questions to ask of crowdsourcing sites, crowdsourcing use case templates, and the article "The Crowd & the Library: The Agony and Ecstasy of 'Crowdsourcing' Our Cultural Heritage."
Date: May 4, 2012
Creator: Pennock, Maureen E.; Hockx-Yu, Helen & Owens, Trevor
Object Type: Report
System: The UNT Digital Library
Putting it all together: creating a unified web harvesting workflow at the Bibliothèque nationale de France (open access)

This article presents the complete web harvesting workflow at the Bibliothèque Nationale de France for the International Internet Preservation Consortium sponsored workshop "How to fit in? Integrating a web archiving program in your organisation."
Date: November 2012
Creator: Le Follic, Annick; Stirling, Peter & Wendland, Bert
Object Type: Article
System: The UNT Digital Library
Characterizing Change in Web Archiving (open access)

This report attempts to define the characteristics and dimensions of change in web content.
Date: August 27, 2004
Creator: Boyko, Andrew
Object Type: Report
System: The UNT Digital Library
Web Archiving within the KB and some preliminary results with JHove and DROID (open access)

This report documents web archiving activities within the Koninklijke Bibliotheek using JHove and DROID, and the running times of DROID and JHove tests.
Date: September 2007
Creator: Koninklijke Bibliotheek
Object Type: Report
System: The UNT Digital Library
Archiving Web Browser Plug-ins (open access)

This report explores issues related to the archiving of Web Browser Plug-ins.
Date: January 9, 2004
Creator: Bang, Sverre
Object Type: Report
System: The UNT Digital Library

Twittervane: Crowd Sourcing for Web Archiving

Presentation for the 2012 International Internet Preservation Consortium General Assembly. This presentation provides an overview and update on the Twittervane project.
Date: 2012
Creator: Hockx-Yu, Helen; Johnson, Stephen & Pennock, Maureen E.
Object Type: Presentation
System: The UNT Digital Library
Web Harvesting Survey (open access)

This document contains a survey to identify and classify many of the conditions found on web sites that influence the harvesting of content and the quality of an archival crawl.
Date: March 8, 2004
Creator: Library of Congress
Object Type: Text
System: The UNT Digital Library
Test Bed Taxonomy for Crawler (open access)

This report contains an annotated taxonomy of challenges that a web crawler may encounter online.
Date: July 2004
Creator: Boyko, Andrew; Anderson, Martha & Jones, Gina
Object Type: Report
System: The UNT Digital Library
Web Archives: The Future(s) (open access)

This report aims to stimulate further discussion among web archivists and researchers about the future ways in which web archives can be used by researchers.
Date: June 30, 2011
Creator: Meyer, Eric T.; Thomas, Arthur & Schroeder, Ralph
Object Type: Report
System: The UNT Digital Library
Prototypes related to IIPC Access Working Group Use Cases (open access)

This report provides use cases illustrating that a web archive has many types of users and that several methods of access are needed.
Date: May 2006
Creator: International Internet Preservation Consortium. Access Working Group
Object Type: Report
System: The UNT Digital Library
Harvesting Practices Report (open access)

This report summarizes the results of the International Internet Preservation Consortium (IIPC) Harvesting Practices Survey, developed to understand, analyze, and collate current Internet archiving processes and experiences among IIPC members.
Date: June 10, 2011
Creator: Mayr, Michaela
Object Type: Report
System: The UNT Digital Library
Facing the Challenge of Web Archives Preservation: the Role and Work of the IIPC Preservation Working Group (open access)

This paper documents the results of a survey about the current state of preservation in International Internet Preservation Consortium (IIPC) member web archives.
Date: October 2014
Creator: Goethals, Andrea; Oury, Clément; Pearson, David; Sierman, Barbara & Steinke, Tobias
Object Type: Paper
System: The UNT Digital Library
International Internet Preservation Consortium Strategic Plan 2016 - 2017 (open access)

This document sets forth the mission and goals of the International Internet Preservation Consortium as well as its 2016-2017 strategic plan.
Date: 2016
Creator: International Internet Preservation Consortium
Object Type: Text
System: The UNT Digital Library
Twittervane Guide (open access)

This document contains a guide for using Twittervane, a tool that can extract and analyse URLs embedded in a tweet, allowing for the capture of URLs related to a specific topic of interest in a collection.
Date: unknown
Creator: unknown
Object Type: Text
System: The UNT Digital Library
Information and documentation — Statistics and Quality Indicators for Web Archiving (open access)

This technical report defines statistical terms and quality criteria for Web archiving. It considers the needs and practices across a wide range of heritage and research organisations such as national and research libraries, archives, museums, research centres and heritage foundations.
Date: 2012
Creator: unknown
Object Type: Report
System: The UNT Digital Library
A Vision of the Role and Future of Web Archives (open access)

This text was presented at the 2012 General Assembly of the International Internet Preservation Consortium, and appears as a three-part blog post in The Signal, a blog hosted by the Library of Congress. It discusses the role and future of web archives.
Date: 2012
Creator: Leetaru, Kalev H.
Object Type: Text
System: The UNT Digital Library
Web Harvesting Survey (open access)

This report contains a survey of the conditions found on web sites that influence the harvesting of content and the quality of an archival crawl.
Date: July 2004
Creator: Marill, Jennifer; Boyko, Andrew; Ashenfelder, Michael & Jones, Gina
Object Type: Report
System: The UNT Digital Library
Internet Archives Compatibility Initiative (open access)

This report describes a set of definitions and criteria to ensure practical object compatibility for a web archive storage standard.
Date: January 2016
Creator: Organisation Information Archivierung
Object Type: Report
System: The UNT Digital Library
Web Archive Profiling Via Sampling Final Report (open access)

This report covers the results, deliverables, and ongoing status of the International Internet Preservation Consortium (IIPC) funded project "Web Archive Profiling Via Sampling" with links to code, datasets, presentations, and papers as appropriate.
Date: September 16, 2016
Creator: Alam, Sawood; Nelson, Michael L.; Van de Sompel, Herbert; Balakireva, Lyudmila; Shankar, Harihar; Bornand, Nicolas J. et al.
Object Type: Report
System: The UNT Digital Library
Evaluating Twittervane: Project Final Report (open access)

This report provides the final update on the Twittervane project, a prototype application capable of collecting and analyzing Twitter feeds and outputting URLs mentioned in the Tweets.
Date: June 16, 2013
Creator: Pitt, Mary & Hockx-Yu, Helen
Object Type: Report
System: The UNT Digital Library
TwitterVane Installation Guide (open access)

This document serves as the TwitterVane Installation Guide. It describes how to install, configure, and deploy the TwitterVane web application.
Date: February 19, 2013
Creator: unknown
Object Type: Text
System: The UNT Digital Library