Extracting "Documents" from Web Archives

Presentation given at the 2019 Texas Conference on Digital Libraries in Austin, Texas. This presentation discusses an IMLS-funded research grant to use machine learning techniques to help identify high-value publications from web archives.
Date: May 22, 2019
Creator: Phillips, Mark Edward; Caragea, Cornelia; Patel, Krutarth & Fox, Nathaniel T.
System: The UNT Digital Library

Web archives: A preliminary exploration of user expectations vs. reality

Presentation for the Workshop on Web Archiving and Digital Libraries (WADL) as part of the 2017 ACM/IEEE-CS Joint Conference on Digital Libraries (JCDL). This presentation discusses a paper examining how users perceive the process of web archiving.
Date: June 22, 2017
Creator: Reyes Ayala, Brenda
System: The UNT Digital Library

Leveraging Machine Learning to Extract Content-Rich Publications from Web Archives

Poster presented at the 2019 Texas Conference on Digital Libraries (TCDL-2019). This poster discusses ways of identifying content-rich documents among the wealth of materials available via web archives. This research attempts to answer the following two research questions: 1. What role do web-published documents and publications play in developing collections in the broad categories of institutional repositories, state government documents, and publications from the federal government? 2. What are the characteristics of web-published documents and publications that help content selectors identify them for inclusion in their local collection?
Date: May 22, 2019
Creator: Fox, Nathaniel T. & Phillips, Mark Edward
System: The UNT Digital Library

Exploratory Analysis of the End of Term Web Archive: Comparing Two Collections

This presentation describes a comparison of two web archives of the US Federal web domain collected at the end of the Bush administration (2008) and the end of the first Obama administration (2012). This exploratory analysis tracks changes in media types, TLDs, domains, and subdomains.
Date: June 22, 2016
Creator: Phillips, Mark Edward; Chudnov, Dan & Jacobs, James R.
System: The UNT Digital Library

CyberCemetery: Archiving Historically Significant Federal Websites

Presentation for the 2015 Society of Southwest Archivists Annual Meeting. This presentation discusses the CyberCemetery and archiving historically significant federal websites.
Date: May 22, 2015
Creator: Phillips, Mark Edward
System: The UNT Digital Library

Preserving Public Government Information: The End of Term Web Archive

Presentation for the 2013 International Internet Preservation Consortium General Assembly. Details the challenges and goals of the Library of Congress project to create end of term archives for government websites.
Date: April 22, 2013
Creator: Grotke, Abigail & Carpenter, Kris
System: The UNT Digital Library

Texas Borderlands Newspaper Collection: Newspaper Preservation and Access, One Page at a Time

Presentation given at the Texas Conference on Digital Libraries 2019 in Austin, Texas. This presentation discusses the Texas Digital Newspaper Program (TDNP) and the Texas Borderlands Newspaper Collection, a project funded by the TexTreasures program of the Texas State Library and Archives Commission (TSLAC).
Date: May 22, 2019
Creator: Krahmer, Ana & Phillips, Mark Edward
System: The UNT Digital Library

Preserving the Archival Histories of Space Flight

Presentation for the 2017 Digital Frontiers Conference. This presentation discusses the data management strategies and practices for building an archive for space history.
Date: September 22, 2017
Creator: Coopersmith, Jonathan
System: The UNT Digital Library

Memory in Uncertainty: Web Preservation in the Polycrisis - A New Design Congress Report

This report was shared in relation to a presentation for the IIPC General Assembly and Web Archiving Conference held on May 10-12, 2023 in Hilversum, Netherlands. This report presents the findings of a study by The New Design Congress to investigate web archive practices, tools, integrity, and security. Their research finds that the field of web archiving and its landscape of tools and institutions are out of step with the realities of rising instability and complexity of the 21st century.
Date: November 22, 2022
Creator: The New Design Congress
System: The UNT Digital Library

A Partnership Born of Urgency and Civic Responsibility: Preserving Access to Government Websites Through the CyberCemetery

This presentation discusses preserving access to government websites through the CyberCemetery. It includes information about what the CyberCemetery is, its purpose, the development, archival process, technical details, users by country, types of content, and using the CyberCemetery.
Date: April 22, 2010
Creator: Hoffman, Starr
System: The UNT Digital Library

New member presentations - Old Dominion University

Presentation for the 2013 International Internet Preservation Consortium General Assembly. Discusses the goals and projects of the web archiving program at Old Dominion University.
Date: April 22, 2013
Creator: Nelson, Michael L.
System: The UNT Digital Library

IIPC 2013 General Assembly - New Member Presentation for Valério Pereira da Silva

Presentation for the 2013 International Internet Preservation Consortium General Assembly. Presentation introduces Organisation Information Archivierung, a new IIPC member, and describes who they are and their current projects.
Date: April 22, 2013
Creator: da Silva, Valério Pereira
System: The UNT Digital Library

Re-embodying Data: Encountering the "forgotten pandemic" of 1918

Presentation for the 2017 Digital Frontiers Conference. This presentation discusses an installation to "re-embody" pandemic data, including the technologies, methods, and data used to construct the encounter.
Date: September 22, 2017
Creator: Grumbach, Elizabeth & Wernimont, Jacqueline
System: The UNT Digital Library

ArcLink: Additional API support for Wayback Machines

Presentation for the 2013 International Internet Preservation Consortium General Assembly. Discusses the functionality of ArcLink, a Wayback Machine extension tool.
Date: April 22, 2013
Creator: AlSum, Ahmed
System: The UNT Digital Library

Organizational Practices: A Digital Repository's Perspective

This presentation examines collection organization and preservation from the perspective of The Portal to Texas History, a digital repository for cultural heritage and historic materials from across the state of Texas. It focuses on how knowing one’s organizational scope, mission, and collection goals can help shape materials management.
Date: April 22, 2020
Creator: Mangum, Jake & McIntosh, Marcia
System: The UNT Digital Library

IIPC 2013 General Assembly - Officer Updates

Presentation for the 2013 International Internet Preservation Consortium General Assembly. This presentation features updates about the state of the IIPC's programs, projects, and membership from consortium officers.
Date: April 22, 2013
Creator: Binns, Aaron; Oury, Clément & Potter, Abbey
System: The UNT Digital Library

Member Updates: National Diet Library, Japan

Presentation for the 2013 International Internet Preservation Consortium General Assembly. Summarizes the current status of ongoing projects at the National Diet Library in Japan.
Date: April 22, 2013
Creator: Shimura, Tsutomu
System: The UNT Digital Library