Improving Access to Web Archives through Innovative Analysis of PDF Content (open access)

This paper discusses improving access to web archives through innovative analysis of PDF content. It outlines the overall workflow and describes the tools used to extract document features. The findings suggest opportunities to develop retrieval tools that provide new ways of selecting content and building collections from large web archives.
Date: April 2013
Creator: Phillips, Mark Edward & Murray, Kathleen R.
System: The UNT Digital Library