7 Matching Results

[Dataset of Web Archiving Research Articles]

Datasets used in the presentation, "Towards Building a Collection of Web Archiving Research Articles." The files included here were used to conduct several machine learning classification experiments that resulted in a corpus of scholarly research articles on the topic of web archiving.
Date: August 2014
Creator: Reyes Ayala, Brenda & Caragea, Cornelia
System: The UNT Digital Library

N. W. Ayer & Son's American Newspaper Annual and Directory OCR Text Dataset

Dataset of OCR text from N. W. Ayer & Son's American Newspaper Annual and Directory. This dataset includes volumes covering the years 1910 to 1922. In all, there are 25 volumes comprising 16,669 pages of text.
Date: August 22, 2018
Creator: Andrews, Pamela
System: The UNT Digital Library

Link Resolver Testing

This Excel file accompanies a workshop presentation titled "Is it really that bad? Verifying the extent of full-text linking problems."
Date: August 9, 2012
Creator: Harker, Karen
System: The UNT Digital Library

Hurricane Harvey Twitter Dataset

This dataset contains Twitter JSON data for Tweets related to Hurricane Harvey and the subsequent flooding along the Texas Gulf region. This dataset was created using the twarc (https://github.com/edsu/twarc) package, which makes use of Twitter's search API. A total of 7,041,866 Tweets make up the combined dataset.
Date: 2017-08-18/2017-09-22
Creator: Phillips, Mark Edward
System: The UNT Digital Library

#Kaepernick7 and #ISupportKaepernickBecause Twitter Dataset

This dataset contains Twitter JSON data for Tweets related to the hashtags #Kaepernick7 and #ISupportKaepernickBecause. This dataset was created using the twarc (https://github.com/edsu/twarc) package, which makes use of Twitter's search API. A total of 573,379 Tweets make up the combined dataset.
Date: 2016-08-20/2016-08-31
Creator: Phillips, Mark Edward
System: The UNT Digital Library

Ethics Gaming Survey Results

Dataset generated for a National Science Foundation grant project, "EAGER: Prototyping a Virtue Ethics Game." These files contain the research results of the pre-test and post-test surveys.
Date: August 29, 2013
Creator: Oppong, Joseph R.
System: The UNT Digital Library

Hurricane Dorian Twitter Dataset

This dataset contains Twitter JSON data for Tweets related to Hurricane Dorian, which is the most intense tropical cyclone on record to strike the Bahamas and is regarded as the worst natural disaster in the country's history. This dataset was created using the twarc (https://github.com/DocNow/twarc) package, which makes use of Twitter's search API. A total of 3,000,553 Tweets and 84,216 media files make up the combined dataset.
Date: 2019-08-25/2019-09-14
Creator: Phillips, Mark Edward
System: The UNT Digital Library