Tropical Storm Imelda Twitter Dataset

This dataset contains Twitter JSON data for Tweets related to Tropical Storm Imelda and the subsequent flooding in the south Texas region. This dataset was created using the twarc (https://github.com/DocNow/twarc) package that makes use of Twitter's search API. A total of 76,420 Tweets and 4,429 media files make up the combined dataset.
Date: 2019-09-10/2019-09-21
Creator: Phillips, Mark Edward
System: The UNT Digital Library

Hurricane Dorian Twitter Dataset

This dataset contains Twitter JSON data for Tweets related to Hurricane Dorian, the most intense tropical cyclone on record to strike the Bahamas and regarded as the worst natural disaster in the country's history. This dataset was created using the twarc (https://github.com/DocNow/twarc) package that makes use of Twitter's search API. A total of 3,000,553 Tweets and 84,216 media files make up the combined dataset.
Date: 2019-08-25/2019-09-14
Creator: Phillips, Mark Edward
System: The UNT Digital Library

[Response Data: Survey of Benchmarks in Metadata Quality]

Complete, anonymized dataset of responses to the Survey of Benchmarks in Metadata Quality. Date, time, IP addresses, and geographic data have been omitted. Responses that included project, organization, and/or repository names were removed from this data, as well as potentially identifying names, acronyms, and/or links.
Date: July 2019
Creator: Digital Library Federation. Assessment Interest Group. Metadata Working Group. Benchmarks Sub-Group.
System: The UNT Digital Library

Water Quality Corridor Management for Restoration (WQCM-R) Modeling Dataset

The dataset supports research to develop a spatially explicit model that prioritizes riparian areas by their potential for ecosystem restoration, specifically restoration that improves water quality downstream of the riparian area and ultimately improves drinking water quality. The model was developed and then tested on the Lewisville Lake watershed (north central Texas, just north of Dallas, Texas, USA). The dataset contains environmental data for 90 sub-watersheds that form the overall Lewisville Lake watershed, along with a corresponding identification map.
Date: June 10, 2019
Creator: Atkinson, Samuel F.
System: The UNT Digital Library

Notre Dame Cathedral Fire Dataset

This dataset contains Twitter JSON data for Tweets related to the fire at Notre Dame Cathedral in Paris, France. This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 8,046,185 Tweets and 163,055 media files make up the combined dataset.
Date: 2019-04-08/2019-04-29
Creator: Phillips, Mark Edward
System: The UNT Digital Library

Gaming Census Dataset

This dataset represents survey feedback gathered about games in libraries, collections, cataloging, outreach, and programming.
Date: December 3, 2018
Creator: Brannon, Sian; Robson, Diane & Dewitt-Miller, Erin
System: The UNT Digital Library

University of North Texas Libraries Serials Transparency List FY 2017-2018

This dataset contains information regarding subscriptions purchased by UNT Libraries, along with pricing information for the 2013-14, 2014-15, 2015-16, 2016-17, and 2017-18 fiscal years.
Date: October 31, 2018
Creator: University of North Texas. Libraries. Collection Development.
System: The UNT Digital Library

2018 Texas Senate Debate Twitter Dataset

This dataset contains Twitter JSON data for Tweets related to the United States Senate race between Beto O'Rourke and Ted Cruz. This dataset contains Tweets captured around their first debate on September 21, 2018. This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 3,006,198 Tweets and 101,050 media files make up the combined dataset.
Date: 2018-09-12/2018-10-03
Creator: Phillips, Mark Edward
System: The UNT Digital Library

The Portal to Texas History's Texas State Publications Collection Dataset

This dataset contains a set of 2,448 PDF files from the Texas State Publications collection in The Portal to Texas History.
Date: September 12, 2018
Creator: Phillips, Mark Edward
System: The UNT Digital Library

Unlabeled PDF Dataset of Technical Reports USDA.gov domain in the EOT 2008 Web Archive

This dataset contains a sample of 10,000 PDF documents from the usda.gov domain in the End of Term (EOT) 2008 Web Archive. These samples are unlabeled and uncategorized.
Date: September 12, 2018
Creator: Phillips, Mark Edward
System: The UNT Digital Library

UNT Scholarly Works PDF Dataset

This dataset contains a set of 4,534 PDF files from the UNT Scholarly Works collection, the institutional repository for UNT in the UNT Digital Library.
Date: September 12, 2018
Creator: Phillips, Mark Edward
System: The UNT Digital Library

Hurricane Florence Twitter Dataset

This dataset contains Twitter JSON data for Tweets related to Hurricane Florence and the subsequent flooding along the Carolina coastal region. This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 4,971,575 Tweets and 347,205 media files make up the combined dataset.
Date: 2018-09-05/2018-10-03
Creator: Phillips, Mark Edward
System: The UNT Digital Library

Labeled PDF Dataset from End of Term (EOT) 2008 Web Archive

This dataset contains a random sample of 2000 PDF documents from the usda.gov domain in the End of Term (EOT) 2008 Web Archive. These samples were categorized as being of interest for possible inclusion in the Technical Report Archive and Image Library (TRAIL). Each PDF has been sorted into one of two categories, Technical_Report and Not_Technical_Report.
Date: July 2018
Creator: Kirkwood, Patricia; Phillips, Mark Edward & Caldwell, Christopher
System: The UNT Digital Library

Labeled PDF Dataset from Texas Records and Information Locator (TRAIL) Web Archive

This dataset contains a random sample of 2000 PDF documents from the Texas Records and Information Locator (TRAIL) Web Archive from the Texas State Library and Archives Commission. Each PDF has been sorted into one of two categories, TX_Pub_In_Scope and Not_TX_Pub.
Date: July 2018
Creator: Tarver, Hannah & Phillips, Mark Edward
System: The UNT Digital Library

University of North Texas Libraries Serials Transparency List

Dataset containing information regarding subscriptions purchased by UNT Libraries, along with pricing information for the 2013-14, 2014-15, and 2015-16 fiscal years.
Date: April 2018
Creator: University of North Texas. Libraries. Collection Development.
System: The UNT Digital Library

Labeled PDF Dataset from UNT.edu

This dataset contains a random sample of 2000 PDF documents from the Spring 2017 Web Archive of the unt.edu domain (https://digital.library.unt.edu/ark:/67531/metadc993363/) that have been sorted into two categories, ForRepo and NotForRepo.
Date: November 15, 2017
Creator: Andrews, Pamela & Phillips, Mark Edward
System: The UNT Digital Library

#DescribeTrumpWithOneWord Twitter Dataset

This dataset contains Twitter JSON data for Tweets related to the hashtag #DescribeTrumpWithOneWord. This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 15,676 Tweets make up the combined dataset.
Date: 2017-09-02/2017-09-22
Creator: Phillips, Mark Edward
System: The UNT Digital Library

Hurricane Harvey Twitter Dataset

This dataset contains Twitter JSON data for Tweets related to Hurricane Harvey and the subsequent flooding along the Texas gulf region. This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 7,041,866 Tweets make up the combined dataset.
Date: 2017-08-18/2017-09-22
Creator: Phillips, Mark Edward
System: The UNT Digital Library

Badlands National Park Twitter Dataset

This dataset contains Twitter JSON data for Tweets related to posts by the Badlands National Park (BadlandsNPS) account about climate change and the Trump administration. The data was collected from a few days before the phenomenon on Twitter through the days following it. This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 321,821 Tweets make up the combined dataset.
Date: 2017-01-15/2017-01-29
Creator: Phillips, Mark Edward
System: The UNT Digital Library

#Kaepernick7 and #ISupportKaepernickBecause Twitter Dataset

This dataset contains Twitter JSON data for Tweets related to the hashtags #Kaepernick7 and #ISupportKaepernickBecause. This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 573,379 Tweets make up the combined dataset.
Date: 2016-08-20/2016-08-31
Creator: Phillips, Mark Edward
System: The UNT Digital Library

2016 Democratic National Convention in Philadelphia Twitter Dataset

This dataset is comprised of tweets that are related to the 2016 Democratic National Convention in Philadelphia, Pennsylvania, which took place on July 25–28, 2016. This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 15,676 Tweets make up the combined dataset.
Date: 2016-07-15/2016-08-01
Creator: Phillips, Mark Edward
System: The UNT Digital Library

Dallas Police Shooting Twitter Dataset

This dataset contains Twitter JSON data for several Twitter search queries that were collected the week following the shooting of police officers in Dallas, Texas on July 7, 2016, using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 7,146,993 Tweets make up the combined dataset.
Date: 2016-07-05/2016-07-14
Creator: Phillips, Mark Edward
System: The UNT Digital Library

Succession Planning Through Mentoring in the Library Survey [Dataset]

This dataset shows the results from a succession planning and mentoring survey conducted by UNT Libraries.
Date: May 8, 2016
Creator: Leuzinger, Julie & Rowe, Jennifer
System: The UNT Digital Library

Portal to Texas History Newspaper OCR Text Dataset: Abilene

Dataset of OCR text from The Portal to Texas History and the Texas Digital Newspaper Program. This dataset includes titles from Abilene, Texas from the years 1888 to 1923. Titles in this dataset include: Abilene Daily Reporter, Abilene Morning Reporter, Abilene Semi-Weekly Farm Reporter, Abilene Semi-Weekly Reporter, Abilene Weekly Reporter, The Abilene Reporter, The Abilene Semi-Weekly Reporter, and the Abilene Weekly Reporter. In all there are 7,208 issues comprising 62,871 pages of text.
Date: November 12, 2015
Creator: Phillips, Mark Edward
System: The UNT Digital Library