Resource Type

Language

Portal to Texas History Newspaper OCR Text Dataset: Bryan

Dataset of OCR text from The Portal to Texas History and the Texas Digital Newspaper Program. This dataset includes titles from Bryan Texas from the years 1883 to 1922. Titles included in this dataset include: Bryan Daily Eagle, Bryan Daily Eagle and Pilot, Bryan Morning Eagle, Bryan Morning Eagle and Pilot, The Brazos Weekly Pilot, The Bryan Daily Eagle, The Bryan Eagle, and The Bryan Weekly Eagle and Pilot . In all there are 5,843 issues comprised of 27,360 pages of text.
Date: November 12, 2015
Creator: Phillips, Mark Edward
System: The UNT Digital Library

Portal to Texas History Newspaper OCR Text Dataset: Houston

Dataset of OCR text from The Portal to Texas History and the Texas Digital Newspaper Program. This dataset includes titles from Houston, Texas from the years 1893 to 1924. Titles included in this dataset include: The Houston Daily Post and The Houston Post. In all there are 9,855 issues comprised of 184,900 pages of text.
Date: November 12, 2015
Creator: Phillips, Mark Edward
System: The UNT Digital Library

Portal to Texas History Newspaper OCR Text Dataset: Fort Worth

Dataset of OCR text from The Portal to Texas History and the Texas Digital Newspaper Program. This dataset includes titles from Fort Worth Texas from the years 1883 to 1896. Titles included in this dataset include: Fort Worth Daily Gazette, Fort Worth Gazette, and Fort Worth Weekly Gazette. In all there are 4,146 issues comprised of 36,199 pages of text.
Date: November 12, 2015
Creator: Phillips, Mark Edward
System: The UNT Digital Library

Portal to Texas History Newspaper OCR Text Dataset: El Paso

Dataset of OCR text from The Portal to Texas History and the Texas Digital Newspaper Program. This dataset includes titles from El Paso Texas from the years 1881 to 1921. Titles included in this dataset include: El Paso Daily Herald, El Paso Daily Times, El Paso Herald, El Paso International Daily Times, El Paso Morning Times, El Paso Sunday Times, El Paso Times, The El Paso Daily Times, and The El Paso Time. In all there are 17,104 issues comprised of 177,640 pages of text.
Date: November 12, 2015
Creator: Phillips, Mark Edward
System: The UNT Digital Library

UNT Libraries Digital Collections Feedback Comments Dataset

Dataset of user feedback and comments for the UNT Libraries Digital Collections including The Portal to Texas History and the UNT Digital Library. Comments were from a legacy comment system and contain entries from 2005 to 2010.
Date: 2005-10-13/2010-01-13
Creator: Phillips, Mark Edward
System: The UNT Digital Library

[Dataset: Complete typecasting teaching tool set]

3D dataset model of a moveable hand mould, matrix, punch, and type piece for a majiscule sans serif B. This dataset includes all the files necessary to print an entire set of the typecasting teaching toolkit in one file: the two-part handmould, punch, matrix, and two pieces of type (one with the jet attached as well as a detachable jet piece and an individual type piece). The resulting 3D printed model will replicate the historical artifact used to design/cast type during the hand press period. These models are for teaching purposes only and cannot be used to cast type using molten type metal, nor can they be used for printing.
Date: April 1, 2017
Creator: Jacobs, Courtney E.; McIntosh, Marcia; O'Sullivan, Kevin M. & Strait, Bob
System: The UNT Digital Library

ERCOT/2021 Texas Power Crisis Twitter Dataset

This dataset contains Twitter JSON data for Tweets related to the This dataseic Reliability Countil of Texas (ERCOT) during the 2021 Texas power crisis from February 10th, thru February 27th, 2021. The dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 612,082 Tweets make up the combined dataset.
Date: 2021-02-09/2021-02-24
Creator: Phillips, Mark Edward
System: The UNT Digital Library

#DiaperDon Twitter Dataset

This dataset contains Twitter JSON data for Tweets related to the hashtag #DiaperDon. This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 866,987 Tweets make up the combined dataset.
Date: 2020-11-18/2020-12-01
Creator: Phillips, Mark Edward
System: The UNT Digital Library

Hurricane Ida Twitter Dataset

This dataset contains Twitter JSON data for Tweets related to Hurricane Ida which was a deadly and distructive Category 4 Atlantic hurricane that made landfall in Lousiana in 2021. This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 1,868,703 Tweets make up the combined dataset.
Date: 2021-08-20/2021-09-22
Creator: Phillips, Mark Edward
System: The UNT Digital Library

One Million Pages of Texas Newspapers: Dataset

This dataset represents the first million pages of Texas newspapers added to The Portal to Texas History as part of the Texas Digital Newspaper Program. The dataset consists of 123,184 newspaper issues from 569 titles, comprising 1,000,003 pages. Additionally the 3,349,156 item uses associated with this dataset as of April 7, 2013 are included.
Date: April 7, 2013
Creator: Phillips, Mark Edward & Hicks, William
System: The UNT Digital Library

Water Quality Corridor Management for Restoration (WQCM-R) Modeling Dataset

The dataset was developed to support research intended to develop a spatially-explicit model that prioritizes riparian areas in terms of potential for ecosystem restoration specifically to improve water quality downstream of the riparian area, and ultimately improve drinking water quality. The model was developed and then tested on the Lewisville Lake watershed (north central Texas, just north of Dallas, Texas, USA). The dataset contains environmental data for 90 sub-watersheds that form the overall Lewisville Lake watershed with a corresponding identification map.
Date: June 10, 2019
Creator: Atkinson, Samuel F.
System: The UNT Digital Library

Quality Assurance Practices in Web Archiving [Dataset]

This dataset contains the results of a survey of quality assurance practices within the field of web archiving and its practitioners. To understand current QA practices, the authors surveyed institutions engaged in web archiving, which included national libraries, colleges and universities, and museums and art libraries. The survey was administered online. It includes the completed responses of 54 participants. The data has been anonymized for privacy reasons. This dataset was used in the "Current Quality Assurance Practices in Web Archiving" paper, available from the UNT Digital Library.
Date: December 2014
Creator: Reyes Ayala, Brenda; Phillips, Mark Edward & Ko, Lauren
System: The UNT Digital Library

[Dataset: Paper Mould Version 1]

3D dataset model of a hand papermaking mould consisting of two parts: a mould frame and a deckle. In this version (v1), the mould frame and the Mould surface are printed together as one piece. The user will need to print both files to have a complete papermaking mould. The resulting 3D printed model will replicate the historical artifact used in Great Britain and parts of Europe in the nineteenth and twentieth centuries with sight variations. This papermaking mould is functional and can be used to cast a 4 ¼ x 5 ½” sheet of paper. While the size of sheet this mould produces is small, the mould frame, its ribs, and the deckle are full size as found in larger traditional European papermaking moulds.
Date: November 10, 2020
Creator: Queen, Brian
System: The UNT Digital Library

Tropical Storm Imelda Twitter Dataset

This dataset contains Twitter JSON data for Tweets related to Tropical Storm Imelda and the subsequent flooding in the south Texas region. This dataset was created using the twarc (https://github.com/DocNow/twarc) package that makes use of Twitter's search API. A total of 76,420 Tweets and 4,429 media files make up the combined dataset.
Date: 2019-09-10/2019-09-21
Creator: Phillips, Mark Edward
System: The UNT Digital Library

Hurricane Dorian Twitter Dataset

This dataset contains Twitter JSON data for Tweets related to Hurricane Dorian which is the most intense tropical cyclone on record to strike the Bahamas, and is regarded as the worst natural disaster in the country's history. This dataset was created using the twarc (https://github.com/DocNow/twarc) package that makes use of Twitter's search API. A total of 3,000,553 Tweets and 84,216 media files make up the combined dataset.
Date: 2019-08-25/2019-09-14
Creator: Phillips, Mark Edward
System: The UNT Digital Library

Meeting Science for Academic Librarians

This dataset contains results from survey of academic librarians about experiences in meetings and preferences for meeting components.
Date: August 10, 2020
Creator: Brannon, Sian & Leuzinger, Julie
System: The UNT Digital Library

Political Science Curriculum Map

This dataset provides a data analysis of how student learning objective from PSCI syllabi map to threshold concepts from the ACRL Framework for Information Literacy for Higher Education (2016) and the AAC&U Information Literacy Value Rubric (2013). The data includes non-core course for courses offered from the Fall 2017 semester to the Spring 2020 semester. This data analysis is conducted every three years. This curriculum map excludes core course previously as they were examined in the UNT Libraries Core Curriculum Map.
Date: May 11, 2020
Creator: Henson, Brea
System: The UNT Digital Library

[Response Data: Survey of Benchmarks in Metadata Quality]

Complete, anonymized dataset of responses to the Survey of Benchmarks in Metadata Quality. Date, time, IP addresses, and geographic data has been omitted. Responses that included project, organization, and/or repository names were removed from this data, as well as potentially identifying names, acronyms, and/or links.
Date: July 2019
Creator: Digital Library Federation. Assessment Interest Group. Metadata Working Group. Benchmarks Sub-Group.
System: The UNT Digital Library

Labeled PDF Dataset from Texas Records and Information Locator (TRAIL) Web Archive

This dataset contains a random sample of 2000 PDF documents from the Texas Records and Information Locator (TRAIL) Web Archive from the Texas State Library and Archives Commission. Each PDF has been sorted into two categories, TX_Pub_In_Scope and Not_TX_Pub.
Date: July 2018
Creator: Tarver, Hannah & Phillips, Mark Edward
System: The UNT Digital Library

The Portal to Texas History's Texas State Publications Collection Dataset

This dataset contains a set of 2,448 PDF files from the Texas State Publications collection in The Portal to Texas History.
Date: September 12, 2018
Creator: Phillips, Mark Edward
System: The UNT Digital Library

[Response Data: Improving Subjects in the Digital Collections with Data Survey]

Complete, anonymized dataset of responses to the "Improving Subjects in the Digital Collections with Data" survey. Date, time, IP addresses, and geographic data has been omitted.
Date: August 2021
Creator: Tarver, Hannah; Miles, Chassidy & Zipperer, Rachael
System: The UNT Digital Library

Hurricane Laura Twitter Dataset

This dataset contains Twitter JSON data for Tweets related to Hurricane Laura that formed August 20, 2020 and dissipated August 29, 2020. This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 1,168,178 Tweets make up the combined dataset.
Date: 2020-08-18/2020-09-02
Creator: Phillips, Mark Edward
System: The UNT Digital Library

Ruth Bader Ginsburg Remembrance Twitter Dataset

This dataset contains Twitter JSON data for Tweets related to the passing of Ruth Bader Ginsburg on September 18, 2020. This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 4,195,270 Tweets make up the combined dataset.
Date: 2020-09-10/2020-10-04
Creator: Phillips, Mark Edward
System: The UNT Digital Library

3D Printable Lowercase Type Setting Kit

Individual 3D dataset files for lowercase type letters a through z.
Date: January 21, 2021
Creator: Strait, Bob
System: The UNT Digital Library