Portal to Texas History Newspaper OCR Text Dataset: Houston

Dataset of OCR text from The Portal to Texas History and the Texas Digital Newspaper Program. This dataset includes titles from Houston, Texas from the years 1893 to 1924. Titles included in this dataset include: The Houston Daily Post and The Houston Post. In all there are 9,855 issues comprised of 184,900 pages of text.
Date: November 12, 2015
Creator: Phillips, Mark Edward
System: The UNT Digital Library

Portal to Texas History Newspaper OCR Text Dataset: Fort Worth

Dataset of OCR text from The Portal to Texas History and the Texas Digital Newspaper Program. This dataset includes titles from Fort Worth Texas from the years 1883 to 1896. Titles included in this dataset include: Fort Worth Daily Gazette, Fort Worth Gazette, and Fort Worth Weekly Gazette. In all there are 4,146 issues comprised of 36,199 pages of text.
Date: November 12, 2015
Creator: Phillips, Mark Edward
System: The UNT Digital Library

Portal to Texas History Newspaper OCR Text Dataset: El Paso

Dataset of OCR text from The Portal to Texas History and the Texas Digital Newspaper Program. This dataset includes titles from El Paso Texas from the years 1881 to 1921. Titles included in this dataset include: El Paso Daily Herald, El Paso Daily Times, El Paso Herald, El Paso International Daily Times, El Paso Morning Times, El Paso Sunday Times, El Paso Times, The El Paso Daily Times, and The El Paso Time. In all there are 17,104 issues comprised of 177,640 pages of text.
Date: November 12, 2015
Creator: Phillips, Mark Edward
System: The UNT Digital Library

Dallas Police Shooting Twitter Dataset

This dataset contains Twitter JSON data for several Twitter search queries that were collected the week following the shooting of police officers in Dallas, Texas on July 7th 2017, using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 7,146,993 Tweets make up the combined dataset.
Date: 2016-07-05/2016-07-14
Creator: Phillips, Mark Edward
System: The UNT Digital Library

Hurricane Harvey Twitter Dataset

This dataset contains Twitter JSON data for Tweets related to Hurricane Harvey and the subsequent flooding along the Texas gulf region. This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 7,041,866 Tweets make up the combined dataset.
Date: 2017-08-18/2017-09-22
Creator: Phillips, Mark Edward
System: The UNT Digital Library

2018 Texas Sentate Debate Twitter Dataset

This dataset contains Twitter JSON data for Tweets related to the United States Senate race between Beto O'Rourke and Ted Cruz. This dataset contains Tweets captured around their first debate on September 21, 2018. This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 3,006,198 Tweets and 101,050 media files make up the combined dataset.
Date: 2018-09-12/2018-10-03
Creator: Phillips, Mark Edward
System: The UNT Digital Library

Water Quality Corridor Management for Restoration (WQCM-R) Modeling Dataset

The dataset was developed to support research intended to develop a spatially-explicit model that prioritizes riparian areas in terms of potential for ecosystem restoration specifically to improve water quality downstream of the riparian area, and ultimately improve drinking water quality. The model was developed and then tested on the Lewisville Lake watershed (north central Texas, just north of Dallas, Texas, USA). The dataset contains environmental data for 90 sub-watersheds that form the overall Lewisville Lake watershed with a corresponding identification map.
Date: June 10, 2019
Creator: Atkinson, Samuel F.
System: The UNT Digital Library

Tropical Storm Imelda Twitter Dataset

This dataset contains Twitter JSON data for Tweets related to Tropical Storm Imelda and the subsequent flooding in the south Texas region. This dataset was created using the twarc (https://github.com/DocNow/twarc) package that makes use of Twitter's search API. A total of 76,420 Tweets and 4,429 media files make up the combined dataset.
Date: 2019-09-10/2019-09-21
Creator: Phillips, Mark Edward
System: The UNT Digital Library

ERCOT/2021 Texas Power Crisis Twitter Dataset

This dataset contains Twitter JSON data for Tweets related to the This dataseic Reliability Countil of Texas (ERCOT) during the 2021 Texas power crisis from February 10th, thru February 27th, 2021. The dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 612,082 Tweets make up the combined dataset.
Date: 2021-02-09/2021-02-24
Creator: Phillips, Mark Edward
System: The UNT Digital Library

Hurricane Ida Twitter Dataset

This dataset contains Twitter JSON data for Tweets related to Hurricane Ida which was a deadly and distructive Category 4 Atlantic hurricane that made landfall in Lousiana in 2021. This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 1,868,703 Tweets make up the combined dataset.
Date: 2021-08-20/2021-09-22
Creator: Phillips, Mark Edward
System: The UNT Digital Library

Hurricane Laura Twitter Dataset

This dataset contains Twitter JSON data for Tweets related to Hurricane Laura that formed August 20, 2020 and dissipated August 29, 2020. This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 1,168,178 Tweets make up the combined dataset.
Date: 2020-08-18/2020-09-02
Creator: Phillips, Mark Edward
System: The UNT Digital Library