117 Matching Results

Results open in a new window/tab.

Goldfish Dataset

Dataset generated for research on goldfish.
Date: July 11, 2014
Creator: Kuprasertkul, Nina; Mehta, Sumedha & Wadawadigi, Akash
System: The UNT Digital Library

UNT Libraries Metadata Edit Dataset

This dataset contains data samples from metadata records extracted from the UNT Libraries' Digital Collections. It contains one sample per metadata record version in the system with aggregate counts of fields and also hash values of an element as well. Data was collected in March 2014 with dates from May 19, 2004 to February 4, 2014.
Date: April 2014
Creator: Phillips, Mark Edward
System: The UNT Digital Library

"Yes All Women" Twitter Dataset

This dataset contains Twitter JSON data for several Twitter search queries that were collected around the #YesAllWomen Twitter "conversation" between May 25, 2014 and June 8, 2014 using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 2,805,763 Tweets and 34,532 images make up the combined dataset.
Date: June 8, 2014
Creator: Phillips, Mark Edward
System: The UNT Digital Library

UNT Libraries Edit Event Dataset 2014

Dataset containing metadata edit events for the UNT Libraries Digital Collections from January 1, 2014 until December 31, 2014. There are a total of 94,222 samples in the dataset from 193 different metadata editors.
Date: February 2015
Creator: Phillips, Mark Edward
System: The UNT Digital Library

Digital Public Library of America: Bulk Metadata Download Feb 2015

Dataset containing metadata contributed to the Digital Public Library of America and normalized into their internal format.
Date: February 2015
Creator: Digital Public Library of America
System: The UNT Digital Library

University of North Texas Libraries Serials Transparency List

Dataset containing information regarding subscriptions purchased by UNT Libraries, along with pricing information for the 2013-14, 2014-15, and 2015-16 fiscal years.
Date: April 2018
Creator: University of North Texas. Libraries. Collection Development.
System: The UNT Digital Library

Hurricane Florence Twitter Dataset

This dataset contains Twitter JSON data for Tweets related to Hurricane Florence and the subsequent flooding along the Carolina coastal region. This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 4,971,575 Tweets and 347,205 media files make up the combined dataset.
Date: 2018-09-05/2018-10-03
Creator: Phillips, Mark Edward
System: The UNT Digital Library

2018 Texas Sentate Debate Twitter Dataset

This dataset contains Twitter JSON data for Tweets related to the United States Senate race between Beto O'Rourke and Ted Cruz. This dataset contains Tweets captured around their first debate on September 21, 2018. This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 3,006,198 Tweets and 101,050 media files make up the combined dataset.
Date: 2018-09-12/2018-10-03
Creator: Phillips, Mark Edward
System: The UNT Digital Library

Congressional Globe OCR Dataset

Dataset of OCR text from the Congressional Globe collection in the UNT Digital Library. In all there are 112 volumes and 104,615 pages of text in this dataset.
Date: April 6, 2015
Creator: Phillips, Mark Edward
System: The UNT Digital Library

Organic Consumer Body Image Communities Sustainable Farming

Dataset contains responses and analysis from a survey of 688 students at a Southwest U.S. university regarding health conscious consumerism and sustainable food practices.
Date: 2015
Creator: Connors, Priscilla L.; Strübel, Jessica & Strzelecka, Marianna
System: The UNT Digital Library

Dallas Police Shooting Twitter Dataset

This dataset contains Twitter JSON data for several Twitter search queries that were collected the week following the shooting of police officers in Dallas, Texas on July 7th 2017, using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 7,146,993 Tweets make up the combined dataset.
Date: 2016-07-05/2016-07-14
Creator: Phillips, Mark Edward
System: The UNT Digital Library

Badlands National Park Twitter Dataset

This dataset contains Twitter JSON data for Tweets related to the Badlands National Park (BadlandsNPS) user's tweets related to climate change and the Trump administration. This dataset was collected a few days before and following the phenomenon on Twitter. This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 321,821 Tweets make up the combined dataset.
Date: 2017-01-15/2017-01-29
Creator: Phillips, Mark Edward
System: The UNT Digital Library

Hurricane Harvey Twitter Dataset

This dataset contains Twitter JSON data for Tweets related to Hurricane Harvey and the subsequent flooding along the Texas gulf region. This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 7,041,866 Tweets make up the combined dataset.
Date: 2017-08-18/2017-09-22
Creator: Phillips, Mark Edward
System: The UNT Digital Library

#Kaepernick7 and #ISupportKaepernickBecause Twitter Dataset

This dataset contains Twitter JSON data for Tweets related to the hashtags #Kaepernick7 and ISupportKaepernickBecause This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 573,379 Tweets make up the combined dataset.
Date: 2016-08-20/2016-08-31
Creator: Phillips, Mark Edward
System: The UNT Digital Library

2015 FIFA Corruption Scandal Twitter Dataset

This dataset is comprised of tweets that are related to the 2015 FIFA corruption scandal. This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 8,615,937 Tweets make up the combined dataset.
Date: 2015-05-21/2015-06-05
Creator: Phillips, Mark Edward
System: The UNT Digital Library

#DescribeTrumpWithOneWord Twitter Dataset

This dataset contains Twitter JSON data for Tweets related to the hashtag #DescribeTrumpWithOneWord. This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 15,676 Tweets make up the combined dataset.
Date: 2017-09-02/2017-09-22
Creator: Phillips, Mark Edward
System: The UNT Digital Library

2016 Democratic National Convention in Philadelphia Twitter Dataset

This dataset is comprised of tweets that are related to the 2016 Democratic National Committee meeting in Philadelphia, Pennsylvania that took place on July 25–28, 2016. This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 15,676 Tweets make up the combined dataset.
Date: 2016-07-15/2016-08-01
Creator: Phillips, Mark Edward
System: The UNT Digital Library

Labeled PDF Dataset from End of Term (EOT) 2008 Web Archive

This dataset contains a random sample of 2000 PDF documents from the usda.gov domain in the End of Term (EOT) 2008 Web Archive. These samples were categorized as being of interest for possible inclusion in the Technical Report Archive and Image Library (TRAIL). Each PDF has been sorted into two categories, Technical_Report and Not_Technical_Report.
Date: July 2018
Creator: Kirkwood, Patricia; Phillips, Mark Edward & Caldwell, Christopher
System: The UNT Digital Library

Unlabeled PDF Dataset of Technical Reports USDA.gov domain in the EOT 2008 Web Archive

This dataset contains a sample of 10,000 PDF documents from the usda.gov domain in the End of Term (EOT) 2008 Web Archive. These samples are unlabeled and uncategorized.
Date: September 12, 2018
Creator: Phillips, Mark Edward
System: The UNT Digital Library

Gaming Census Dataset

This dataset represents survey feedback gathered about games in libraries, collections, cataloging, outreach, and programming.
Date: December 3, 2018
Creator: Brannon, Sian; Robson, Diane & Dewitt-Miller, Erin
System: The UNT Digital Library

Extended Date/Time Format (EDTF) Dates Research Datasets

Two datasets, each with 390,751 date samples from the UNT Libraries' digital collections. These samples were compiled for research regarding the Extended Date/Time Format (EDTF) standard. The first dataset contains a concatenated list of date values from the metadata records in The Portal to Texas History, the UNT Digital Library, and The Gateway to Oklahoma History. The "classified" dataset includes labels expressing whether each date is EDTF-valid and the level of conformance.
Date: February 28, 2013
Creator: Phillips, Mark Edward
System: The UNT Digital Library

Portal to Texas History Newspaper OCR Text Dataset: Gainesville

Dataset of OCR text from The Portal to Texas History and the Texas Digital Newspaper Program. This dataset includes titles from Gainesville Texas from the years 1888 to 1897. Titles included in this dataset include: The Daily Hesperian, and The Gainesville Daily Hesperian. In all there are 2,286 issues comprised of 9,359 pages of text.
Date: November 12, 2015
Creator: Phillips, Mark Edward
System: The UNT Digital Library

Portal to Texas History Newspaper OCR Text Dataset: Temple

Dataset of OCR text from The Portal to Texas History and the Texas Digital Newspaper Program. This dataset includes titles from Temple Texas from the years 1907 to 1922. Titles included in this dataset include: Temple Daily Telegram. In all there are 4,627 issues comprised of 44,633 pages of text.
Date: November 12, 2015
Creator: Phillips, Mark Edward
System: The UNT Digital Library

Portal to Texas History Newspaper OCR Text Dataset: Abilene

Dataset of OCR text from The Portal to Texas History and the Texas Digital Newspaper Program. This dataset includes titles from Abilene Texas from the years 1888 to 1923. Titles included in this dataset include: Abilene Daily Reporter, Abilene Morning Reporter, Abilene Semi-Weekly Farm Reporter, Abilene Semi-Weekly Reporter, Abilene Weekly Reporter, The Abilene Reporter, The Abilene Semi-Weekly Reporter, and the Abilene Weekly Reporter. In all there are 7,208 issues comprised of 62,871 pages of text.
Date: November 12, 2015
Creator: Phillips, Mark Edward
System: The UNT Digital Library