2015 FIFA Corruption Scandal Twitter Dataset

This dataset consists of tweets related to the 2015 FIFA corruption scandal. It was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 8,615,937 Tweets make up the combined dataset.
Date: 2015-05-21/2015-06-05
Creator: Phillips, Mark Edward
System: The UNT Digital Library

2016 Democratic National Convention in Philadelphia Twitter Dataset

This dataset consists of tweets related to the 2016 Democratic National Convention in Philadelphia, Pennsylvania, which took place on July 25–28, 2016. It was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 15,676 Tweets make up the combined dataset.
Date: 2016-07-15/2016-08-01
Creator: Phillips, Mark Edward
System: The UNT Digital Library

2018 Texas Senate Debate Twitter Dataset

This dataset contains Twitter JSON data for Tweets related to the United States Senate race between Beto O'Rourke and Ted Cruz. This dataset contains Tweets captured around their first debate on September 21, 2018. This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 3,006,198 Tweets and 101,050 media files make up the combined dataset.
Date: 2018-09-12/2018-10-03
Creator: Phillips, Mark Edward
System: The UNT Digital Library

[Age of the UNT Libraries Collection Dataset, 2013]

Dataset generated for the University of North Texas Libraries collection tabulating the number of items published by decade within each subject area.
Date: December 2013
Creator: University of North Texas. Libraries.
System: The UNT Digital Library

ALA Values and LGBT Social Justice

This dataset contains survey results from librarians regarding their stance on American Library Association values and social justice in relation to LGBTQ issues.
Date: May 30, 2017
Creator: Keralis, Spencer D. C. & Elkins, Aaron
System: The UNT Digital Library

Badlands National Park Twitter Dataset

This dataset contains Twitter JSON data for Tweets related to posts by the Badlands National Park (BadlandsNPS) account about climate change and the Trump administration. The data covers the few days before and after the phenomenon on Twitter. This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 321,821 Tweets make up the combined dataset.
Date: 2017-01-15/2017-01-29
Creator: Phillips, Mark Edward
System: The UNT Digital Library

Biological Systems Dataset

Dataset generated for research on biological systems.
Date: July 11, 2014
Creator: Kuprasertkul, Nina; Mehta, Sumedha & Wadawadigi, Akash
System: The UNT Digital Library

Blood Brain Organic Solute Descriptors

Dataset generated for research on blood brain organic solute descriptors.
Date: July 11, 2014
Creator: Kuprasertkul, Nina; Mehta, Sumedha & Wadawadigi, Akash
System: The UNT Digital Library

Blood Fat Solute Descriptors Dataset

Dataset generated for research on blood fat solute descriptors.
Date: July 11, 2014
Creator: Kuprasertkul, Nina; Mehta, Sumedha & Wadawadigi, Akash
System: The UNT Digital Library

Blood Liver Organic Solute Descriptors Dataset

Dataset generated for research on blood liver organic solute descriptors.
Date: July 11, 2014
Creator: Kuprasertkul, Nina; Mehta, Sumedha & Wadawadigi, Akash
System: The UNT Digital Library

Bluegill Dataset

Dataset generated for research on bluegills.
Date: July 11, 2014
Creator: Kuprasertkul, Nina; Mehta, Sumedha & Wadawadigi, Akash
System: The UNT Digital Library

Coda Archival Digital Repository Dataset

This dataset contains information extracted from the UNT Libraries' Coda Digital Repository. It contains information related to number of files, size, and ingest date of digital objects added to that system. It can be used for analysis and investigation of the growth and makeup of digital repositories.
Date: April 1, 2014
Creator: Phillips, Mark Edward & Ko, Lauren
System: The UNT Digital Library

Congressional Globe OCR Dataset

Dataset of OCR text from the Congressional Globe collection in the UNT Digital Library. In all there are 112 volumes and 104,615 pages of text in this dataset.
Date: April 6, 2015
Creator: Phillips, Mark Edward
System: The UNT Digital Library

Dallas Police Shooting Twitter Dataset

This dataset contains Twitter JSON data for several Twitter search queries that were collected the week following the shooting of police officers in Dallas, Texas on July 7, 2016, using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 7,146,993 Tweets make up the combined dataset.
Date: 2016-07-05/2016-07-14
Creator: Phillips, Mark Edward
System: The UNT Digital Library

DataRes Project Institution Policy Scan Data

Dataset from the DataRes Project indicating the names of the institutions in the study, funding awarded by the National Science Foundation (NSF) and the National Institutes of Health (NIH) during the 2010-2011 fiscal year, whether each institution has a Data Management Policy, and the URL if a policy exists.
Date: 2011-10/2013-09
Creator: Keralis, Spencer D. C.; Stark, Shannon; Najmi, Anjum; Freese, Ephraim & Ugartechea, Monica
System: The UNT Digital Library

DataRes Project Primary Survey

Dataset from the DataRes Project. This dataset is the primary survey on data management needs of researchers.
Date: June 2012
Creator: Keralis, Spencer D. C.; Stark, Shannon; Halbert, Martin & Moen, William E.
System: The UNT Digital Library

DataRes Project Secondary Survey

Dataset from the DataRes Project. This dataset is the secondary survey on data management needs of researchers.
Date: October 2012
Creator: Keralis, Spencer D. C.; Stark, Shannon; Halbert, Martin & Moen, William E.
System: The UNT Digital Library

[Dataset of Web Archiving Research Articles]

Datasets used in the presentation, "Towards Building a Collection of Web Archiving Research Articles." The files included here were used to conduct several machine learning classification experiments that resulted in a corpus of scholarly research articles on the topic of web archiving.
Date: August 2014
Creator: Reyes Ayala, Brenda & Caragea, Cornelia
System: The UNT Digital Library

[Dataset Supplemental Material and References]

Supplemental materials and references accompanying a series of chemistry datasets.
Date: July 11, 2014
Creator: Kuprasertkul, Nina; Mehta, Sumedha & Wadawadigi, Akash
System: The UNT Digital Library

#DescribeTrumpWithOneWord Twitter Dataset

This dataset contains Twitter JSON data for Tweets related to the hashtag #DescribeTrumpWithOneWord. This dataset was created using the twarc (https://github.com/edsu/twarc) package that makes use of Twitter's search API. A total of 15,676 Tweets make up the combined dataset.
Date: 2017-09-02/2017-09-22
Creator: Phillips, Mark Edward
System: The UNT Digital Library

Digital Public Library of America: Bulk Metadata Download Feb 2015

Dataset containing metadata contributed to the Digital Public Library of America and normalized into their internal format.
Date: February 2015
Creator: Digital Public Library of America
System: The UNT Digital Library

Ethics Gaming Survey Results

Dataset generated for a National Science Foundation grant project, "EAGER: Prototyping a Virtue Ethics Game." These files contain the research results of the pre-test and post-test surveys.
Date: August 29, 2013
Creator: Oppong, Joseph R.
System: The UNT Digital Library

Extended Date/Time Format (EDTF) Dates Research Datasets

Two datasets, each with 390,751 date samples from the UNT Libraries' digital collections. These samples were compiled for research regarding the Extended Date/Time Format (EDTF) standard. The first dataset contains a concatenated list of date values from the metadata records in The Portal to Texas History, the UNT Digital Library, and The Gateway to Oklahoma History. The "classified" dataset includes labels expressing whether each date is EDTF-valid and the level of conformance.
Date: February 28, 2013
Creator: Phillips, Mark Edward
System: The UNT Digital Library

Gaming Census Dataset

This dataset represents survey feedback gathered about games in libraries, collections, cataloging, outreach, and programming.
Date: December 3, 2018
Creator: Brannon, Sian; Robson, Diane & Dewitt-Miller, Erin
System: The UNT Digital Library