Degree Department

Determining Event Durations: Models and Error Analysis (open access)

Determining Event Durations: Models and Error Analysis

This paper presents models to predict event durations.
Date: June 1, 2018
Creator: Vempala, Alakananda; Blanco, Eduardo & Palmer, Alexis
Object Type: Paper
System: The UNT Digital Library
Definition and Goals of Descriptive Linguistic Fieldwork (open access)

Definition and Goals of Descriptive Linguistic Fieldwork

Book chapter defining descriptive linguistic fieldwork, explores tasks that fall under this definition, outlines goals of descriptive linguistic fieldwork, and identifies aspirations and limitations of linguistic fieldworkers.
Date: 2011
Creator: Chelliah, Shobhana Lakshmi & de Reuse, Willem
Object Type: Book Chapter
System: The UNT Digital Library
Reproducible Research in Linguistics: A Position Statement on Data Citation and Attribution in Our Field (open access)

Reproducible Research in Linguistics: A Position Statement on Data Citation and Attribution in Our Field

This article is a position statement on reproducible research in linguistics, including data citation and attribution, that represents the collective views of some 41 colleagues.
Date: December 6, 2017
Creator: Berez-Kroeker, Andrea; Gawne, Lauren; Kung, Susan Smythe; Kelly, Barbara F.; Heston, Tyler; Holton, Gary et al.
Object Type: Article
System: The UNT Digital Library
Prenominal possessives in Yiddish: mayn khaver versus mayner a khaver (open access)

Prenominal possessives in Yiddish: mayn khaver versus mayner a khaver

Article provides a systematic comparison and detailed analysis of two prenominal possessive constructions in Yiddish, the familiar mayn khaver ‘my friend’ and the less well-known mayner a khaver ‘a friend of mine.’
Date: February 21, 2022
Creator: Roehrs, Dorian
Object Type: Article
System: The UNT Digital Library
Synthetic data for annotation and extraction of family history information from clinical text (open access)

Synthetic data for annotation and extraction of family history information from clinical text

This article investigates the use of synthetic data for the annotation and automated extraction of family history information relating to cases of cardiac disease from Norwegian clinical text. This work assesses the validity and applicability of the annotated synthetic corpus using machine learning techniques. The methodology outlined in this article may be useful in other situations where limited availability of clinical text hinders NLP tasks.
Date: July 14, 2021
Creator: Brekke, Pål H.; Kasicheyanula, Taraka; Pilán, Ildikó; Nytrø, Øystein & Øvrelid, Lilja
Object Type: Article
System: The UNT Digital Library
Temporally-oriented possession: A corpus for tracking possession over time (open access)

Temporally-oriented possession: A corpus for tracking possession over time

This paper presents a new corpus of Wikipedia articles annotated with temporally-oriented possession or tracking concrete objects as they change hands over time.
Date: January 2019
Creator: Chinnappa, Dhivya; Palmer, Alexis & Blanco, Eduardo
Object Type: Paper
System: The UNT Digital Library
Challenges to Representing Personal Names and Language Names in Language Archives: Examples from Northeast India (open access)

Challenges to Representing Personal Names and Language Names in Language Archives: Examples from Northeast India

Article reviewing one particular challenge to data management relevant to South Asia, which is the complexity of names (of individuals, groups, and languages). It was presented at the 1st International Workshop on Digital Language Archives held on September 30-October 1, 2021 as part of the ACM/IEEE Joint Conference on Digital Libraries 2021.
Date: October 7, 2021
Creator: Burke, Mary & Chelliah, Shobhana Lakshmi
Object Type: Article
System: The UNT Digital Library
Classifying Semantic Clause Types: Modeling Context and Genre Characteristics with Recurrent Neural Networks and Attention (open access)

Classifying Semantic Clause Types: Modeling Context and Genre Characteristics with Recurrent Neural Networks and Attention

This paper introduces an attention mechanism that pinpoints relevant context not only for the current instance, but also for the larger context.
Date: August 2017
Creator: Becker, Maria; Staniek, Michael; Nastase, Vivi; Palmer, Alexis & Frank, Anette
Object Type: Article
System: The UNT Digital Library
Semantic Clause Types and Modality as Features for Argument Analysis (open access)

Semantic Clause Types and Modality as Features for Argument Analysis

This article investigates the role of semantic clause types and modality in argumentative texts.
Date: August 17, 2017
Creator: Becker, Maria; Palmer, Alexis & Frank, Anette
Object Type: Article
System: The UNT Digital Library
What do complexity measures measure? Correlating and validating corpus-based measures of morphological complexity (open access)

What do complexity measures measure? Correlating and validating corpus-based measures of morphological complexity

Article describes how the authors present an analysis of eight measures used for quantifying morphological complexity of natural languages. The measures they study are corpus-based measures of morphological complexity with varying requirements for corpus annotation.
Date: September 22, 2022
Creator: Çöltekin, Çağrı & Rama, Taraka
Object Type: Article
System: The UNT Digital Library

A partial example of a Data Management Plan, with post-project comments

Presentation for the 2017 Symposium on Developing Infrastructure for Computational Resources on South Asian Languages. This presentation provides an example of a data management plan for a linguistics archiving project.
Date: November 17, 2017
Creator: de Reuse, Willem
Object Type: Presentation
System: The UNT Digital Library

Ethics in Data Sharing

Presentation for the 2017 Symposium on Developing Infrastructure for Computational Resources on South Asian Languages. This presentation discusses the ethics involved in data management and data sharing.
Date: November 17, 2017
Creator: Munshi, Sadaf
Object Type: Presentation
System: The UNT Digital Library

Harmonized Annotation and Data Formats

Presentation for the 2017 Symposium on Developing Infrastructure for Computational Resources on South Asian Languages. This presentation asks the authors various questions regarding harmonized annotation and data.
Date: November 17, 2017
Creator: Simons, Gary F.; Aristar-Dry, Helen; Palmer, Alexis & Kung, Susan Smythe
Object Type: Presentation
System: The UNT Digital Library

The Lamkang Community Uses for CoRSAL

Presentation for the 2017 Symposium on Developing Infrastructure for Computational Resources on South Asian Languages. This presentation looks at language archiving needs and issues from the perspective of the Lamkang community.
Date: November 17, 2017
Creator: Khular, Sumshot
Object Type: Presentation
System: The UNT Digital Library

A View from CL/NLP

Presentation for the 2017 Symposium on Developing Infrastructure for Computational Resources on South Asian Languages. This presentation provides an overview of the data structures needed for computational linguistics and natural language processing.
Date: November 17, 2017
Creator: Palmer, Alexis
Object Type: Presentation
System: The UNT Digital Library

Archive and database architecture and usability

Presentation for the 2017 Symposium on Developing Infrastructure for Computational Resources on South Asian Languages. This presentation discusses the use of a database for language archiving through the example of Himalayan Tibeto-Burman languages.
Date: November 17, 2017
Creator: Caplow, Nancy J.; Khular, Sumshot & Willis Oko, Christina
Object Type: Presentation
System: The UNT Digital Library

Computational Resource for South Asian Languages

Presentation for the 2017 Symposium on Developing Infrastructure for Computational Resources on South Asian Languages. This presentation provides an overview of the Computational Resource for South Asian Languages project.
Date: November 17, 2017
Creator: Chelliah, Shobhana Lakshmi
Object Type: Presentation
System: The UNT Digital Library
STREAMLInED Challenges: Aligning Research Interests with Shared Tasks (open access)

STREAMLInED Challenges: Aligning Research Interests with Shared Tasks

This paper describes the use of Shared Task Evaluation Campaigns by designing tasks that are compelling to speech and natural language processing researchers while addressing technical challenges in language documentation and exploiting growing archives of endangered language data.
Date: March 2017
Creator: Levow, Gina-Anne; Bender, Emily M.; Littell, Patrick; Howell, Kristen; Chelliah, Shobhana Lakshmi; Crowgey, Joshua et al.
Object Type: Paper
System: The UNT Digital Library
An Automated Framework for Fast Cognate Detection and Bayesian Phylogenetic Inference in Computational Historical Linguistics (open access)

An Automated Framework for Fast Cognate Detection and Bayesian Phylogenetic Inference in Computational Historical Linguistics

Article presents a fully automated workflow for phylogenetic reconstruction on large datasets, consisting of two novel methods, one for fast detection of cognates and one for fast Bayesian phylogenetic inference.
Date: 2019
Creator: Kasicheyanula, Taraka & List, Johann-Mattis
Object Type: Article
System: The UNT Digital Library
Hierarchical Coding Scheme: Exploring Methods and Techniques for Facilitating Access to Digital Language Archives (open access)

Hierarchical Coding Scheme: Exploring Methods and Techniques for Facilitating Access to Digital Language Archives

This is the hierarchical coding scheme used for qualitative analysis of interviews with language archive managers, depositors, and end-users as part of the 'Exploring Methods and Techniques for Facilitating Access to Digital Language Archives' project (January 2019-August 2020).
Date: June 2020
Creator: Burke, Mary; Zavalina, Oksana; Chelliah, Shobhana Lakshmi & Phillips, Mark Edward
Object Type: Paper
System: The UNT Digital Library
Illegal is not a Noun: Linguistic Form for Detection of Pejorative Nominalizations (open access)

Illegal is not a Noun: Linguistic Form for Detection of Pejorative Nominalizations

This paper focuses on a particular type of abusive language, targeting expressions in which typically neutral adjectives take on pejorative meaning when used as nouns.
Date: August 2017
Creator: Palmer, Alexis; Robinson, Melissa & Phillips, Kristy
Object Type: Paper
System: The UNT Digital Library
Neural classification of Norwegian radiology reports: using NLP to detect findings in CT-scans of children (open access)

Neural classification of Norwegian radiology reports: using NLP to detect findings in CT-scans of children

This article trained machine learning techniques to classify Norwegian radiology reports of pediatric CT examinations according to their description of abnormal findings. The developed models are robust with respect to different contexts, and may be used in quality assurance processes.
Date: March 4, 2021
Creator: Dahl, Fredrik A.; Rama, Taraka; Hurlen, Petter; Brekke, Pål H.; Husby, Haldor; Gundersen, Tore et al.
Object Type: Article
System: The UNT Digital Library
A test of Generalized Bayesian dating: A new linguistic dating method (open access)

A test of Generalized Bayesian dating: A new linguistic dating method

Article addressing if a new Bayesian framework can be introduced and ways to overcome subjectivity. The authors introduce a new method called Generalized Bayesian Dating (GBD) for inferring dates of language groups from lexical and phonological data. This work has implications for future performance testing in the area of linguistic dating.
Date: August 12, 2020
Creator: Kasicheyanula, Taraka & Søren Wichmann
Object Type: Article
System: The UNT Digital Library
Phrasal Proper Names in German and Norwegian (open access)

Phrasal Proper Names in German and Norwegian

Article discusses the morpho-syntax of phrasal proper names like Deutsche Bahn 'German Railway' and Norske Skog 'Norwegian Forest' in German and Norwegian. The authors document that phrasal proper names may show features of recursivity evidenced most clearly in Norwegian.
Date: September 9, 2023
Creator: Julien, Marit & Roehrs, Dorian
Object Type: Article
System: The UNT Digital Library