States

Language

Photograph of a CoRSAL presentation

Photograph of Alexis Palmer giving a presentation on "A View from CL/NLP" at the 2017 Symposium on Developing Infrastructure for Computational Resources on South Asian Languages. The photo is taken from behind several seated members of the audience; Palmer is standing behind a podium and projected slides are visible to her right, on a screen mounted to a brick wall.
Date: November 17, 2017
Creator: University of North Texas. College of Information.
Object Type: Photograph
System: The UNT Digital Library

Photograph of a CoRSAL presentation

Photograph of Shobhana Chelliah standing behind a podium at the 2017 Symposium on Developing Infrastructure for Computational Resources on South Asian Languages and smiling broadly. A part of a projection screen and partial slide are visible behind her on the left, against a brick wall.
Date: November 17, 2017
Creator: University of North Texas. College of Information.
Object Type: Photograph
System: The UNT Digital Library

Photograph of a CoRSAL presentation

Photograph of Sadaf Munshi standing behind a podium with her palms held up, during her presentation "Ethics in Data Sharing" at the 2017 Symposium on Developing Infrastructure for Computational Resources on South Asian Languages. A presentation slide titled "Ethics in data sharing" is projected on a screen in the center of the image, against a brick wall. An audience member is partially visible in the far left part of the image.
Date: November 17, 2017
Creator: University of North Texas. College of Information.
Object Type: Photograph
System: The UNT Digital Library

Photograph of a CoRSAL presentation

Photograph of Shobhana Chelliah standing behind a podium and gesturing with both hands held out toward her left, while at the 2017 Symposium on Developing Infrastructure for Computational Resources on South Asian Languages. Behind her, the final slide of a presentation with the title "Ethics in data sharing: major questions" is projected on a screen against a brick wall. The slide includes contact information for Sadaf Munshi and links to the Burushaski Language Resource and Kashmir Oral History.
Date: November 17, 2017
Creator: University of North Texas. College of Information.
Object Type: Photograph
System: The UNT Digital Library
Determining Event Durations: Models and Error Analysis (open access)

Determining Event Durations: Models and Error Analysis

This paper presents models to predict event durations.
Date: June 1, 2018
Creator: Vempala, Alakananda; Blanco, Eduardo & Palmer, Alexis
Object Type: Paper
System: The UNT Digital Library
Definition and Goals of Descriptive Linguistic Fieldwork (open access)

Definition and Goals of Descriptive Linguistic Fieldwork

Book chapter defining descriptive linguistic fieldwork, explores tasks that fall under this definition, outlines goals of descriptive linguistic fieldwork, and identifies aspirations and limitations of linguistic fieldworkers.
Date: 2011
Creator: Chelliah, Shobhana Lakshmi & de Reuse, Willem
Object Type: Book Chapter
System: The UNT Digital Library
Reproducible Research in Linguistics: A Position Statement on Data Citation and Attribution in Our Field (open access)

Reproducible Research in Linguistics: A Position Statement on Data Citation and Attribution in Our Field

This article is a position statement on reproducible research in linguistics, including data citation and attribution, that represents the collective views of some 41 colleagues.
Date: December 6, 2017
Creator: Berez-Kroeker, Andrea; Gawne, Lauren; Kung, Susan Smythe; Kelly, Barbara F.; Heston, Tyler; Holton, Gary et al.
Object Type: Article
System: The UNT Digital Library
Prenominal possessives in Yiddish: mayn khaver versus mayner a khaver (open access)

Prenominal possessives in Yiddish: mayn khaver versus mayner a khaver

Article provides a systematic comparison and detailed analysis of two prenominal possessive constructions in Yiddish, the familiar mayn khaver ‘my friend’ and the less well-known mayner a khaver ‘a friend of mine.’
Date: February 21, 2022
Creator: Roehrs, Dorian
Object Type: Article
System: The UNT Digital Library
Synthetic data for annotation and extraction of family history information from clinical text (open access)

Synthetic data for annotation and extraction of family history information from clinical text

This article investigates the use of synthetic data for the annotation and automated extraction of family history information relating to cases of cardiac disease from Norwegian clinical text. This work assesses the validity and applicability of the annotated synthetic corpus using machine learning techniques. The methodology outlined in this article may be useful in other situations where limited availability of clinical text hinders NLP tasks.
Date: July 14, 2021
Creator: Brekke, Pål H.; Kasicheyanula, Taraka; Pilán, Ildikó; Nytrø, Øystein & Øvrelid, Lilja
Object Type: Article
System: The UNT Digital Library
Temporally-oriented possession: A corpus for tracking possession over time (open access)

Temporally-oriented possession: A corpus for tracking possession over time

This paper presents a new corpus of Wikipedia articles annotated with temporally-oriented possession or tracking concrete objects as they change hands over time.
Date: January 2019
Creator: Chinnappa, Dhivya; Palmer, Alexis & Blanco, Eduardo
Object Type: Paper
System: The UNT Digital Library
Challenges to Representing Personal Names and Language Names in Language Archives: Examples from Northeast India (open access)

Challenges to Representing Personal Names and Language Names in Language Archives: Examples from Northeast India

Article reviewing one particular challenge to data management relevant to South Asia, which is the complexity of names (of individuals, groups, and languages). It was presented at the 1st International Workshop on Digital Language Archives held on September 30-October 1, 2021 as part of the ACM/IEEE Joint Conference on Digital Libraries 2021.
Date: October 7, 2021
Creator: Burke, Mary & Chelliah, Shobhana Lakshmi
Object Type: Article
System: The UNT Digital Library
Classifying Semantic Clause Types: Modeling Context and Genre Characteristics with Recurrent Neural Networks and Attention (open access)

Classifying Semantic Clause Types: Modeling Context and Genre Characteristics with Recurrent Neural Networks and Attention

This paper introduces an attention mechanism that pinpoints relevant context not only for the current instance, but also for the larger context.
Date: August 2017
Creator: Becker, Maria; Staniek, Michael; Nastase, Vivi; Palmer, Alexis & Frank, Anette
Object Type: Article
System: The UNT Digital Library
Semantic Clause Types and Modality as Features for Argument Analysis (open access)

Semantic Clause Types and Modality as Features for Argument Analysis

This article investigates the role of semantic clause types and modality in argumentative texts.
Date: August 17, 2017
Creator: Becker, Maria; Palmer, Alexis & Frank, Anette
Object Type: Article
System: The UNT Digital Library
2017 Dene/Athabaskan Language Conference and Workshop Day 1 Part 3 captions transcript

2017 Dene/Athabaskan Language Conference and Workshop Day 1 Part 3

This video continues from the Day 1 Part 2 video of the 2017 Dene/Athabaskan Language Conference and Workshop. Ramon Riley reports on the state of White Mountain Apache, and Velma Hale presents on the DNA model from a Dine perspective.
Date: June 27, 2017
Creator: Ross, Chasen & Sisk, Trevor
Object Type: Video
System: The UNT Digital Library
2017 Dene/Athabaskan Language Conference and Workshop Day 3 Part 4 captions transcript

2017 Dene/Athabaskan Language Conference and Workshop Day 3 Part 4

This video continues from the 2017 Dene/Athabaskan Language Conference and Workshop Day 3 Part 3. Conference participants wrap up their reports about what they discussed in their groups. The conference is closed with a prayer led by Don Decker and Terrill Goseyun.
Date: June 29, 2017
Creator: Ross, Chasen & Sisk, Trevor
Object Type: Video
System: The UNT Digital Library
2017 Dene/Athabaskan Language Conference and Workshop Day 1 Part 6 captions transcript

2017 Dene/Athabaskan Language Conference and Workshop Day 1 Part 6

This video continues from the Day 1 Part 5 video of the 2017 Dene/Athabaskan Language Conference and Workshop. Oscar Rodriguez and David Gohre report on the state of Lipan Apache of West Texas.
Date: June 27, 2017
Creator: Ross, Chasen & Sisk, Trevor
Object Type: Video
System: The UNT Digital Library
2017 Dene/Athabaskan Language Conference and Workshop Day 2 Part 3 captions transcript

2017 Dene/Athabaskan Language Conference and Workshop Day 2 Part 3

This video continues from the Day 2 Part 2 video of the 2017 Dene/Athabaskan Language Conference and Workshop. Theodore Fernald describes efforts to teach the structure of Navajo in Philadelphia. Sharon Hargus details blends in Witsuwit’en.
Date: June 28, 2017
Creator: Ross, Chasen & Sisk, Trevor
Object Type: Video
System: The UNT Digital Library
What do complexity measures measure? Correlating and validating corpus-based measures of morphological complexity (open access)

What do complexity measures measure? Correlating and validating corpus-based measures of morphological complexity

Article describes how the authors present an analysis of eight measures used for quantifying morphological complexity of natural languages. The measures they study are corpus-based measures of morphological complexity with varying requirements for corpus annotation.
Date: September 22, 2022
Creator: Çöltekin, Çağrı & Rama, Taraka
Object Type: Article
System: The UNT Digital Library

A partial example of a Data Management Plan, with post-project comments

Presentation for the 2017 Symposium on Developing Infrastructure for Computational Resources on South Asian Languages. This presentation provides an example of a data management plan for a linguistics archiving project.
Date: November 17, 2017
Creator: de Reuse, Willem
Object Type: Presentation
System: The UNT Digital Library

Ethics in Data Sharing

Presentation for the 2017 Symposium on Developing Infrastructure for Computational Resources on South Asian Languages. This presentation discusses the ethics involved in data management and data sharing.
Date: November 17, 2017
Creator: Munshi, Sadaf
Object Type: Presentation
System: The UNT Digital Library

Harmonized Annotation and Data Formats

Presentation for the 2017 Symposium on Developing Infrastructure for Computational Resources on South Asian Languages. This presentation asks the authors various questions regarding harmonized annotation and data.
Date: November 17, 2017
Creator: Simons, Gary F.; Aristar-Dry, Helen; Palmer, Alexis & Kung, Susan Smythe
Object Type: Presentation
System: The UNT Digital Library

The Lamkang Community Uses for CoRSAL

Presentation for the 2017 Symposium on Developing Infrastructure for Computational Resources on South Asian Languages. This presentation looks at language archiving needs and issues from the perspective of the Lamkang community.
Date: November 17, 2017
Creator: Khular, Sumshot
Object Type: Presentation
System: The UNT Digital Library

A View from CL/NLP

Presentation for the 2017 Symposium on Developing Infrastructure for Computational Resources on South Asian Languages. This presentation provides an overview of the data structures needed for computational linguistics and natural language processing.
Date: November 17, 2017
Creator: Palmer, Alexis
Object Type: Presentation
System: The UNT Digital Library

Archive and database architecture and usability

Presentation for the 2017 Symposium on Developing Infrastructure for Computational Resources on South Asian Languages. This presentation discusses the use of a database for language archiving through the example of Himalayan Tibeto-Burman languages.
Date: November 17, 2017
Creator: Caplow, Nancy J.; Khular, Sumshot & Willis Oko, Christina
Object Type: Presentation
System: The UNT Digital Library