Co-training and Self-training for Word Sense Disambiguation (open access)

Co-training and Self-training for Word Sense Disambiguation

This paper investigates the application of co-training and self-training to word sense disambiguation. Optimal and empirical parameter selection methods for co-training and self-training are investigated, with various degrees of error reduction. A new method that combines co-training with majority voting is introduced, with the effect of smoothing the bootstrapping learning curves, and improving the average performance.
Date: May 2004
Creator: Mihalcea, Rada, 1974-
Object Type: Paper
System: The UNT Digital Library
Graph-based Ranking Algorithms for Sentence Extraction, Applied to Text Summarization (open access)

Graph-based Ranking Algorithms for Sentence Extraction, Applied to Text Summarization

Abstract: This paper presents an innovative unsupervised method for automatic sentence extraction using graph-based ranking algorithms. We evaluate the method in the context of a text summarization task, and show that the results obtained compare favorably with previously published results on established benchmarks.
Date: July 2004
Creator: Mihalcea, Rada, 1974-
Object Type: Paper
System: The UNT Digital Library
Instance Based Learning with Automatic Feature Selection Applied to Word Sense Disambiguation (open access)

Instance Based Learning with Automatic Feature Selection Applied to Word Sense Disambiguation

This paper discusses instance based learning with automatic feature selection applied to word sense disambiguation.
Date: August 2002
Creator: Mihalcea, Rada, 1974-
Object Type: Paper
System: The UNT Digital Library
Language Independent Extractive Summarization (open access)

Language Independent Extractive Summarization

This paper discusses language independent extractive summarization.
Date: July 2005
Creator: Mihalcea, Rada, 1974-
Object Type: Paper
System: The UNT Digital Library
Making Sense Out of the Web (open access)

Making Sense Out of the Web

This paper discusses the main lines of research in deriving efficient Word Sense Disambiguation.
Date: November 2004
Creator: Mihalcea, Rada, 1974-
Object Type: Paper
System: The UNT Digital Library
The Multidisciplinary Facets of Research on Humour (open access)

The Multidisciplinary Facets of Research on Humour

In this paper, the authors summarize the main theories of humor that emerged from philosophical and modern psychological research, and survey the past and present developments in the fields of theoretical and computational linguistics.
Date: July 2007
Creator: Mihalcea, Rada, 1974-
Object Type: Paper
System: The UNT Digital Library
Performance Analysis of a Part of Speech Tagging Task (open access)

Performance Analysis of a Part of Speech Tagging Task

This article discusses performance analysis of a part of speech tagging task.
Date: February 2003
Creator: Mihalcea, Rada, 1974-
Object Type: Paper
System: The UNT Digital Library
[Review] The Text Mining Handbook: Advanced Approaches to Analyzing Unstructured Data (open access)

[Review] The Text Mining Handbook: Advanced Approaches to Analyzing Unstructured Data

This article reviews the book "'The Text Mining Handbook: Advanced Approaches to Analyzing Unstructured Data," by Ronen Feldman and James Sanger.
Date: March 2008
Creator: Mihalcea, Rada, 1974-
Object Type: Review
System: The UNT Digital Library
The Role of Non-Ambiguous Words in Natural Language Disambiguation (open access)

The Role of Non-Ambiguous Words in Natural Language Disambiguation

This article discusses the role of non-ambiguous words in natural language disambiguation.
Date: September 2003
Creator: Mihalcea, Rada, 1974-
Object Type: Paper
System: The UNT Digital Library
The Semantic Wildcard (open access)

The Semantic Wildcard

This paper introduces the semantic wildcard, one of the most powerful operators implemented in IRSLO, which allows for searches along general-specific lines.
Date: May 2002
Creator: Mihalcea, Rada, 1974-
Object Type: Paper
System: The UNT Digital Library
A Semi-Complete Disambiguation Algorithm for Open Text (open access)

A Semi-Complete Disambiguation Algorithm for Open Text

This paper discusses a semi-complete disambiguation algorithm for open text.
Date: 2000
Creator: Mihalcea, Rada, 1974-
Object Type: Paper
System: The UNT Digital Library
Unsupervised Large-Vocabulary Word Sense Disambiguation with Graph-based Algorithms for Sequence Data Labeling (open access)

Unsupervised Large-Vocabulary Word Sense Disambiguation with Graph-based Algorithms for Sequence Data Labeling

This paper introduces a graph-based algorithm for sequence data labeling, using random walks on graphs encoding label dependencies. The algorithm is illustrated and tested in the context of an unsupervised word sense disambiguation problem, and shown to significantly outperform the accuracy achieved through individual label assignment, as measured on standard sense-annotated data sets.
Date: October 2005
Creator: Mihalcea, Rada, 1974-
Object Type: Paper
System: The UNT Digital Library
Using Wikipedia for Automatic Word Sense Disambiguation (open access)

Using Wikipedia for Automatic Word Sense Disambiguation

This paper describes a method for generating sense-tagged data using Wikipedia as a source of sense annotations. Through word sense disambiguation experiments, the authors show that the Wikipedia-based sense annotations are reliable and can be used to construct accurate sense classifiers.
Date: April 2007
Creator: Mihalcea, Rada, 1974-
Object Type: Paper
System: The UNT Digital Library
Word Sense Disambiguation with Pattern Learning and Automatic Feature Selection (open access)

Word Sense Disambiguation with Pattern Learning and Automatic Feature Selection

Article discussing word sense disambiguation with pattern learning and automatic feature selection.
Date: January 22, 2003
Creator: Mihalcea, Rada, 1974-
Object Type: Article
System: The UNT Digital Library
Creating Large Annotated Data Collections with Web Users' Help (open access)

Creating Large Annotated Data Collections with Web Users' Help

This paper discusses creating annotated data collections.
Date: April 2003
Creator: Mihalcea, Rada, 1974- & Chklovski, Timothy A. (Timothy Anatolievich), 1977-
Object Type: Paper
System: The UNT Digital Library
SenseLearner: Word Sense Disambiguation for All Words in Unrestricted Text (open access)

SenseLearner: Word Sense Disambiguation for All Words in Unrestricted Text

This paper describes SenseLearner, a minimally supervised word sense disambiguation system that attempts to disambiguate all content words in a text using WordNet senses.
Date: June 2005
Creator: Mihalcea, Rada, 1974- & Csomai, Andras
Object Type: Paper
System: The UNT Digital Library
Wikify! Linking Documents to Encyclopedic Knowledge (open access)

Wikify! Linking Documents to Encyclopedic Knowledge

This paper introduces the use of Wikipedia as a resource for automatic keyword extraction and word sense disambiguation, and shows how this online encyclopedia can be used to achieve state-of-the-art results on both these tasks.
Date: November 2007
Creator: Mihalcea, Rada, 1974- & Csomai, Andras
Object Type: Paper
System: The UNT Digital Library
SenseLearner: Minimally Supervised Word Sense Disambiguation for All Words in Open Text (open access)

SenseLearner: Minimally Supervised Word Sense Disambiguation for All Words in Open Text

This paper introduces SenseLearner - a minimally supervised sense tagger that attempts to disambiguate all content words in a text using the sense from WordNet. SenseLearner participated in the SENSEVAL-3 English all words task, and achieved an average accuracy of 64.6%.
Date: 2004
Creator: Mihalcea, Rada, 1974- & Faruque, Ehsanul
Object Type: Paper
System: The UNT Digital Library
Using the Essence of Texts to Improve Document Classification (open access)

Using the Essence of Texts to Improve Document Classification

This article discusses using the essence of texts to improve document classification.
Date: September 2005
Creator: Mihalcea, Rada, 1974- & Hassan, Samer
Object Type: Paper
System: The UNT Digital Library
Toward Communicating Simple Sentences Using Pictorial Representations (open access)

Toward Communicating Simple Sentences Using Pictorial Representations

This article discusses communicating simple sentences using pictorial representations.
Date: April 9, 2009
Creator: Mihalcea, Rada, 1974- & Leong, Ben
Object Type: Article
System: The UNT Digital Library
Automatic generation of a coarse grained WordNet (open access)

Automatic generation of a coarse grained WordNet

This paper discusses automatic generation of a coarse grained WordNet.
Date: June 2001
Creator: Mihalcea, Rada, 1974- & Moldovan, Dan I.
Object Type: Paper
System: The UNT Digital Library
Document Indexing using Named Entities (open access)

Document Indexing using Named Entities

This article discusses document indexing using named entities.
Date: January 2001
Creator: Mihalcea, Rada, 1974- & Moldovan, Dan I.
Object Type: Article
System: The UNT Digital Library
eXtended WordNet: progress report (open access)

eXtended WordNet: progress report

This paper discusses eXtended WordNet.
Date: June 2001
Creator: Mihalcea, Rada, 1974- & Moldovan, Dan I.
Object Type: Paper
System: The UNT Digital Library
An Iterative Approach to Word Sense Disambiguation (open access)

An Iterative Approach to Word Sense Disambiguation

This paper discusses an iterative approach to Word Sense Disambiguation.
Date: May 2000
Creator: Mihalcea, Rada, 1974- & Moldovan, Dan I.
Object Type: Paper
System: The UNT Digital Library