Header

UZH-Logo

Maintenance Infos

Deriving a lexicon for a precision grammar from language documentation resources: a case study of Chintang


Bender, Emily M; Schikowski, Robert; Bickel, Balthasar (2012). Deriving a lexicon for a precision grammar from language documentation resources: a case study of Chintang. In: Kay, M; Boitet, C. Proceedings of the 24th International Conference on Computational Linguistics (COLING). Mumbai: Association for Computational Linguistics, 247-262.

Abstract

Language documentation projects typically invest a lot of effort in creating digitized lexical resources, which are used in the creation of dictionaries and in the glossing of collected texts. We present and evaluate a methodology for repurposing such a lexical resource developed for Chintang (ISO639-3: ctn), a language of Nepal, for use with a precision implemented grammar developed in the DELPH-IN formalism. The target lexicon, when combined with a set of morphological rules, achieves 57% type-level coverage and 50% token-level coverage of held-out texts, while maintaining a feature-level accuracy F-measure of 70%. As lexicon development is typically one of the most expensive aspects of creating a precision grammar, this represents a significant savings of effort.

Abstract

Language documentation projects typically invest a lot of effort in creating digitized lexical resources, which are used in the creation of dictionaries and in the glossing of collected texts. We present and evaluate a methodology for repurposing such a lexical resource developed for Chintang (ISO639-3: ctn), a language of Nepal, for use with a precision implemented grammar developed in the DELPH-IN formalism. The target lexicon, when combined with a set of morphological rules, achieves 57% type-level coverage and 50% token-level coverage of held-out texts, while maintaining a feature-level accuracy F-measure of 70%. As lexicon development is typically one of the most expensive aspects of creating a precision grammar, this represents a significant savings of effort.

Statistics

Citations

Downloads

193 downloads since deposited on 25 Apr 2013
14 downloads since 12 months
Detailed statistics

Additional indexing

Item Type:Book Section, refereed, original work
Communities & Collections:06 Faculty of Arts > Department of Comparative Linguistics
Dewey Decimal Classification:490 Other languages
890 Other literatures
410 Linguistics
Language:English
Date:2012
Deposited On:25 Apr 2013 06:49
Last Modified:07 Dec 2017 21:03
Publisher:Association for Computational Linguistics
Related URLs:http://www.coling2012-iitb.org/

Download

Download PDF  'Deriving a lexicon for a precision grammar from language documentation resources: a case study of Chintang'.
Preview
Content: Published Version
Language: English
Filetype: PDF
Size: 166kB