Navigation auf zora.uzh.ch

Search ZORA

ZORA (Zurich Open Repository and Archive)

German Compound Splitting Using the Compound Productivity of Morphemes

Sugisaki, Kyoko; Tuggener, Don (2018). German Compound Splitting Using the Compound Productivity of Morphemes. In: The Conference on Natural Language Processing / Die Konferenz zur Verarbeitung natürlicher Sprache (KONVENS 2018), Wien, September 2018, Austrian Academy of Sciences.

Abstract

In this work, we present a novel compound splitting method for German by capturing the compound productivity of morphemes. We use a giga web corpus to create a lexicon and decompose noun compounds by computing the probabilities of compound elements as bound and free morphemes. Furthermore, we provide a uniformed evaluation of several unsupervised approaches and morphological analysers for the task. Our method achieved a high F1 score of 0.92, which was a comparable result to state-of-the-art methods.

Additional indexing

Item Type:Conference or Workshop Item (Paper), refereed, original work
Communities & Collections:06 Faculty of Arts > Institute of German Studies
06 Faculty of Arts > Institute of Computational Linguistics
Dewey Decimal Classification:000 Computer science, knowledge & systems
410 Linguistics
Scopus Subject Areas:Physical Sciences > Software
Language:English
Event End Date:September 2018
Deposited On:12 Oct 2018 10:11
Last Modified:02 Feb 2024 13:12
Publisher:Austrian Academy of Sciences
OA Status:Green
Related URLs:https://www.austriaca.at/8437-9
Download PDF  'German Compound Splitting Using the Compound Productivity of Morphemes'.
Preview
  • Content: Accepted Version
  • Language: English

Metadata Export

Statistics

Citations

Downloads

28 downloads since deposited on 12 Oct 2018
9 downloads since 12 months
Detailed statistics

Authors, Affiliations, Collaborations

Similar Publications