Header

UZH-Logo

Maintenance Infos

German Compound Splitting Using the Compound Productivity of Morphemes


Sugisaki, Kyoko; Don, Tuggener (2018). German Compound Splitting Using the Compound Productivity of Morphemes. In: "The Conference on Natural Language Processing" / "Die Konferenz zur Verarbeitung natürlicher Sprache" (KONVENS 2018), Wien, September 2018 - September 2018.

Abstract

In this work, we present a novel compound splitting method for German by capturing the compound productivity of morphemes. We use a giga web corpus to create a lexicon and decompose noun compounds by computing the probabilities of compound elements as bound and free morphemes. Furthermore, we provide a uniformed evaluation of several unsupervised approaches and morphological analysers for the task. Our method achieved a high F1 score of 0.92, which was a comparable result to state-of-the-art methods.

Abstract

In this work, we present a novel compound splitting method for German by capturing the compound productivity of morphemes. We use a giga web corpus to create a lexicon and decompose noun compounds by computing the probabilities of compound elements as bound and free morphemes. Furthermore, we provide a uniformed evaluation of several unsupervised approaches and morphological analysers for the task. Our method achieved a high F1 score of 0.92, which was a comparable result to state-of-the-art methods.

Statistics

Downloads

15 downloads since deposited on 12 Oct 2018
11 downloads since 12 months
Detailed statistics

Additional indexing

Item Type:Conference or Workshop Item (Paper), refereed, original work
Communities & Collections:06 Faculty of Arts > Institute of German Studies
06 Faculty of Arts > Institute of Computational Linguistics
Dewey Decimal Classification:000 Computer science, knowledge & systems
410 Linguistics
Language:English
Event End Date:September 2018
Deposited On:12 Oct 2018 10:11
Last Modified:25 Oct 2019 07:43
Publisher:Austrian Academy of Sciences
OA Status:Green
Related URLs:https://www.oeaw.ac.at/ac/konvens2018/

Download

Green Open Access

Download PDF  'German Compound Splitting Using the Compound Productivity of Morphemes'.
Preview
Content: Accepted Version
Language: English
Filetype: PDF
Size: 119kB