Multi-word Adverbs – How well are they handled in Parsing and Machine Translation?


Volk, Martin; Graën, Johannes (2017). Multi-word Adverbs – How well are they handled in Parsing and Machine Translation? In: The 3rd Workshop on Multi-word Units in Machine Translation and Translation Technology (MUMTTT 2017), London, 14 November 2017.

Abstract

Multi-word expressions are often considered problematic for parsing and other tasks in natural language processing. In this paper we investigate a specific type of multi-word expression: binomial adverbs. These adverbs follow the pattern adverb + conjunction + adverb. We identify and evaluate binomial adverbs in English, German and Swedish. We compute their degree of idiomaticity with an ordering test and with a mutual information score. We show that these idiomaticity measures point to a number of fixed multi-word expressions which are often mis-tagged and mis-parsed. Interestingly, a second evaluation shows that state-of-the-art machine translation handles them well, with some exceptions.
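The two idiomaticity measures named in the abstract can be illustrated with a minimal sketch. The abstract does not give the paper's exact formulas, so the definitions below are assumptions: the ordering test is taken to be the share of occurrences in one order ("A and B") among both orders, and the mutual information score is taken to be standard pointwise mutual information. The function names `ordering_ratio` and `pmi` are hypothetical, and the counts are invented toy values, not results from the paper.

```python
import math
from collections import Counter


def ordering_ratio(pair_counts, first, second):
    """Share of 'first CONJ second' among both orders (assumed definition)."""
    forward = pair_counts[(first, second)]
    backward = pair_counts[(second, first)]
    total = forward + backward
    # A ratio near 1.0 suggests a frozen word order; 0.5 suggests free order.
    return forward / total if total else 0.0


def pmi(pair_count, first_count, second_count, corpus_size):
    """Pointwise mutual information in bits: log2(p(a, b) / (p(a) * p(b)))."""
    if min(pair_count, first_count, second_count) == 0:
        return float("-inf")
    return math.log2(pair_count * corpus_size / (first_count * second_count))


# Invented toy counts for illustration only.
toy_pairs = Counter({("now", "then"): 9, ("then", "now"): 1})
ratio = ordering_ratio(toy_pairs, "now", "then")
score = pmi(pair_count=8, first_count=25, second_count=40, corpus_size=1000)
print(ratio, score)
```

Under these assumptions, a candidate with a high ordering ratio and a high mutual information score would be a likely fixed expression, which is the kind of item the abstract reports as often mis-tagged and mis-parsed.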

Additional indexing

Item Type: Conference or Workshop Item (Paper), refereed, original work
Communities & Collections: 06 Faculty of Arts > Institute of Computational Linguistics
Dewey Decimal Classification: 000 Computer science, knowledge & systems; 410 Linguistics
Uncontrolled Keywords: multiword expressions, machine translation, syntactic parsing, language technology, text analytics
Language: English
Event End Date: 14 November 2017
Deposited On: 13 Nov 2017 12:56
Last Modified: 19 Mar 2018 09:00
Publisher: s.n.
OA Status: Green
