Header

UZH-Logo

Maintenance Infos

Machine Translation of Spanish Personal and Possessive Pronouns Using Anaphora Probabilities


Luong, Ngoc Quang; Popescu-Belis, Andrei; Rios, Annette; Tuggener, Don (2017). Machine Translation of Spanish Personal and Possessive Pronouns Using Anaphora Probabilities. In: 15th Conference of the European Chapter of the Association for Computational Linguistics, Valencia, Spain, 5 April 2017 - 7 April 2017, 631-636.

Abstract

We implement a fully probabilistic model to combine the hypotheses of a Spanish anaphora resolution system with those of a Spanish-English machine translation system. The probabilities over antecedents are converted into probabilities for the features of translated pronouns, and are integrated with phrase-based MT using an additional translation model for pronouns.
The system improves the translation of several Spanish personal and possessive pronouns into English, by solving translation divergencies such as ella→she|it or su→his|her|its|their. On a test set with 2,286 pronouns, a baseline system correctly translates 1,055 of them, while ours improves this by 41. Moreover, with oracle antecedents, possessives are translated with an accuracy of 83%.

Abstract

We implement a fully probabilistic model to combine the hypotheses of a Spanish anaphora resolution system with those of a Spanish-English machine translation system. The probabilities over antecedents are converted into probabilities for the features of translated pronouns, and are integrated with phrase-based MT using an additional translation model for pronouns.
The system improves the translation of several Spanish personal and possessive pronouns into English, by solving translation divergencies such as ella→she|it or su→his|her|its|their. On a test set with 2,286 pronouns, a baseline system correctly translates 1,055 of them, while ours improves this by 41. Moreover, with oracle antecedents, possessives are translated with an accuracy of 83%.

Statistics

Downloads

5 downloads since deposited on 06 Apr 2017
5 downloads since 12 months
Detailed statistics

Additional indexing

Item Type:Conference or Workshop Item (Other), not refereed, original work
Communities & Collections:06 Faculty of Arts > Institute of Computational Linguistics
Dewey Decimal Classification:000 Computer science, knowledge & systems
410 Linguistics
Language:English
Event End Date:7 April 2017
Deposited On:06 Apr 2017 12:53
Last Modified:06 Apr 2017 18:30
Publisher:Association for Computational Linguistics
Funders:Swiss National Science Foundation (SNSF), Sinergia MODERN project (grant n. 147653), European Union Horizon 2020 SUMMA project (grant. n. 88139)
Free access at:Official URL. An embargo period may apply.
Official URL:http://aclweb.org/anthology/E/E17/E17-2100.pdf

Download

Preview Icon on Download
Preview
Filetype: PDF
Size: 129kB

TrendTerms

TrendTerms displays relevant terms of the abstract of this publication and related documents on a map. The terms and their relations were extracted from ZORA using word statistics. Their timelines are taken from ZORA as well. The bubble size of a term is proportional to the number of documents where the term occurs. Red, orange, yellow and green colors are used for terms that occur in the current document; red indicates high interlinkedness of a term with other terms, orange, yellow and green decreasing interlinkedness. Blue is used for terms that have a relation with the terms in this document, but occur in other documents.
You can navigate and zoom the map. Mouse-hovering a term displays its timeline, clicking it yields the associated documents.

Author Collaborations