ZORA (Zurich Open Repository and Archive)

Treatment of Markup in Statistical Machine Translation

Müller, Mathias (2017). Treatment of Markup in Statistical Machine Translation. In: Third Workshop on Discourse in Machine Translation, Copenhagen, Denmark, 8 September 2017. Association for Computational Linguistics, 36-46.

Abstract

We present work on handling XML markup in Statistical Machine Translation (SMT). The methods we propose can be used to effectively preserve markup (for instance inline formatting or structure) and to place markup correctly in a machine-translated segment. We evaluate our approaches with parallel data that naturally contains markup or where markup was inserted to create synthetic examples. In our experiments, hybrid reinsertion has proven the most accurate method to handle markup, while alignment masking and alignment reinsertion should be regarded as viable alternatives. We provide implementations of all the methods described and they are freely available as an open-source framework.
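
To illustrate the general idea of masking markup before translation and restoring it afterwards, here is a minimal sketch in Python. It is not the implementation from the paper or from the linked framework: the function names `mask_markup` and `unmask_markup`, the placeholder format `__xml_N__`, and the simple tag pattern are all assumptions made for this example, and the translation step is simulated by a hand-written string rather than an SMT system.

```python
import re

# Assumption: a deliberately simple pattern for opening, closing and
# self-closing tags. Real XML (comments, CDATA, entities) needs a parser.
TAG_PATTERN = re.compile(r"</?[A-Za-z][^<>]*>")


def mask_markup(segment):
    """Replace each tag with a numbered placeholder token.

    Returns the masked segment and a mapping from placeholder to tag.
    """
    mapping = {}

    def substitute(match):
        placeholder = f"__xml_{len(mapping)}__"
        mapping[placeholder] = match.group(0)
        # Surround with spaces so the placeholder is a separate token.
        return f" {placeholder} "

    masked = TAG_PATTERN.sub(substitute, segment)
    # Normalise the whitespace introduced by the substitution.
    return " ".join(masked.split()), mapping


def unmask_markup(translated, mapping):
    """Put the original tags back wherever placeholders survived."""
    for placeholder, tag in mapping.items():
        translated = translated.replace(placeholder, tag)
    return translated


source = "Press <b>Start</b> now."
masked, mapping = mask_markup(source)

# Stand-in for the output of a translation system (hypothetical).
translated = "Drücken Sie jetzt __xml_0__ Start __xml_1__ ."
restored = unmask_markup(translated, mapping)
```

In this sketch the placeholders are assumed to pass through translation unchanged. It does not restore the original whitespace around tags, and it does not cover the case where a placeholder is dropped or reordered, which is where alignment-based reinsertion of the kind the abstract mentions would come in.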

Additional indexing

Item Type: Conference or Workshop Item (Other), refereed, original work
Communities & Collections: 06 Faculty of Arts > Institute of Computational Linguistics
Dewey Decimal Classification: 000 Computer science, knowledge & systems; 410 Linguistics
Language: English
Event End Date: 8 September 2017
Deposited On: 03 Oct 2017 13:44
Last Modified: 13 Oct 2023 13:36
Publisher: Association for Computational Linguistics
OA Status: Green
Free access at: Official URL. An embargo period may apply.
Official URL: http://www.aclweb.org/anthology/W/W17/W17-4804.pdf
Related URLs: https://gitlab.cl.uzh.ch/mt/mtrain

Statistics

Downloads

112 downloads since deposited on 03 Oct 2017
20 downloads in the past 12 months