Quick Search:

uzh logo
Browse by:
bullet
bullet
bullet
bullet

Zurich Open Repository and ArchiveĀ 

Permanent URL to this publication: http://dx.doi.org/10.5167/uzh-63327

Fishel, Mark; Georgakopoulou, Yota; Penkale, Sergio; Petukhova, Volha; Rojc, Matej; Volk, Martin; Way, Andy (2012). From subtitles to parallel corpora. In: The 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, 28 May 2012 - 30 May 2012, 3-6.

[img]
Preview
PDF
164Kb

Abstract

We describe the preparation of parallel corpora based on professional quality subtitles in seven European language pairs. The main focus is the effect of the processing steps on the size and quality of the final corpora.

Item Type:Conference or Workshop Item (Paper), refereed, original work
Communities & Collections:06 Faculty of Arts > Institute of Computational Linguistics
DDC:000 Computer science, knowledge & systems
410 Linguistics
Language:English
Event End Date:30 May 2012
Deposited On:13 Jul 2012 09:18
Last Modified:24 Oct 2012 18:46
Publisher:European Association for Machine Translation
Official URL:http://www.mt-archive.info/EAMT-2012-Fishel.pdf
Related URLs:http://eamt2012.fbk.eu/
Citations:Google Scholarā„¢

Users (please log in): suggest update or correction for this item

Repository Staff Only: item control page