Quick Search:

uzh logo
Browse by:

Zurich Open Repository and Archive

Maintenance: Tuesday, July the 26th 2016, 07:00-10:00

ZORA's new graphical user interface will be relaunched (For further infos watch out slideshow ZORA: Neues Look & Feel). There will be short interrupts on ZORA Service between 07:00am and 10:00 am. Please be patient.

Permanent URL to this publication: http://dx.doi.org/10.5167/uzh-63327

Fishel, Mark; Georgakopoulou, Yota; Penkale, Sergio; Petukhova, Volha; Rojc, Matej; Volk, Martin; Way, Andy (2012). From subtitles to parallel corpora. In: The 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, 28 May 2012 - 30 May 2012, 3-6.



We describe the preparation of parallel corpora based on professional quality subtitles in seven European language pairs. The main focus is the effect of the processing steps on the size and quality of the final corpora.



41 downloads since deposited on 13 Jul 2012
9 downloads since 12 months

Detailed statistics

Additional indexing

Item Type:Conference or Workshop Item (Paper), refereed, original work
Communities & Collections:06 Faculty of Arts > Institute of Computational Linguistics
Dewey Decimal Classification:000 Computer science, knowledge & systems
410 Linguistics
Event End Date:30 May 2012
Deposited On:13 Jul 2012 07:18
Last Modified:05 Apr 2016 15:52
Publisher:European Association for Machine Translation
Official URL:http://www.mt-archive.info/EAMT-2012-Fishel.pdf
Related URLs:http://eamt2012.fbk.eu/

Users (please log in): suggest update or correction for this item

Repository Staff Only: item control page