Navigation auf zora.uzh.ch

Search ZORA

ZORA (Zurich Open Repository and Archive)

From subtitles to parallel corpora

Fishel, Mark; Georgakopoulou, Yota; Penkale, Sergio; Petukhova, Volha; Rojc, Matej; Volk, Martin; Way, Andy (2012). From subtitles to parallel corpora. In: The 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, 28 May 2012 - 30 May 2012. European Association for Machine Translation, 3-6.

Abstract

We describe the preparation of parallel corpora based on professional quality subtitles in seven European language pairs. The main focus is the effect of the processing steps on the size and quality of the final corpora.

Additional indexing

Item Type:Conference or Workshop Item (Paper), refereed, original work
Communities & Collections:06 Faculty of Arts > Institute of Computational Linguistics
Dewey Decimal Classification:000 Computer science, knowledge & systems
410 Linguistics
Scopus Subject Areas:Social Sciences & Humanities > Language and Linguistics
Physical Sciences > Human-Computer Interaction
Physical Sciences > Software
Language:English
Event End Date:30 May 2012
Deposited On:13 Jul 2012 07:18
Last Modified:17 Mar 2022 08:00
Publisher:European Association for Machine Translation
OA Status:Green
Official URL:http://www.mt-archive.info/EAMT-2012-Fishel.pdf
Related URLs:http://eamt2012.fbk.eu/

Metadata Export

Statistics

Citations

Downloads

112 downloads since deposited on 13 Jul 2012
2 downloads since 12 months
Detailed statistics

Authors, Affiliations, Collaborations

Similar Publications