Permanent URL to this publication: http://dx.doi.org/10.5167/uzh-63327
Fishel, Mark; Georgakopoulou, Yota; Penkale, Sergio; Petukhova, Volha; Rojc, Matej; Volk, Martin; Way, Andy (2012). From subtitles to parallel corpora. In: The 16th Annual Conference of the European Association for Machine Translation, Trento, Italy, 28 May 2012 - 30 May 2012, 3-6.
We describe the preparation of parallel corpora based on professional quality subtitles in seven European language pairs. The main focus is the effect of the processing steps on the size and quality of the final corpora.
32 downloads since deposited on 13 Jul 2012
8 downloads since 12 months
|Item Type:||Conference or Workshop Item (Paper), refereed, original work|
|Communities & Collections:||06 Faculty of Arts > Institute of Computational Linguistics|
|Dewey Decimal Classification:||000 Computer science, knowledge & systems
|Event End Date:||30 May 2012|
|Deposited On:||13 Jul 2012 07:18|
|Last Modified:||24 Oct 2012 16:46|
|Publisher:||European Association for Machine Translation|
Users (please log in): suggest update or correction for this item
Repository Staff Only: item control page