Navigation auf zora.uzh.ch

Search ZORA

ZORA (Zurich Open Repository and Archive)

Mixture-modeling with unsupervised clusters for domain adaptation in statistical machine translation

Sennrich, Rico (2012). Mixture-modeling with unsupervised clusters for domain adaptation in statistical machine translation. In: 16th EAMT Conference, Trento, Italy, 28 May 2012 - 29 May 2012, 185-192.

Abstract

In Statistical Machine Translation, in-domain and out-of-domain training data are not always clearly delineated. This paper investigates how we can still use mixture-modeling techniques for domain adaptation in such cases. We apply unsupervised clustering methods to split the original training set, and then use mixture-modeling techniques to build a model adapted to a given target domain. We show that this approach improves performance over an unadapted baseline, and several alternative domain adaptation methods.

Additional indexing

Item Type:Conference or Workshop Item (Paper), refereed, original work
Communities & Collections:06 Faculty of Arts > Institute of Computational Linguistics
Dewey Decimal Classification:000 Computer science, knowledge & systems
410 Linguistics
Scopus Subject Areas:Social Sciences & Humanities > Language and Linguistics
Physical Sciences > Human-Computer Interaction
Physical Sciences > Software
Language:English
Event End Date:29 May 2012
Deposited On:07 Jun 2012 08:49
Last Modified:16 Mar 2022 08:23
OA Status:Green
Free access at:Official URL. An embargo period may apply.
Official URL:http://hltshare.fbk.eu/EAMT2012/html/Papers/42.pdf
Related URLs:http://www.eamt.org/news/news_program_eamt_2012.php
Download PDF  'Mixture-modeling with unsupervised clusters for domain adaptation in statistical machine translation'.
Preview
  • Content: Published Version

Metadata Export

Statistics

Citations

Downloads

276 downloads since deposited on 07 Jun 2012
6 downloads since 12 months
Detailed statistics

Authors, Affiliations, Collaborations

Similar Publications