Header

UZH-Logo

Maintenance Infos

Combining multi-engine machine translation and online learning through dynamic phrase tables


Sennrich, R (2011). Combining multi-engine machine translation and online learning through dynamic phrase tables. In: EAMT-2011: the 15th Annual Conference of the European Association for Machine Translation, Leuven, Belgium, 30 May 2011 - 31 May 2011.

Abstract

Extending phrase-based Statistical Machine Translation systems with a second, dynamic phrase table has been done for multiple purposes.
Promising results have been reported for hybrid or multi-engine machine translation, i.e.\ building a phrase table from the knowledge of external MT systems, and for online learning.
We argue that, in prior research, dynamic phrase tables are not scored optimally because they may be of small size, which makes the Maximum Likelihood Estimation of translation probabilities unreliable.
We propose basing the scores on frequencies from both the dynamic corpus and the primary corpus instead, and show that this modification significantly increases performance.
We also explore the combination of multi-engine MT and online learning.

Abstract

Extending phrase-based Statistical Machine Translation systems with a second, dynamic phrase table has been done for multiple purposes.
Promising results have been reported for hybrid or multi-engine machine translation, i.e.\ building a phrase table from the knowledge of external MT systems, and for online learning.
We argue that, in prior research, dynamic phrase tables are not scored optimally because they may be of small size, which makes the Maximum Likelihood Estimation of translation probabilities unreliable.
We propose basing the scores on frequencies from both the dynamic corpus and the primary corpus instead, and show that this modification significantly increases performance.
We also explore the combination of multi-engine MT and online learning.

Statistics

Citations

Downloads

153 downloads since deposited on 10 May 2011
2 downloads since 12 months
Detailed statistics

Additional indexing

Item Type:Conference or Workshop Item (Paper), refereed, original work
Communities & Collections:06 Faculty of Arts > Institute of Computational Linguistics
Dewey Decimal Classification:000 Computer science, knowledge & systems
410 Linguistics
Scopus Subject Areas:Social Sciences & Humanities > Language and Linguistics
Physical Sciences > Human-Computer Interaction
Physical Sciences > Software
Language:English
Event End Date:31 May 2011
Deposited On:10 May 2011 09:01
Last Modified:28 Jun 2022 15:45
Funders:Swiss National Science Foundation
OA Status:Green
Project Information:
  • : FunderSNSF
  • : Grant ID
  • : Project TitleSwiss National Science Foundation
  • Content: Accepted Version