Publication: Using Multilingual Word Embeddings for Similarity-Based Word Alignments in a Zero-Shot Setting: Tested on the Case of German–Romansh
Date
Citations
Dolev, E. L. (2022). Using Multilingual Word Embeddings for Similarity-Based Word Alignments in a Zero-Shot Setting: Tested on the Case of German–Romansh. (Master’s thesis, University of Zurich) https://doi.org/10.5167/uzh-233699
Abstract
Using multilingual word embeddings to compute word alignments has been shown to be competitive with statistical word alignment methods. However, the languages used in those experiments were all “seen” languages, i.e., they were part of the training data for the embeddings. In this thesis I show that multilingual word embeddings taken from mBERT can be used to compute word alignments for the “unseen” language Romansh, aligned against German. The performance is on par with a baseline statistical model (fast_align). I a
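As a rough illustration of what "similarity-based word alignment" can mean, the following minimal sketch aligns two sentences by cosine similarity and keeps only the pairs that are each other's best match (mutual argmax). It is not taken from the thesis: the function names are hypothetical, and the three-dimensional vectors are made-up stand-ins for the contextual embeddings that would normally come from a model such as mBERT.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def align_by_mutual_argmax(src_vecs, tgt_vecs):
    """Return (source index, target index) pairs that are mutual best matches."""
    # Similarity matrix: one row per source word, one column per target word.
    sim = [[cosine(s, t) for t in tgt_vecs] for s in src_vecs]
    # Best target word for each source word.
    best_tgt = [max(range(len(tgt_vecs)), key=lambda j: row[j]) for row in sim]
    # Best source word for each target word.
    best_src = [
        max(range(len(src_vecs)), key=lambda i: sim[i][j])
        for j in range(len(tgt_vecs))
    ]
    # Keep a link only if both directions agree.
    return [(i, j) for i, j in enumerate(best_tgt) if best_src[j] == i]

# Toy vectors (assumption: invented values, not real mBERT output).
german = [[1.0, 0.1, 0.0], [0.0, 1.0, 0.2], [0.1, 0.0, 1.0]]
romansh = [[0.1, 0.9, 0.1], [0.9, 0.0, 0.1], [0.0, 0.2, 0.8]]

alignment = align_by_mutual_argmax(german, romansh)
print(alignment)
```

Requiring agreement in both directions trades recall for precision: a word with no clear counterpart is simply left unaligned rather than forced onto its nearest neighbour.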
Additional indexing
Creators (Authors)
Faculty
Item Type
Referees
In collections
Dewey Decimal Classification
Keywords
Language
Publication date
Date available
Number of pages
OA Status
Official URL