UZH-Logo

Maintenance Infos

Word-aligned parallel text : a new resource for contrastive language studies


Volk, M; Göhring, A; Lehner, S; Rios, A; Sennrich, R; Uibo, H (2011). Word-aligned parallel text : a new resource for contrastive language studies. In: Supporting Digital Humanities, Conference 2011, Copenhagen, Denmark, 17 November 2011 - 18 November 2011.

Abstract

This paper describes the opportunities that arise from automatic word alignment for bilingual concordances and contrastive language studies. We introduce our parallel corpus of Alpine texts in French and German and our web-based alignment search system. We explain how we have reduced the number of erroneous alignments in the output by distinguishing between dominant and miscellaneous translations. We are currently in the process of extending the system to a new language pair Spanish-Quechua. This poses special problems because of the scarcity of resources for Quechua but also because of the wide typological gap between the languages.

This paper describes the opportunities that arise from automatic word alignment for bilingual concordances and contrastive language studies. We introduce our parallel corpus of Alpine texts in French and German and our web-based alignment search system. We explain how we have reduced the number of erroneous alignments in the output by distinguishing between dominant and miscellaneous translations. We are currently in the process of extending the system to a new language pair Spanish-Quechua. This poses special problems because of the scarcity of resources for Quechua but also because of the wide typological gap between the languages.

Downloads

422 downloads since deposited on 02 Dec 2011
179 downloads since 12 months
Detailed statistics

Additional indexing

Item Type:Conference or Workshop Item (Paper), refereed, original work
Communities & Collections:06 Faculty of Arts > Institute of Computational Linguistics
Dewey Decimal Classification:000 Computer science, knowledge & systems
410 Linguistics
Language:English
Event End Date:18 November 2011
Deposited On:02 Dec 2011 09:38
Last Modified:05 Apr 2016 15:08
Related URLs:http://cst.ku.dk/sdh2011/
Permanent URL: https://doi.org/10.5167/uzh-51481

Download

[img]
Preview
Content: Accepted Version
Filetype: PDF
Size: 2MB

TrendTerms

TrendTerms displays relevant terms of the abstract of this publication and related documents on a map. The terms and their relations were extracted from ZORA using word statistics. Their timelines are taken from ZORA as well. The bubble size of a term is proportional to the number of documents where the term occurs. Red, orange, yellow and green colors are used for terms that occur in the current document; red indicates high interlinkedness of a term with other terms, orange, yellow and green decreasing interlinkedness. Blue is used for terms that have a relation with the terms in this document, but occur in other documents.
You can navigate and zoom the map. Mouse-hovering a term displays its timeline, clicking it yields the associated documents.

Author Collaborations