UZH-Logo

Maintenance Infos

Multilingwis – A Multilingual Search Tool for Multi-Word Units in Multiparallel Corpora


Clematide, Simon; Graën, Johannes; Volk, Martin (2016). Multilingwis – A Multilingual Search Tool for Multi-Word Units in Multiparallel Corpora. In: Corpas Pastor, Gloria. Computerised and Corpus-based Approaches to Phraseology: Monolingual and Multilingual Perspectives/Fraseología computacional y basada en corpus: perspectivas monolingües y multilingües. Geneva: Tradulex, n/a.

Abstract

We describe a web-based application for searching translations of multi-word units in large, openly available multiparallel corpora. This web application offers a unique resource for multilingual terminologists and translators. The first edition of the tool covers the debates of the European Parliament in five languages: English, French, German, Italian, and Spanish. Our search tool provides a simple and intuitive user interface, which optimally supports content-oriented queries while relieving the user from specifying complicated search expressions in a complex query language. We describe the necessary automatic preprocessing steps of the linguistic data, the retrieval component, and the techniques needed for offering a zero configuration search.

Abstract

We describe a web-based application for searching translations of multi-word units in large, openly available multiparallel corpora. This web application offers a unique resource for multilingual terminologists and translators. The first edition of the tool covers the debates of the European Parliament in five languages: English, French, German, Italian, and Spanish. Our search tool provides a simple and intuitive user interface, which optimally supports content-oriented queries while relieving the user from specifying complicated search expressions in a complex query language. We describe the necessary automatic preprocessing steps of the linguistic data, the retrieval component, and the techniques needed for offering a zero configuration search.

Downloads

55 downloads since deposited on 20 Jan 2016
55 downloads since 12 months
Detailed statistics

Additional indexing

Item Type:Book Section, refereed, original work
Communities & Collections:06 Faculty of Arts > Institute of Computational Linguistics
Dewey Decimal Classification:000 Computer science, knowledge & systems
410 Linguistics
420 English & Old English languages
430 German & related languages
440 French & related languages
450 Italian, Romanian & related languages
460 Spanish & Portuguese languages
Language:English
Date:2016
Deposited On:20 Jan 2016 15:36
Last Modified:05 Apr 2016 19:58
Publisher:Tradulex
Funders:SNF Grant 105215_146781/1

Download

[img]
Preview
Filetype: PDF
Size: 1MB

TrendTerms

TrendTerms displays relevant terms of the abstract of this publication and related documents on a map. The terms and their relations were extracted from ZORA using word statistics. Their timelines are taken from ZORA as well. The bubble size of a term is proportional to the number of documents where the term occurs. Red, orange, yellow and green colors are used for terms that occur in the current document; red indicates high interlinkedness of a term with other terms, orange, yellow and green decreasing interlinkedness. Blue is used for terms that have a relation with the terms in this document, but occur in other documents.
You can navigate and zoom the map. Mouse-hovering a term displays its timeline, clicking it yields the associated documents.

Author Collaborations