Header

UZH-Logo

Maintenance Infos

Building and querying parallel treebanks


Volk, Martin; Marek, Torsten; Samuelsson, Yvonne (2011). Building and querying parallel treebanks. Translation: Computation, Corpora, Cognition, 1(1):7-28.

Abstract

This paper describes our work on building a trilingual parallel treebank. We have annotated constituent structure trees from three text genres (a philosophy novel, economy reports and a technical user manual). Our parallel treebank includes word and phrase alignments. The alignment information was manually checked using a graphical tool that allows the annotator to view a pair of trees from parallel sentences. This tool comes with a powerful search facility which supersedes the expressivity of previous popular treebank query engines.

Abstract

This paper describes our work on building a trilingual parallel treebank. We have annotated constituent structure trees from three text genres (a philosophy novel, economy reports and a technical user manual). Our parallel treebank includes word and phrase alignments. The alignment information was manually checked using a graphical tool that allows the annotator to view a pair of trees from parallel sentences. This tool comes with a powerful search facility which supersedes the expressivity of previous popular treebank query engines.

Statistics

Downloads

117 downloads since deposited on 01 Feb 2012
38 downloads since 12 months
Detailed statistics

Additional indexing

Item Type:Journal Article, refereed, original work
Communities & Collections:03 Faculty of Economics > Department of Informatics
Dewey Decimal Classification:000 Computer science, knowledge & systems
Language:English
Date:2011
Deposited On:01 Feb 2012 10:15
Last Modified:05 Apr 2016 15:21
Publisher:Johannes-Gutenberg-Universität
ISSN:2193-6986
Additional Information:Special Issue on Parallel Corpora: Annotation, Exploitation and Evaluation
Official URL:http://www.t-c3.org/index.php/t-c3/article/view/8
Other Identification Number:merlin-id:6344

Download

Preview Icon on Download
Preview
Content: Published Version
Filetype: PDF
Size: 841kB