Header

UZH-Logo

Maintenance Infos

Supertagging for Domain Adaptation: An Approach with Law Texts


Sugisaki, Kyoko (2017). Supertagging for Domain Adaptation: An Approach with Law Texts. In: The 16th International Conference on Artificial Intelligence and Law, London, 12 June 2017 - 16 June 2017.

Abstract

In this paper, we present a German supertagger that analy- ses syntactic functions in linear order. We apply a statistical sequential model, conditional random fields (CRF), to Swiss law texts, in a real world scenario in which the training data of the domain is missing. We show that the small amount of in-domain training data that was informed by linguistic hard and soft constraints and domain constraints achieved a label accuracy of 90% in the domain data, thus outperforming state-of-the-art parsers.

Abstract

In this paper, we present a German supertagger that analy- ses syntactic functions in linear order. We apply a statistical sequential model, conditional random fields (CRF), to Swiss law texts, in a real world scenario in which the training data of the domain is missing. We show that the small amount of in-domain training data that was informed by linguistic hard and soft constraints and domain constraints achieved a label accuracy of 90% in the domain data, thus outperforming state-of-the-art parsers.

Statistics

Downloads

9 downloads since deposited on 06 Jun 2017
9 downloads since 12 months
Detailed statistics

Additional indexing

Item Type:Conference or Workshop Item (Paper), refereed, original work
Communities & Collections:06 Faculty of Arts > Institute of Computational Linguistics
Dewey Decimal Classification:000 Computer science, knowledge & systems
410 Linguistics
Language:English
Event End Date:16 June 2017
Deposited On:06 Jun 2017 09:07
Last Modified:06 Jun 2017 09:07
Funders:Swiss National Science Foundation

Download

Preview Icon on Download
Preview
Filetype: PDF
Size: 259kB

TrendTerms

TrendTerms displays relevant terms of the abstract of this publication and related documents on a map. The terms and their relations were extracted from ZORA using word statistics. Their timelines are taken from ZORA as well. The bubble size of a term is proportional to the number of documents where the term occurs. Red, orange, yellow and green colors are used for terms that occur in the current document; red indicates high interlinkedness of a term with other terms, orange, yellow and green decreasing interlinkedness. Blue is used for terms that have a relation with the terms in this document, but occur in other documents.
You can navigate and zoom the map. Mouse-hovering a term displays its timeline, clicking it yields the associated documents.

Author Collaborations