Publication:

Word and sentence segmentation in german: Overcoming idiosyncrasies in the use of punctuation in private communication

Date

Date

Date
2017
Conference or Workshop Item
Published version

Citations

Citation copied

Sugisaki, K. (2017, September). Word and sentence segmentation in german: Overcoming idiosyncrasies in the use of punctuation in private communication. 27th International Conference, GSCL 2017, Berlin. https://doi.org/10.1007/978-3-319-73706-5_6

Abstract

Abstract

Abstract

In this paper, we present a segmentation system for German texts. We apply conditional random fields (CRF), a statistical sequential model, to a type of text used in private communication. We show that by segmenting individual punctuation, and by taking into account freestanding lines and that using unsupervised word representation (i.e., Brown clustering, Word2Vec and Fasttext) achieved a label accuracy of 96% in a corpus of postcards used in private communication.

Metrics

Downloads

83 since deposited on 2018-02-20
Acq. date: 2025-11-13

Views

219 since deposited on 2018-02-20
Acq. date: 2025-11-13

Citations

Additional indexing

Creators (Authors)

  • Sugisaki, Kyoko
    affiliation.icon.alt

Event Title

Event Title

Event Title
27th International Conference, GSCL 2017

Event Location

Event Location

Event Location
Berlin

Event Start Date

Event Start Date

Event Start Date
2017-09-01

Event End Date

Event End Date

Event End Date
2017-09-01

Publisher

Publisher

Publisher

Item Type

Item Type

Item Type
Conference or Workshop Item

Dewey Decimal Classifikation

Dewey Decimal Classifikation

Dewey Decimal Classifikation

Language

Language

Language
English

Date available

Date available

Date available
2018-02-20

OA Status

OA Status

OA Status
Hybrid

Free Access at

Free Access at

Free Access at
Unspecified

Metrics

Downloads

83 since deposited on 2018-02-20
Acq. date: 2025-11-13

Views

219 since deposited on 2018-02-20
Acq. date: 2025-11-13

Citations

Citations

Citation copied

Sugisaki, K. (2017, September). Word and sentence segmentation in german: Overcoming idiosyncrasies in the use of punctuation in private communication. 27th International Conference, GSCL 2017, Berlin. https://doi.org/10.1007/978-3-319-73706-5_6

Hybrid Open Access
Loading...
Thumbnail Image

Files

Files

Files
Files available to download:1

Files

Files

Files
Files available to download:1
Loading...
Thumbnail Image