Learn-filter-apply-forget. Mixed approaches to named entity recognition


Volk, M; Clematide, S (2001). Learn-filter-apply-forget. Mixed approaches to named entity recognition. In: 6th International Workshop on Applications of Natural Language for Information Systems, Madrid, Spain, 2001.

Abstract

We have explored and implemented different approaches to named entity recognition in German, a difficult task in this language since both regular nouns and proper names are capitalized. Our goal is to identify and recognise person names, geographical names and company names in a computer magazine corpus. Our geographical name classifier works with precompiled lists, but our company name classifier learns the names from the corpus. For the recognition of person names we work with a precompiled list of first names, and the program learns the last names. For this classifier we suggest setting an activation value for the last name and subsequently depriming the value until the name is "forgotten". Our evaluation results show that our mixed approaches achieve recall and precision values as good as those reported for English. We show that a carefully tuned cascade of name classifiers can even distinguish between different interpretations of a name token within the same document.
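
The learn-apply-forget idea for person names can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract gives no parameter values, so the first-name list, the initial activation of 3, the per-sentence decay of 1, and the choice to refresh the activation whenever a learned name is seen again are all assumptions made for illustration.

```python
# Hypothetical stand-in for the precompiled first-name list.
FIRST_NAMES = {"Martin", "Simon", "Anna"}

INITIAL_ACTIVATION = 3  # assumed value, not taken from the paper
DECAY_PER_SENTENCE = 1  # assumed value, not taken from the paper


def tag_person_names(sentences):
    """Return, for each tokenized sentence, the person names found in it."""
    activation = {}  # learned last name -> remaining activation
    results = []
    for tokens in sentences:
        hits = []
        i = 0
        while i < len(tokens):
            token = tokens[i]
            following = tokens[i + 1] if i + 1 < len(tokens) else None
            # Learn: a known first name followed by a capitalized token.
            if (token in FIRST_NAMES and following is not None
                    and following[:1].isupper()):
                activation[following] = INITIAL_ACTIVATION
                hits.append(token + " " + following)
                i += 2
                continue
            # Apply: a stand-alone last name that is still active.
            if activation.get(token, 0) > 0:
                activation[token] = INITIAL_ACTIVATION  # assumed refresh
                hits.append(token)
            i += 1
        results.append(hits)
        # Forget: lower every activation and drop names that reach zero.
        for name in list(activation):
            activation[name] -= DECAY_PER_SENTENCE
            if activation[name] <= 0:
                del activation[name]
    return results


sentences = [
    ["Martin", "Volk", "spoke"],
    ["Volk", "agreed"],
    ["nothing"],
    ["nothing"],
    ["Volk", "again"],
]
print(tag_person_names(sentences))
```

In this sketch the last name is recognised on its own only while its activation is positive, so the final occurrence, which follows two sentences without a mention, is no longer tagged as a person name.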

Additional indexing

Item Type: Conference or Workshop Item (Paper), refereed, original work
Communities & Collections: 06 Faculty of Arts > Institute of Computational Linguistics
Dewey Decimal Classification: 000 Computer science, knowledge & systems; 410 Linguistics
Event End Date: 2001
Deposited On: 18 Aug 2009 14:39
Last Modified: 06 Dec 2017 20:20