Open Access System for Information Sharing

Article
Learning Korean named entity by bootstrapping with Web resources SCIE SCOPUS

Title
Learning Korean named entity by bootstrapping with Web resources
Authors
Lee, S; An, JH; Kwak, BK; Lee, GG
Date Issued
2004-12
Publisher
IEICE-INST ELECTRONICS INFORMATION CO
Abstract
An important issue in applying machine learning algorithms to Natural Language Processing tasks such as Named Entity Recognition is overcoming the lack of tagged corpora. Several bootstrapping methods, such as co-training, have been proposed as a solution. In this paper, we present a different approach that uses Web resources. A Named Entity (NE) tagged corpus is generated from the Web using about 3,000 names as seeds. The generated corpus may be of lower quality than a manually tagged corpus, but its size can be increased sufficiently. Several features are developed, and a decision list is learned from the generated corpus. Our method is verified by comparing it to both a decision list learned on the manual corpus and the DL-CoTrain method. We also present a two-level classification that cascades highly precise lexical patterns and the decision list to improve performance.
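The pipeline the abstract outlines (seed-based corpus tagging, decision list learning, and a two-level cascade) could be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the seed names, the context features (`prev=`, `next=`), the smoothing constant `alpha`, the suffix pattern table, and all function names are hypothetical, and the smoothed-precision score is one common way to rank decision list rules.

```python
from collections import Counter, defaultdict

# Hypothetical seed names with entity types (the paper reports about 3,000 seeds).
SEEDS = {
    "Seoul": "LOCATION",
    "Pohang": "LOCATION",
    "Samsung": "ORGANIZATION",
    "POSTECH": "ORGANIZATION",
}

# Hypothetical high-precision lexical patterns: (token suffix, entity type).
LEXICAL_PATTERNS = [("대학교", "ORGANIZATION")]  # "대학교" means "university"


def tag_with_seeds(sentences, seeds):
    """Auto-label seed occurrences and extract simple context features."""
    examples = []
    for sentence in sentences:
        tokens = sentence.split()
        for i, token in enumerate(tokens):
            if token not in seeds:
                continue
            feats = []
            if i > 0:
                feats.append("prev=" + tokens[i - 1])
            if i + 1 < len(tokens):
                feats.append("next=" + tokens[i + 1])
            examples.append((feats, seeds[token]))
    return examples


def learn_decision_list(examples, alpha=0.1):
    """Build (score, feature, label) rules sorted by smoothed precision."""
    counts = defaultdict(Counter)
    labels = set()
    for feats, label in examples:
        labels.add(label)
        for feat in feats:
            counts[feat][label] += 1
    k = len(labels)
    rules = []
    for feat, label_counts in counts.items():
        best_label, best_count = label_counts.most_common(1)[0]
        total = sum(label_counts.values())
        score = (best_count + alpha) / (total + k * alpha)
        rules.append((score, feat, best_label))
    # Highest score first; feature name breaks ties deterministically.
    rules.sort(key=lambda rule: (-rule[0], rule[1]))
    return rules


def classify(feats, rules):
    """Return the label of the first (strongest) rule whose feature fires."""
    feat_set = set(feats)
    for _score, feat, label in rules:
        if feat in feat_set:
            return label
    return None


def tag_entity(token, feats, rules):
    """Two-level cascade: lexical patterns first, decision list second."""
    for suffix, label in LEXICAL_PATTERNS:
        if token.endswith(suffix):
            return label
    return classify(feats, rules)


corpus = [
    "visited Seoul yesterday",
    "visited Pohang today",
    "joined Samsung yesterday",
    "joined POSTECH today",
]
rules = learn_decision_list(tag_with_seeds(corpus, SEEDS))
```

In this toy corpus the rules for `prev=visited` and `prev=joined` rank above the ambiguous `next=` rules, so an unseen name following "visited" is tagged through the decision list, while a token matching a lexical pattern is tagged at the first level without consulting the list.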
URI
https://oasis.postech.ac.kr/handle/2014.oak/10381
ISSN
0916-8532
Article Type
Article
Citation
IEICE TRANSACTIONS ON INFORMATION AND SYSTEMS, vol. E87D, no. 12, pp. 2872-2882, 2004-12