Open Access System for Information Sharing

Login Library

 

Article
Cited 8 time in webofscience Cited 0 time in scopus
Metadata Downloads

A corpus-based learning method of compound noun indexing rules for Korean

Title
A corpus-based learning method of compound noun indexing rules for Korean
Authors
Kim, JHKwak, BKLee, SLee, GLee, JH
POSTECH Authors
Lee, GLee, JH
Date Issued
Jul-2001
Publisher
KLUWER ACADEMIC PUBL
Abstract
In Korean information retrieval, compound nouns play an important role in improving precision in search experiments. There are two major approaches to compound noun indexing in Korean: statistical and linguistic. Each method, however, has its own shortcomings, such as limitations when indexing diverse types of compound nouns, over-generation of compound nouns, and data sparseness in training. In this paper, we propose a corpus-based learning method, which can index diverse types of compound nouns using rules automatically extracted from a large corpus. The automatic learning method is more portable and requires less human effort, although it exhibits a performance level similar to the manual-linguistic approach. We also present a new filtering method to solve the problems of compound noun over-generation and data sparseness.
Keywords
corpus-based learning; compound noun indexing; filtering; information retrieval; search performance evaluation
URI
http://oasis.postech.ac.kr/handle/2014.oak/19501
ISSN
1386-4564
Article Type
Article
Citation
INFORMATION RETRIEVAL, vol. 4, no. 2, page. 115 - 132, 2001-07
Files in This Item:
There are no files associated with this item.

qr_code

  • mendeley

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher

 LEE, GARY GEUNBAE
Dept of Computer Science & Enginrg
Read more

Views & Downloads

Browse