SVM-based biological named entity recognition using minimum edit-distance feature boosted by virtual examples
SCIE
SCOPUS
- Title
- SVM-based biological named entity recognition using minimum edit-distance feature boosted by virtual examples
- Authors
- Yi, E; Lee, GG; Song, Y; Park, SJ
- Date Issued
- 2005-01
- Publisher
- SPRINGER-VERLAG BERLIN
- Abstract
- In this paper, we propose two independent solutions to the problems of spelling variants and the lack of annotated corpus, which are the main difficulties in SVM(Support-Vector Machine) and other machine-learning based biological named entity recognition. To resolve the problem of spelling variants, we propose the use of edit-distance as a feature for SVM. To resolve the lack-of-corpus problem, we propose the use of virtual examples, by which the annotated corpus can be automatically expanded in a fast, efficient and easy way. The experimental results show that the introduction of edit-distance produces some improvements. And the model, which is trained with the corpus expanded by virtual examples, outperforms the model trained with the original corpus. Finally, we achieved the high performance of 71.46% in F-measure (64.03% in precision, 80.84% in recall) in the experiment of five categories named entity recognition on CENIA corpus (version 3.0).
- URI
- https://oasis.postech.ac.kr/handle/2014.oak/24654
- DOI
- 10.1007/978-3-540-30211-7_86
- ISSN
- 0302-9743
- Article Type
- Article
- Citation
- LECTURE NOTES IN COMPUTER SCIENCE, vol. 3248, page. 807 - 814, 2005-01
- Files in This Item:
- There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.