Extending Word Representations with Affix Features for Bidirectional LSTM-CRF-based Korean Named Entity recognition
- Title
- Extending Word Representations with Affix Features for Bidirectional LSTM-CRF-based Korean Named Entity recognition
- Authors
- 정예원
- Date Issued
- 2020
- Publisher
- 포항공과대학교
- Abstract
- We propose a Korean named entity recognition (NER) model which extracts affix features to augment word representations. We build upon two recently prominently used NER models, namely BiLSTM-BiLSTM-CRF and CNN-BiLSTM-CRF, by extending the word embeddings with approximated affix information. We choose to use an inexpensive character-level frequency filter to infer the affix information. Our experimental results on the HCLT 2016 and ETRI NER datasets show up to a 0.93% increase in F1 score compared to the original models without any dictionary or morphological tools. This increase is a significant improvement in NER considering the recent stagnation following the introduction of neural NER. These results show that Korean predicted affix features are useful in building neural NER models.
- URI
- http://postech.dcollection.net/common/orgView/200000287250
https://oasis.postech.ac.kr/handle/2014.oak/111876
- Article Type
- Thesis
- Files in This Item:
- There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.