Efficient implementation of associative classifiers for document classification
SCIE
SCOPUS
- Title
- Efficient implementation of associative classifiers for document classification
- Authors
- Yoon, Y; Lee, GG
- Date Issued
- 2007-03
- Publisher
- PERGAMON-ELSEVIER SCIENCE LTD
- Abstract
- In practical text classification tasks, the ability to interpret the classification result is as important as the ability to classify exactly. Associative classifiers have many favorable characteristics such as rapid training, good classification accuracy, and excellent interpretation. However, associative classifiers also have some obstacles to overcome when they are applied in the area of text classification. The target text collection generally has a very high dimension, thus the training process might take a very long time. We propose a feature selection based on the mutual information between the word and class variables to reduce the space dimension of the associative classifiers. In addition, the training process of the associative classifier produces a huge amount of classification rules, which makes the prediction with a new document ineffective. We resolve this by introducing a new efficient method for storing and pruning classification rules. This method can also be used when predicting a test document. Experimental results using the 20-newsgroups dataset show many benefits of the associative classification in both training and predicting when applied to a real world problem. (c) 2006 Elsevier Ltd. All rights reserved.
- Keywords
- text classification; associative classifier; feature selection; rule pruning; subset expansion
- URI
- https://oasis.postech.ac.kr/handle/2014.oak/23697
- DOI
- 10.1016/j.ipm.2006.07.012
- ISSN
- 0306-4573
- Article Type
- Article
- Citation
- INFORMATION PROCESSING & MANAGEMENT (postech rank 1), vol. 43, no. 2, page. 393 - 405, 2007-03
- Files in This Item:
- There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.