Instance categorization by support vector machines to adjust weights in AdaBoost for imbalanced data classification
SCIE
SCOPUS
- Title
- Instance categorization by support vector machines to adjust weights in AdaBoost for imbalanced data classification
- Authors
- Lee, W.; Jun, Chi-Hyuck; Lee, J.-S.
- Date Issued
- 2017-03
- Publisher
- Elsevier Inc.
- Abstract
- To address class imbalance in data, we propose a new weight adjustment factor that is applied to a weighted support vector machine (SVM) as a weak learner of the AdaBoost algorithm. Different factor scores are computed by categorizing instances based on the SVM margin and are assigned to related instances. The SVM margin is used to define borderline and noisy instances, and the factor scores are assigned to only borderline instances and positive noise. The adjustment factor is then employed as a multiplier to the instance weight in the AdaBoost algorithm when learning a weighted SVM. Using 10 real class-imbalanced datasets, we compare the proposed method to a standard SVM and other SVMs combined with various sampling and boosting methods. Numerical experiments show that the proposed method outperforms existing approaches in terms of F-measure and area under the receiver operating characteristic curve, which means that the proposed method is useful for relaxing the class-imbalance problem by addressing well-known degradation issues such as overlap, small disjunct, and data shift problems. © 2016 Elsevier Inc.
- Keywords
- Adaptive boosting; Numerical methods; Class imbalance; Class imbalance problems; Imbalanced Data-sets; Instance categorization; Numerical experiments; Receiver operating characteristic curves; Weight adjustment; Weighted support vector machine; Support vector machines
- URI
- https://oasis.postech.ac.kr/handle/2014.oak/50725
- DOI
- 10.1016/j.ins.2016.11.014
- ISSN
- 0020-0255
- Article Type
- Article
- Citation
- Information Sciences, vol. 381, pp. 92-103, 2017-03
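The weight adjustment step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract gives neither the margin thresholds nor the factor scores, so the thresholds on the functional margin `y * f(x)`, the values of `borderline_factor` and `positive_noise_factor`, the function names, and the final renormalization are all assumptions made for illustration. The positive (minority) class is assumed to be labeled `+1`, and `decision_values` stands in for the output of an already trained weighted SVM.

```python
from typing import List, Sequence


def categorize(label: int, decision_value: float) -> str:
    """Categorize one instance by its functional margin y * f(x).

    The thresholds (1.0 and 0.0) are assumptions for illustration,
    not the definitions used in the paper.
    """
    margin = label * decision_value
    if margin >= 1.0:
        return "safe"        # correctly classified, outside the margin
    if margin >= 0.0:
        return "borderline"  # correctly classified, inside the margin
    # Misclassified instances are treated as noise, split by class.
    return "positive_noise" if label == 1 else "negative_noise"


def adjust_weights(
    labels: Sequence[int],
    decision_values: Sequence[float],
    weights: Sequence[float],
    borderline_factor: float = 2.0,      # hypothetical factor score
    positive_noise_factor: float = 1.5,  # hypothetical factor score
) -> List[float]:
    """Multiply each instance weight by its category's factor, then renormalize.

    Only borderline instances and positive noise receive a factor other
    than 1, following the abstract. Renormalizing to sum to 1 is an
    assumption borrowed from standard AdaBoost.
    """
    factors = {
        "safe": 1.0,
        "borderline": borderline_factor,
        "positive_noise": positive_noise_factor,
        "negative_noise": 1.0,
    }
    adjusted = [
        w * factors[categorize(y, f)]
        for y, f, w in zip(labels, decision_values, weights)
    ]
    total = sum(adjusted)
    return [w / total for w in adjusted]


if __name__ == "__main__":
    labels = [1, 1, -1, -1]
    decision_values = [1.5, 0.4, -2.0, 0.3]  # stand-in SVM outputs f(x)
    weights = [0.25, 0.25, 0.25, 0.25]       # current AdaBoost weights
    print([categorize(y, f) for y, f in zip(labels, decision_values)])
    print(adjust_weights(labels, decision_values, weights))
```

In a full boosting loop, the adjusted weights would be passed as per-instance weights when fitting the next weighted SVM. That loop is omitted here because the abstract does not specify its details.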