Efficient Feature Weighting Methods for Ranking
SCOPUS
- Title
- Efficient Feature Weighting Methods for Ranking
- Authors
- Yu, Hwanjo; Oh, Jinoh; AN, WOOK SHIN
- Date Issued
- 2009-11
- Publisher
- Association for Computing Machinary, Inc.
- Abstract
- Feature weighting or selection is a crucial process to identify an important subset of features from a data set. Removing irrelevant or redundant features can improve the generalization performance of ranking functions in information retrieval. Due to fundamental differences between classification and ranking, feature weighting methods developed for classification cannot be readily applied to feature weighting for ranking. A state of the art feature selection method for ranking, called GAS, has been recently proposed, which exploits importance of each feature and similarity between every pair of features. However, GAS must compute the similarity scores of all pairs of features, thus it is not scalable for high-dimensional data and its performance degrades on nonlinear ranking functions. This paper proposes novel algorithms, RankWrapper and RankFilter, which is scalable for high-dimensional data and also performs reasonably well on nonlinear ranking functions. RankWrapper and RankFilter are designed based on the key idea of Relief algorithm. Relief is a feature selection algorithm for classification, which exploits the notions of hits (data points within the same class) and misses (data points from different classes) for classification. However, there is no such notion of hits or misses in ranking. The proposed algorithms instead utilize the ranking distances of nearest data points in order to identify the key features for ranking. Our extensive experiments show that RankWrapper and RankFilter generate higher accuracy overall than the GAS and traditional Relief algorithms adapted for ranking, and run substantially faster than the GAS on high dimensional data.
- URI
- https://oasis.postech.ac.kr/handle/2014.oak/92225
- DOI
- 10.1145/1645953.1646100
- Article Type
- Article
- Citation
- International Conference on Information and Knowledge Management, Proceedings, page. 1157 - 1165, 2009-11
- Files in This Item:
- There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.