Subtopic mining using simple patterns and hierarchical structure of subtopic candidates from web documents
SCIE
SSCI
SCOPUS
- Title
- Subtopic mining using simple patterns and hierarchical structure of subtopic candidates from web documents
- Authors
- Kim, SJ; Lee, JH
- Date Issued
- 2015-11
- Publisher
- ELSEVIER SCI LTD
- Abstract
- The intention gap between users and queries results in ambiguous and broad queries. To solve these problems, subtopic mining has been studied, which returns a ranked list of possible subtopics according to their relevance, popularity, and diversity. This paper proposes a novel method to mine subtopics using simple patterns and a hierarchical structure of subtopic candidates. First, relevant and various phrases are extracted as subtopic candidates using simple patterns based on noun phrases and alternative partial-queries. Second, a hierarchical structure of the subtopic candidates is constructed using sets of relevant documents from a web document collection. Finally, the subtopic candidates are ranked considering a balance between popularity and diversity using this structure. In experiments, our proposed methods outperformed the baselines and even an external resource based method at high-ranked subtopics, which shows that our methods can be effective and useful in various search scenarios like result diversification. (C) 2015 Elsevier Ltd. All rights reserved.
- URI
- https://oasis.postech.ac.kr/handle/2014.oak/35527
- DOI
- 10.1016/J.IPM.2015.07.001
- ISSN
- 0306-4573
- Article Type
- Article
- Citation
- INFORMATION PROCESSING & MANAGEMENT, vol. 51, no. 6, page. 773 - 785, 2015-11
- Files in This Item:
- There are no files associated with this item.
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.