Open Access System for Information Sharing

Login Library

 

Article
Cited 4 time in webofscience Cited 4 time in scopus
Metadata Downloads

Fast and memory-efficient algorithms for high-order Tucker decomposition SCIE SCOPUS

Title
Fast and memory-efficient algorithms for high-order Tucker decomposition
Authors
Hwanjo YuJiyuan ZhangJinoh OhKijung ShinEvangelos E. PapalexakisChristos Faloutsos
Date Issued
2020-01
Publisher
SPRINGER LONDON LTD
Abstract
Multi-aspect data appear frequently in web-related applications. For example, product reviews are quadruplets of the form (user, product, keyword, timestamp), and search-engine logs are quadruplets of the form (user, keyword, location, timestamp). How can we analyze such web-scale multi-aspect data on an off-the-shelf workstation with a limited amount of memory? Tucker decomposition has been used widely for discovering patterns in such multi-aspect data, which are naturally expressed as large but sparse tensors. However, existing Tucker decomposition algorithms have limited scalability, failing to decompose large-scale high-order (= 4) tensors, since they explicitly materialize intermediate data, whose size grows exponentially with the order. To address this problem, which we call "Materialization Bottleneck," we propose S- HOT, a scalable algorithm for high-order Tucker decomposition. S- HOT minimizes materialized intermediate data by using an on-the-fly computation, and it is optimized for disk-resident tensors that are too large to fit in memory. We theoretically analyze the amount of memory and the number of data scans required by S- HOT. Moreover, we empirically showthat S- HOT handles tensors with higher order, dimensionality, and rank than baselines. For example, S- HOT successfully decomposes a real-world tensor from the Microsoft Academic Graph on an off-the-shelf workstation, while all baselines fail. Especially, in terms of dimensionality, S- HOT decomposes 1000xlarger tensors than baselines.
URI
https://oasis.postech.ac.kr/handle/2014.oak/100877
DOI
10.1007/s10115-019-01435-1
ISSN
0219-1377
Article Type
Article
Citation
KNOWLEDGE AND INFORMATION SYSTEMS, 2020-01
Files in This Item:
There are no files associated with this item.

qr_code

  • mendeley

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher

Researcher

유환조YU, HWANJO
Dept of Computer Science & Enginrg
Read more

Views & Downloads

Browse