Open Access System for Information Sharing

Login Library


Cited 13 time in webofscience Cited 21 time in scopus
Metadata Downloads

Hybrid Garbage Collection for Multi-Version Concurrency Control in SAP HANA SCIE SCOPUS

Hybrid Garbage Collection for Multi-Version Concurrency Control in SAP HANA
Juchang LeeHyungyu ShinChang Gyoo ParkSeongyun KoJaeyun NohYongjae ChuhWolfgang StephanHan, W.-S.
Date Issued
While multi-version concurrency control (MVCC) supports fast and robust performance in in-memory, relational databases, it has the potential problem of a growing number of versions over time due to obsolete versions. Although a few TB of main memory is available for enterprise machines, the memory resource should be used carefully for economic and practical reasons. Thus, in order to maintain the necessary number of versions in MVCC, versions which will no longer be used need to be deleted. This process is called garbage collection. MVCC uses the concept of visibility to define garbage. A set of versions for each record is first identified as candidate if their version timestamps are lower than the minimum value of snapshot timestamps of active snapshots in the system. All such candidates, except the one which has the maximum version timestamp, are safely reclaimed as garbage versions. In mixed OLTP and OLAP workloads, the typical garbage collector may not effectively reclaim record versions. In these workloads, OLTP applications generate a high volume of new versions, while long-lived queries or transactions in OLAP applications often block garbage collection, since we need to compare the version timestamp of each record version with the snapshot times tamp of the oldest, long-lived snapshot. Thus, these workloads typically cause the in-memory version space to grow. Additionally, the increasing version chains of records over time may also increase the traversal cost for them. In this paper, we present an efficient and effective garbage collector called HYBRIDGC in SAP HANA. HybridGC integrates three novel concepts of garbage collection: timestamp-based group garbage collection, table garbage collection, and interval garbage collection. Through experiments using mixed OLTP and OLAP workloads, we show that HYBRIDGC effectively and efficiently collects garbage versions with negligible overhead.
Article Type
In Proc. 42nd Int'l conf. on Management of Data (SIGMOD 2016), page. 1307 - 1318, 2016-06
Files in This Item:
There are no files associated with this item.


  • mendeley

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Related Researcher


Grad. School of AI
Read more

Views & Downloads