
Compression technique in HANA + comparisons

Former Member
0 Kudos

Hi guys,

I'm working on SAP HANA right now and trying to understand the database technology as a whole.

I found this video and PDF about SAP HANA compression (Compression | SAP HANA), but which technique does HANA actually use? Does it use all of them, do you choose one yourself, or how exactly does it work?

Additionally, it would be great to know whether there are any comparisons with the compression techniques of other databases, so I can evaluate whether the compression rate is good or not.

I appreciate any help!

Thanks
JP

Accepted Solutions (1)


Former Member
0 Kudos

If you are interested in this then you might consider watching the HPI in-memory course at http://open.hpi.de

The primary compression is dictionary encoding of columns.
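To make the idea concrete, here is a minimal Python sketch of dictionary encoding. It only illustrates the principle and is not how HANA implements it internally; the function names and the sample column are made up for this example.

```python
def dictionary_encode(column):
    """Split a column into a sorted dictionary and a vector of value IDs."""
    dictionary = sorted(set(column))  # each distinct value stored once
    value_id = {value: i for i, value in enumerate(dictionary)}
    attribute_vector = [value_id[value] for value in column]
    return dictionary, attribute_vector


def dictionary_decode(dictionary, attribute_vector):
    """Rebuild the original column from dictionary and value IDs."""
    return [dictionary[i] for i in attribute_vector]


# Hypothetical sample column
cities = ["Berlin", "Paris", "Berlin", "Rome", "Paris", "Berlin"]
dictionary, attribute_vector = dictionary_encode(cities)
print(dictionary)        # ['Berlin', 'Paris', 'Rome']
print(attribute_vector)  # [0, 1, 0, 2, 1, 0]
```

The column now holds small integers instead of repeated strings, and the integer vector is what the secondary compression works on.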

Secondary compression is applied to the attribute vector of the column. There are several algorithms, including run-length encoding, cluster encoding, prefix encoding, etc. This reduces the size of the attribute vector.

HANA doesn't try to compress too aggressively: the goal is to reduce memory bandwidth usage, but not at the expense of overall performance. Therefore all compression algorithms are scan- and cache-friendly.
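As a rough illustration of two of these schemes, the Python sketch below applies run-length encoding and prefix encoding to an attribute vector, i.e. a list of integer value IDs. The function names and sample data are assumptions for this example, and the real in-memory layouts in HANA are certainly different.

```python
def run_length_encode(value_ids):
    """Collapse consecutive repeats into (value_id, run_length) pairs."""
    runs = []
    for vid in value_ids:
        if runs and runs[-1][0] == vid:
            runs[-1][1] += 1
        else:
            runs.append([vid, 1])
    return [tuple(run) for run in runs]


def run_length_decode(runs):
    """Expand (value_id, run_length) pairs back into the full vector."""
    out = []
    for vid, count in runs:
        out.extend([vid] * count)
    return out


def prefix_encode(value_ids):
    """Store a leading run of one value as (value_id, count) plus the rest."""
    if not value_ids:
        return None, 0, []
    first = value_ids[0]
    count = 0
    while count < len(value_ids) and value_ids[count] == first:
        count += 1
    return first, count, value_ids[count:]


# Hypothetical attribute vector (value IDs), e.g. from a sorted column
ids = [0, 0, 0, 0, 1, 1, 2, 2, 2, 0]
print(run_length_encode(ids))  # [(0, 4), (1, 2), (2, 3), (0, 1)]
print(prefix_encode(ids))      # (0, 4, [1, 1, 2, 2, 2, 0])
```

Both schemes pay off when equal value IDs sit next to each other, which is why the sort order of the rows matters for the achievable compression.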

lbreddemann
Active Contributor
0 Kudos

John's right - the current (!) HPI lecture is the most extensive description of the various compression techniques applied in SAP HANA that is out there.

Special focus has been put on the compression of the attribute vector and of the dictionary entries.

They also nicely cover the variable bit-length encoding based on the number of distinct values that need to be stored.
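The arithmetic behind this is simple: with n distinct values, each value ID needs ceil(log2(n)) bits. Here is a small Python sketch of that calculation plus a naive bit-packing routine. It is only meant to show the principle; the helper names are invented for this example and the packing into a single Python integer is not how HANA lays out memory.

```python
def bits_needed(distinct_count):
    """Bits per value ID for a column with the given number of distinct values."""
    return max(1, (distinct_count - 1).bit_length())


def pack(value_ids, width):
    """Pack value IDs into one integer, `width` bits per entry."""
    packed = 0
    for pos, vid in enumerate(value_ids):
        packed |= vid << (pos * width)
    return packed


def unpack(packed, width, count):
    """Read `count` value IDs of `width` bits each back out."""
    mask = (1 << width) - 1
    return [(packed >> (pos * width)) & mask for pos in range(count)]


print(bits_needed(3))          # 2
print(bits_needed(1_000_000))  # 20

ids = [0, 1, 0, 2, 1, 0]       # hypothetical value IDs, 3 distinct values
width = bits_needed(3)
packed = pack(ids, width)
print(unpack(packed, width, len(ids)))  # [0, 1, 0, 2, 1, 0]
```

So a column with a million distinct values needs 20 bits per row rather than a full 32-bit or 64-bit integer, and a column with only three distinct values gets by with 2 bits per row.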

Really a recommendable read for anyone who actually wants to know the details of this otherwise completely automatic functionality.

- Lars

Answers (1)


Former Member
0 Kudos

Thank you guys - that helped a lot!