
HANA calculation view with heavy joins and calculated columns (performance)

May 26, 2017 at 08:33 PM

Former Member

Hi experts,

Could you please take a look at the problem described below?

Calculation view description (please also see an image):

Projection 6 gets its data from a table that contains roughly 700,000,000 records and grows by roughly 100,000,000 a month. The same applies to Projection 7 – it reads from a different table, but that table is 1:1 with the table feeding Projection 6. Projection 8 reads from an even larger table. All tables are SAP standard tables. The joins that come after the union only add some master data and texts; they don’t affect performance drastically and can be ignored.

All the necessary filters are applied to decrease the data volumes. Users have an option to filter on calendar month period (for example, start: 201701, end: 201704).

Problem:

All the tables feeding projections 6, 7 and 8 are very different in structure. That’s why I can’t use a union – joins are required. Aggregations are also required before Union 1 to combine the data from Projection 4 and Join 9. With such data volumes we’re experiencing performance issues: slow runtimes and high memory and CPU consumption. I don’t really know the reason behind the performance issue, but I have several assumptions:

1) Join 9 contains a calculation that derives a date (VARCHAR of length 8) from a timestamp (a DECIMAL field of length 14 that stores date and time, for example 20170526112351). I need it converted to a date so I can aggregate the data before doing further joins – that improves performance. I can also perform this calculation in Projection 6 (where the timestamp comes from), but it doesn’t really change the situation – the calculation increases memory consumption either way. The formula looks like this: leftstr(string("ACTIVITY_DATE"),8).
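To make the two conversion strategies concrete, here is a small Python sketch (Python stands in for the HANA expression language; the HANA-side equivalent of the arithmetic variant is an assumption and should be verified on your revision). The point is that the time portion of a YYYYMMDDHHMMSS decimal can be dropped either by string slicing, as the formula above does, or by pure integer arithmetic, which avoids a per-row string conversion:

```python
def date_via_string(ts: int) -> str:
    # Equivalent of leftstr(string("ACTIVITY_DATE"), 8):
    # convert to string, keep the first 8 characters (YYYYMMDD).
    return str(ts)[:8]

def date_via_arithmetic(ts: int) -> int:
    # Arithmetic alternative: drop the 6 time digits (HHMMSS) by
    # integer division. In a HANA expression this would be something
    # like int("ACTIVITY_DATE" / 1000000) -- an assumption, check the
    # exact decimal/int conversion behavior on your system.
    return ts // 1_000_000

ts = 20170526112351
print(date_via_string(ts))      # '20170526'
print(date_via_arithmetic(ts))  # 20170526
```

Both variants identify the same calendar day; whether the arithmetic form is actually cheaper in the calculation view engine would need to be confirmed with Visualize Plan.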

2) Could the reason for the performance issues be not this conversion, but rather the fact that the tables behind projections 6, 7 and 8 are partitioned, and that this somehow affects the performance of the joins?

Questions:

If assumption 1) is correct, is there a more efficient way of converting this timestamp (DECIMAL) to a date (VARCHAR)?

Can assumption 2) be correct at all?

Maybe there’s some other potential reason that I’m missing here?

capture.jpg (54.2 kB)

2 Answers

Former Member May 31, 2017 at 04:04 PM

Hi Vlad,

What is your HANA system version?

Have you tried running Visualize Plan for this view? What is the current runtime of the view?

1) Assumption 1 can be verified by creating the logic as a 'Generated Column' at table level and seeing whether performance changes.

2) Whether partitioning is causing the performance issue can be verified by checking the network transfer time in the 'Visualize Plan' of the view.

How and where are the user filters created in the view (Variable or Input Parameter)? Is it possible to apply this filter at the lowest projection as an input parameter (maybe at Projection 6)?

Regards,

Venkat

Former Member

Hi,

Thank you for the reply.

HANA database release is 1.00.122.07.

I have tried Visualize Plan. The view can run for several minutes.

1) Could you please give more background on 'Generated Columns' (some useful links would be much appreciated)? Is it something I can create for standard SAP CRM tables? And does it really make sense to create them for tables with roughly 700,000,000 records?

2) As for the partitions, in Visualize Plan I can see that most of the runtime is spent in the joins across partitions. That's why I have this suspicion.

My user filter criteria are already applied at the lowest level (Projection 6). It's an input parameter with a stored procedure: the user enters start and end calendar months like '201703' and '201704', and the stored procedure converts them into decimal values similar to timestamps, like 201703000000 and 201704235959. So at the lowest level the filter looks like this: timestamp >= 201703000000 and timestamp <= 201704235959.
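As a quick illustration of that conversion, here is a minimal Python sketch of what the stored procedure does as described above (function and variable names are illustrative, not the actual procedure):

```python
def month_bounds(start_month: str, end_month: str) -> tuple[int, int]:
    # As described in the post: pad the month strings into
    # timestamp-like decimal values.
    # '201703' -> 201703000000 (append '000000' for the lower bound)
    # '201704' -> 201704235959 (append '235959' for the upper bound)
    low = int(start_month + "000000")
    high = int(end_month + "235959")
    return low, high

low, high = month_bounds("201703", "201704")
print(low, high)  # 201703000000 201704235959
```

The view's filter then becomes timestamp >= low and timestamp <= high on the DECIMAL column.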

Best regards,
Vlad

Zahid Yener Jul 13, 2017 at 12:10 PM

Hi,

Here is what you can do:

1. Apply the filters as early as possible.

2. Try to use Input Parameters.

3. Try to simplify your CV if possible try to break in parts.

4. Try to use UNIONs if you need 2 fact tables.

5. If you need to use joins, use LEFT OUTER/REFERENTIAL joins (I personally use LEFT OUTER) instead of INNER.

6. On joins, set 'Optimize Join' to TRUE.

7. Only project the columns you need. Don't bring up all the columns in a table to the next node.

8. Avoid using IF-THEN-ELSE calculations

9. Aggregate as early as possible

10. Try to avoid joining on calculated columns

11. Try to create calculated columns in Aggregation Node.

Your goal should be to work only with the data you need. If you are trying to work on all the data, that's a problem.
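The "filter early, aggregate early" points above can be sketched in a toy Python example (the data and numbers are made up purely for illustration; in a calculation view these steps map to the filter on the lowest projection, an aggregation node before the join, and the master-data join afterwards):

```python
from collections import defaultdict

# Made-up fact rows; one falls outside the user's month range.
fact = [
    {"month": 201703, "cust": "A", "amount": 10},
    {"month": 201703, "cust": "A", "amount": 5},
    {"month": 201704, "cust": "B", "amount": 7},
    {"month": 201612, "cust": "A", "amount": 99},  # filtered out
]

# 1. Filter early: apply the user's month range first.
filtered = [r for r in fact if 201703 <= r["month"] <= 201704]

# 2. Aggregate early: sum per (month, customer) before any join.
agg = defaultdict(int)
for r in filtered:
    agg[(r["month"], r["cust"])] += r["amount"]

# 3. Join the now-small aggregate with master data (texts).
master = {"A": "Customer A", "B": "Customer B"}
result = [(m, master[c], v) for (m, c), v in sorted(agg.items())]
print(result)  # [(201703, 'Customer A', 15), (201704, 'Customer B', 7)]
```

The join in step 3 touches 2 rows instead of 4; at the scale in the question (hundreds of millions of rows) the same ordering is what keeps the join's intermediate result small.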

Also check the links and documents below for best practices on tuning and performance in SAP HANA:

https://blogs.sap.com/2014/03/26/hana-modeling-good-practices/

https://blogs.sap.com/2015/11/03/the-art-and-science-of-hana-performance-modeling/

http://www.hdespot.com/wp-content/uploads/2015/11/SAPHDE_Webinar-The-Art-Science-of-Tuning-HANA-Models-for-Performance_Abani_Pattanayak_SAP-Nov_2015.pdf
