Load all HANA data to HADOOP

Hi folks,

My Hadoop team is looking for a way to replicate all ECC data into a Hadoop system, using HANA as the middleman. I've been doing some research, and mostly I'm finding SCN posts about doing the opposite: consuming aggregated or reduced Hadoop data in HANA. To me that's how Hadoop should be leveraged, but I'm trying to see what others are doing. My questions:

1) Is anybody doing this (loading all HANA data into Hadoop on a daily basis)? Is this really best practice, and is there any good documentation on this approach?

2) Does anybody know much about HANA Dynamic Tiering? I was asked to look into it, and as far as I can tell it seems to be intended for BW scale-out systems. Our HANA system, which contains the replicated SAP data, is a single-node scale-up system, so I do not think it can be leveraged.

3) I'm not clear on how a delta mechanism would work. Is it possible to somehow use the log files in HANA to pull inserts/updates/deletes?

Thanks,

-Patrick

1 Answer

  • Best Answer
    Former Member
    Posted on Apr 21, 2016 at 04:20 PM

    Hi Patrick,

    1) Your scenario, from HANA to Hadoop, is interesting. Until now I have only seen the opposite direction. Can you please share the business use case, so that we can think of alternatives?

    I think even SAP HANA Vora focuses on moving data from Hadoop to HANA, and not vice versa.

    2) On HANA Dynamic Tiering, did you get a chance to look at the link below?

    https://hcp.sap.com/content/dam/website/saphana/en_us/Technology%20Documents/SPS09/SAP%20HANA%20SPS%2009%20-%20Dynamic%2…

    Regards,

    Rajesh


    • Former Member Patrick Bachmann

      Hello All,

      I am starting on the same task as described above: replicating/moving data from SAP HANA to Hadoop.
      I have shortlisted the options below. Kindly advise, or correct me if I am wrong.

      1. SAP Data Services 4.2

      2. SAP Replication Server

      3. Sqoop

      4. Data Lifecycle Manager

      Kindly suggest which of the methods listed above would be the most reliable and easiest to deploy.
      I am also open to new suggestions.

      Regards,

      Shekhar
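
For option 3 in the list above, an incremental Sqoop import could look roughly like the sketch below. It is only an illustration, not a tested setup: the function name `build_sqoop_import` and the host, port, table, user and paths in the usage note are hypothetical placeholders, and it assumes the SAP HANA JDBC driver (`ngdbc.jar`) is available to Sqoop. Sqoop's `append` mode only picks up rows whose check column is greater than the last imported value, so on its own it will not capture updates or deletes.

```python
from typing import List, Optional


def build_sqoop_import(
    jdbc_url: str,
    table: str,
    target_dir: str,
    username: str,
    password_file: str,
    check_column: Optional[str] = None,
    last_value: Optional[str] = None,
    mappers: int = 1,
) -> List[str]:
    """Build the argument list for a `sqoop import` of one table into HDFS.

    Without check_column this is a full load. With check_column it becomes
    an incremental load in Sqoop's `append` mode (new rows only).
    """
    cmd = [
        "sqoop", "import",
        "--connect", jdbc_url,
        "--driver", "com.sap.db.jdbc.Driver",  # SAP HANA JDBC driver class
        "--username", username,
        "--password-file", password_file,      # avoids a password on the command line
        "--table", table,
        "--target-dir", target_dir,
        "--num-mappers", str(mappers),         # 1 mapper needs no split column
    ]
    if check_column is not None:
        # Import only rows where check_column > last_value.
        cmd += ["--incremental", "append", "--check-column", check_column]
        if last_value is not None:
            cmd += ["--last-value", last_value]
    return cmd
```

A call such as `build_sqoop_import("jdbc:sap://hana-host:30015", "MARA", "/data/ecc/mara", "LOADER", "/user/etl/.hana.pw", check_column="CHANGE_ID", last_value="1000")` returns a list that could be passed to `subprocess.run(cmd, check=True)` on a node where Sqoop is installed. The column `CHANGE_ID` is an assumed, monotonically increasing column; a real table may not have one.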
