on 06-26-2011 2:26 AM
Hi Experts,
I am new to Data Services. My requirement is to extract data from 3 different source systems (flat files, Oracle, and mainframes), consolidate the data, cleanse it, and send the files to the target systems.
Can anyone please let me know the initial steps from a high level? For example, do I first need to create 3 different datastores, one for each legacy system?
Thanks for your help in advance.
Naga.
From a high level:
1) Obtain access (logins & passwords) to your three sources of data. Make datastores pointing to them.
2) Pull your sources of data into one unified place, wherever you're going to find it convenient to profile and analyze the data (see the staging-table sketch below). Personally, I like SQL Server, because I'm most adept at writing T-SQL, but it doesn't matter -- MySQL, Oracle, Access, whatever.
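To picture step 2, here's a minimal sketch of a staging table in SQL Server. The table and column names are hypothetical -- shape them to whatever your actual sources look like:

    -- Hypothetical staging table: one row per customer record, tagged
    -- with the system it came from so you can compare sources later.
    CREATE TABLE stg_customer (
        source_system   varchar(20)   NOT NULL,  -- e.g. 'ORACLE', 'DB2', 'FILE'
        customer_id     varchar(50)   NOT NULL,
        customer_name   varchar(200)  NULL,
        load_timestamp  datetime      NOT NULL DEFAULT GETDATE()
    );

One table per source entity, everything landed as-is, is usually enough for profiling; save the consolidation for after you know the data.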
And now the real work begins. Put Data Services down, pick up your query and/or reporting tool of choice, and start looking at the data to develop your "real" specification. Unless you're blessed with a specification from a hot-shot business analyst, developed in full knowledge of the source data and based on a thorough, picky, detailed analysis thereof, it's likely that your spec is really a wishlist. That's fine -- you need to know what they want -- but if they want fresh mangoes every day at the South Pole, it might be a challenge. Meaning: you need to figure out if, or how well, the data supports the desires.

Say they want a "100% accurate customer list." That's impossible. One of their customers might be moving and getting a sex change operation today, and there's no way to know that today's Bob of Kentucky will be Roberta of Tennessee next week. You can make their data better; you can integrate a lot of it. But where it gets interesting is with the edge cases, where they'll need to decide how much they want to pay to get from 99.78% accurate to 99.83% accurate.

DO NOT START ETL CODING UNTIL YOU'VE THOROUGHLY ANALYZED THE DATA and found the edge cases: the "whoops," the "well, we didn't think about that," the "oh, we didn't think there would ever be more than one of those," the "oh, gee, we figured they always had at least one of those." It's sort of like the role of mise en place in cooking -- you don't want to be caught with your garlic unminced just when your onions have gotten soft, do you? If your current spec says things like "pull their home address," and nobody's checked how "home addresses" are identified as such, whether everybody has at least one, whether there can be multiples, and, when there are multiples, how to pick the "primary" one, then you've just got a wishlist.
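To make that concrete: a profiling query like the one below (against the hypothetical stg_customer and stg_address staging tables) surfaces exactly those edge cases -- customers with zero home addresses, or with several:

    -- Hypothetical table/column names: find customers whose count of
    -- 'home' addresses is anything other than exactly one.
    SELECT c.customer_id,
           COUNT(a.address_id) AS home_address_count
    FROM   stg_customer c
    LEFT JOIN stg_address a
           ON  a.customer_id  = c.customer_id
           AND a.address_type = 'HOME'
    GROUP BY c.customer_id
    HAVING COUNT(a.address_id) <> 1;

Every row that query returns is a decision your business users need to make before you write a single transform.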
Once you actually know your data and what the hell you're doing, then do the ETL coding, which will be fairly easy if you've done the mise en place correctly. The hard work of ETL comes before the ETL.
Best wishes,
Jeff Prenevost
Thanks Jeff for your input.
They want to extract all the data from those 3 legacy systems and stage it in SQL Server. To do this, do I create one job with one workflow, create 3 source datastores (one for Oracle, one for DB2, and one for the files), use a Query transform, and create 1 target datastore (with the database being SQL Server)?
And for profiling and data cleansing, do I create 2 jobs, one for profiling and the other for data cleansing?
Please advise.
Thanks,
Naga
Yes, you'll need four datastores, as you've outlined.
As for how many jobs, workflows, and dataflows: it doesn't matter all that much during development. Generally speaking, during dev you're likely to have one "test" job where you put one unit of code at a time, working with it till the unit is right. As you get each hunk right (typically, dataflows), you assemble them into a sequence, based on dependencies, inside one or more workflows, which in turn can be nested and sequenced. At the end, you may end up with one job, "Daily Job", in which you put all your code that needs to run daily (typically). Or you may end up with three jobs. Or four. It all depends on the data and business needs, and the application of some common sense.
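For what it's worth, a cheap sanity check while you're proving out each unit is a simple reconciliation query against your staging area (table name hypothetical, per the sketch earlier in this thread):

    -- Row counts per source system, to compare against what each
    -- dataflow actually read -- a quick "did I lose anything?" check.
    SELECT source_system, COUNT(*) AS row_count
    FROM   stg_customer
    GROUP BY source_system;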
You're encouraged to keep your units of work relatively small, simple, and discrete; don't try to write dataflows with hundreds of transforms stretching all over the place. You can, yes, but you'll find them hard to handle. But all that's pretty easy stuff, Naga, and I suggest you're focusing on unimportant matters. 99% of the work is going to involve dealing with the data itself, profiling it, and writing your spec, aka the "ETL mapping document", unless you're lucky enough to have someone competent write it for you.
Best wishes,
Jeff Prenevost