Skip to Content
avatar image
Former Member

What´s the best approach to work with Excel, csv files

Hi gurus. I got a question for you. According to your experience what's the best approach to work with Excel or csv files that have to be uploaded through DataServices to you datawarehouse.

Let's say your end-user, who is not a programmer, creates a group of 4 excel files with different calculations in a monthly basis, so they can generate a set of reports from their datawarehouse once the files have been uploaded to tables in your DWH. The calculations vary from month to month. The user doesn't have a front-end to upload the excel files directly to Data Services. The end user needs to keep a track of which person uploaded the files for a determined month.

1. The end user should place their 4 excel files in a shared directory that will be seen by DataServices.

2. DataServices will execute certain scheduled job that will read the four files and upload them to the Datawarehouse at a determined time, lets say at 9:00pm.


It makes me wonder... what happens if the user needs to present their reports immediately so they can´t wait until 9:00pm.  Is it possible for the end user to execute some kind of action (out of the DataServices Environment) so DataServices "could know" that it has to process those files right now, instead of waiting for the night schedule?


Is there a way that DS will track who was the person who uploaded those files?


Would it be better to build a front-end for the end user so they can upload their four files directlyto the datawarehouse?

Waiting for your comments to resolve this dilemma

Best Regards

Erika

Add comment
10|10000 characters needed characters exceeded

  • Get RSS Feed

1 Answer

  • avatar image
    Former Member
    Sep 10, 2014 at 03:06 PM

    Hi,

    There are functions in DS that captures the input files automatically. You could use file_exists() or wait_for_file() option to do that. Schedule the job to run every certain minute and if the file exists then run. This could be done by using a certain file name with date and timestamp etc or after running move the old files to archive and DS wait for new files to show up.

    Check this - Selective Reading and Postprocessing - Enterprise Information Management - SCN Wiki

    Hope this helps.

    Arun

    Add comment
    10|10000 characters needed characters exceeded