on 10-15-2015 6:24 PM
I'm testing the BO Data Services as ETL tool for data extraction having as Amazon Redshift destination. However, the performance for the inserts into Redshift is very bad.
On average, the Data Services inserts about 5 records per second. (I made tests with Talend and it already entered 22000 reg / sec)
We have a BODS running on the Linux platform x64.
For connection, we use an ODBC connection created and configured as Amazon recommendations.
http://docs.aws.amazon.com/redshift/latest/mgmt/install-odbc-driver-linux.html
Does anyone have any idea how I can provide more performance in data inserts in Redshift using bods?
Thanks
There's something in the error file. That message might give a hint. Can you post?
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.
What does your transformation do? Just dump loading on to target? I see a warning on your error screen. Also why did you chose ODBC? What is your DS version, i guess DS 4.2 onwards supports JDBC adapter.
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.
Hi Arun,
How I am testing ETL tools to put data on Redshift, this job is so simple, have no transformations. Just a query. I am looking for best performance ETL tool to realize it.
About ODBC or JDBC. I tried to create a JDBC adapter, but still can not use it.
I tried to create a JDBC adapter, but still can not use it.
I created a discussion on this forum about this my JDBC problem, but not get answers.
The BO version is 4.2.
I'm running the job again to get the warning messages, yet none was displayed.
Thanks
Hi Vitor,
If ODBC is still causing issue while loading to RedShift database then you can refer to below link
Regards
Arun Sasi
I have no idea of your target Database but in general either of the following must help
1) increasing number of rows per commit on your target table
2) If your destination supports any bulk loading techniques, you can try using those in the target table bulk loader options.
3) Please check the buffer size set in your target DB which is being loaded.
Thanks
Sameera.
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.
Hi Vitor,
I assume that you are on DS 4.2.
You can give a try with Redshit JDBC but ODBC connectivity might not work using third party driver has its limitations.
You can use JDBC connectivity to connect with AWS RedShift. You need to have redshift JDBC drivers or supporting drivers from vendor.
Refer to below link for configuring JDBC in Data Services
You can download JDBC drivers from amazon portal based on version.
http://docs.aws.amazon.com/redshift/latest/mgmt/configure-jdbc-connection.html
Here is the direct link to download .Jar file
https://s3.amazonaws.com/redshift-downloads/drivers/RedshiftJDBC4-1.1.1.0001.jar
You JDBC URL should include the database name
JDBC URL: jdbc:redshift://xxxxx.redshift.amazonaws.com:0000/database
JDBC Class: com.amazon.redshift.jdbc4.Driver
Regards
Arun Sasi
Hi Arun,
Yes, I am using BO 4.2.
I am using ODBC because I had some issues trying use the JDBC. I follow all steps from your links but something still wrong.
I created a discussion on this forum about this my JDBC problem, but not get answers.
If I solve this issues, I will try use JDBC connection to put data on Redshift and leave ODBC back.
Thanks
No ideias?
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.
User | Count |
---|---|
86 | |
10 | |
10 | |
9 | |
6 | |
6 | |
6 | |
5 | |
4 | |
3 |
You must be a registered user to add a comment. If you've already registered, sign in. Otherwise, register and sign in.