Skip to Content
0

BO PERFORMANCE ISSUE WITH CLOUDERA IMPALA HADOOP

Jan 31 at 11:16 PM

123

avatar image
Former Member

Hi Experts,


Hope you all are doing well. I want to take some expert opinion here: We are switching from Oracle to Hadoop due to slow performance with Oracle DB, built a universe with Cloudera Simba ODBC connections scheduled a report expecting a faster performance compare to Oracle DB but the report took more than 2 hours, took the same query and ran in HUE SQL editor the result got back in less than 2 mins

We tested in DEV, TEST, & PROD, & also tried switching to JDBC connection little improvement in performance, we feel its the network's latency issue. Points to note here that our Hadoop servers and BO servers are in two different locations NCAL and SCAL, we have 3.5 million records to pull

I am looking for some tested advice here on this issue if anyone has already faced such issue
Regards,


Ahmed

10 |10000 characters needed characters left characters exceeded
* Please Login or Register to Answer, Follow or Comment.

2 Answers

Sonet Kebede
Feb 01 at 08:01 PM
0

Hello,

What's the version of BO you are using?

What's the version of Hadoop (Hive1 or Hive2)?

What is the reason you are using Cloudera Simba ODBC\JDBC vs Apache Simba JDBC \ODBC

If you have Hadoop than you should use Apache Simba JDBC \ODBC

Thanks,

Sonet

Show 1 Share
10 |10000 characters needed characters left characters exceeded
Former Member

Bo version is 4.2. I am not sure about Hadoop Version I use the HUE editor web interface to run queries and the HUE version in 3.10. We are using Impala in hadoop so using Cloudera Impala 2.0 - Simba JDBC Drivers as connection.

0
Sonet Kebede
Feb 02 at 03:55 PM
0

Hello,

If the data source is Hadoop than you should use Apache Simba JDBC \ODBC. Please check our supported platform.

I hope this could help.

Thanks,

Sonet

Share
10 |10000 characters needed characters left characters exceeded