
SAP HANA vs. MSSQL on Amazon AWS

Former Member
0 Kudos

Hi,

I'm testing the performance of HANA vs. MSSQL on Amazon AWS.

I have a little testing application written in VB.NET that connects to HANA through ODBC (.NET System.Data.Odbc) and to MSSQL through SqlClient (.NET System.Data.SqlClient).

I'm using 3 machines on AWS:

AppSrv:    t1.micro          (Application)

HANA:     m2.4xlarge      (HANA DB)

MSSQL:   m2.4xlarge     (MSSQL DB)


On both HANA and MSSQL I have a table called "FORSELECTER" (ROWSTORE) with 1 million rows.

Here's the structure of the table:

Name          SQL Data Type   Dim
CODE          NVARCHAR        4000
DESCRIPTION   NVARCHAR        4000
PRICE         INTEGER
SCP0          NVARCHAR        4000
SCP1          NVARCHAR        4000
...           NVARCHAR        4000
...           NVARCHAR        4000
SCP30         NVARCHAR        4000

No indexes have been used.

Doing a "SELECT TOP 1 *" would give you:

CODE      DESCRIPTION   PRICE     SCP0   SCP1   SCP..   SCP...   SCP30
TC43440   TD43440       2728999   XXX    XXX    XXX     XXX      XXX

Taking some code from my testing application:

Here are the objects I use for connecting to the DB (HANA or MSSQL)

...

   Private connection As System.Data.Common.DbConnection = Nothing

   Private command As System.Data.Common.DbCommand = Nothing

   Private reader As System.Data.Common.DbDataReader

...

I initialize them differently depending on the DB I'm working with at that moment:

...

      Select Case mUsedDbAPI

         Case DbAPIEnum.Odbc

            '

            connection = New System.Data.Odbc.OdbcConnection(sConStrHANA)

            command = New System.Data.Odbc.OdbcCommand()

         Case DbAPIEnum.SqlClient

            connection = New System.Data.SqlClient.SqlConnection(sConStrMSSQL)

            command = New System.Data.SqlClient.SqlCommand()

      End Select

...

For testing purposes I also tried not to use inheritance:

...

   Private hConnection As System.Data.Odbc.OdbcConnection = Nothing

   Private hCommand As System.Data.Odbc.OdbcCommand = Nothing

...

      hConnection = New System.Data.Odbc.OdbcConnection(sConStrHANA)

      hCommand = New System.Data.Odbc.OdbcCommand

Attached you'll find a text file (results.log) with some SQL statements and, for each, the rows affected (Rows), the time in seconds (s) and the speed (Rows/s). Copy-pasting from that file:

**************************

Odbc (this means I'm connecting to HANA using ODBC with inheritance)

SELECT *

FROM FORSELECTER

WHERE  SUBSTRING(CODE, 3, 4) = '3991' (111 Rows) (0.1716022 s) (646.844853970404 Rows/s)

**************************

SqlClient (this means I'm connecting to MSSQL using SqlClient)

SELECT *

FROM FORSELECTER

WHERE  SUBSTRING(CODE, 3, 4) = '3991' (111 Rows) (0.0312004 s) (3557.64669683722 Rows/s)

**************************

HANA SPECIFIC (this means I'm connecting to HANA , NO inheritance)

SELECT *

FROM FORSELECTER

WHERE  SUBSTRING(CODE, 3, 4) = '3991' (111 Rows) (0.1248184 s) (889.291963364376 Rows/s)

As you can see in these and the other examples in the attached file, MSSQL is faster than HANA.

Since HANA works in memory and MSSQL works on disk (I avoided localized selects, cleaned the buffer, etc.), there must be something I am missing.

Any clues? Maybe some settings or other things?

Accepted Solutions (0)

Answers (4)


Former Member
0 Kudos

This is a very poor use case for comparison. The reason that column stores are generally faster than row stores is because, in the real world, SELECT * operations are not very common. If you are executing a SELECT *, then it will perform better on a row store than a column store. In a row store, all columns in a row (SELECT *) can be read with one I/O operation - not so with a column store. However, in reality, we find it's common to have tables with, for example, 100 columns, and you're only selecting 5 of them. So, when the number of columns selected is comparatively low, and the number of rows selected is higher, this is when a column store really shines.
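For illustration, this is roughly the shape of query where the column store pays off (just a sketch against your FORSELECTER table; the filter value is made up). Only two of the ~33 columns have to be touched, so far less data is read than a row store would need:

   SELECT CODE, PRICE
   FROM FORSELECTER
   WHERE PRICE > 1000000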

As for the in-memory portion, a simple read from a fully buffered table in a single-user database will not show true-to-life performance of a disk-based database. So, in essence, you're comparing the best case for SQL Server vs the worst case for HANA.

Cheers,

David.

Former Member
0 Kudos

I have to disagree with David here. There are no "poor use cases for comparison" between row and column store performance.

The reality is that HANA must aim to perform SQL at least as well as a disk-based row store, even when a table is set up as a column store. A custom application shouldn't be forced to change its SQL to get to comparable performance on a column store. It's the same for ERP on HANA, where we can't completely rewrite ERP to get to comparable performance.

Sure, the "SELECT *" puts the column store at a natural disadvantage, especially for a large number of columns, but that's SAP's problem to fix and not that of the user. The column store should always be the default, except for special cases like customization tables that don't change a lot and for which no analytics are required.

The value of HANA will come from analytical applications where you can expect significant performance improvements on the column store, not just comparable OLTP performance. As some have said here, even conventional disk-based database systems often satisfy requests from their in-memory buffers, so they are fast too. What conventional disk-based database systems can't do is perform parallelized aggregations of millions of records inside the database. But for that, it is often necessary to rewrite some of the logic and push it down to the database layer using SQL Script, giving up logic in Visual Basic or Java.
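As a purely illustrative sketch (procedure name, parameter name and grouping column are just examples), such a push-down in SQL Script could look roughly like this:

   CREATE PROCEDURE PRICE_TOTALS (OUT totals TABLE (SCP0 NVARCHAR(4000), TOTAL_PRICE BIGINT))
   LANGUAGE SQLSCRIPT READS SQL DATA AS
   BEGIN
      -- the aggregation runs inside HANA instead of row-by-row in the VB.NET client
      totals = SELECT SCP0, SUM(PRICE) AS TOTAL_PRICE
               FROM FORSELECTER
               GROUP BY SCP0;
   END;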

Users shouldn't expect breakthrough performance on the column store if you do a "SELECT *". Worst case, HANA may in some circumstances actually not be faster than a conventional disk-based system. Sure, you use an artificial number of wide columns, which would probably not be found in a good database design, but I guess you wanted to test the hypothesis that HANA can't be faster in these circumstances. However, in this case the difference seems a little too large (170 ms vs. 30 ms) for a relatively simple query. It could be an issue with SUBSTRING preventing the query from parallelizing, as Daniel suggests, or an issue in the ODBC driver. Or it may indeed be that there is optimization potential for the large number of wide columns you have used.

Have you executed the query in HANA Studio, bypassing ODBC, to see whether the runtime looks any better? I also like Ravi's tests 3 and 4. I'm not sure whether indexes will make a big difference; I was told that even our largest customer projects have not required indexes, simply because the in-memory performance is so good.

Cheers,

Michael

Former Member
0 Kudos

Well, I must point out I'm not comparing the two RDBMS "in general". I am comparing them in a specific situation in which selecting all the columns in a table is heavily used, and I'm quite sure I'm working in the real world.

I am evaluating HANA to see if it has features that can improve our business, and so far the row store is what I have tested, as SELECT * is what we mainly do.

I'm still testing though; as I said, I will post all my results here.

Former Member
0 Kudos

Hi David,

Fully agree that you are indeed comparing apples to apples. HANA, row store, fully in memory vs. MSSQL, row store, fully on disk. And the results don't look too encouraging for HANA for this particular scenario.

Two things to keep in mind:

  • HANA is not just another apple (read: just another relational database). It shines for some usage scenarios (like the aggregation functions mentioned by Ravindra; try a SUM on your PRICE column combined with a GROUP BY on any other column, of course using the column store in HANA - see the sketch after this list), but does not provide big improvements for others. You happened to test one (SELECT * in combination with row store) that is not a particular strength of HANA. Daniel, David and Ravindra pointed that out already.
  • The AWS deployment option is a low-cost option for developers. It doesn't make use of many of the low-level tweaks HANA uses to make the best use of the hardware. CPU caches, parallel processing, ... - everything is a little bit crippled on AWS, especially on those ec2.* instances (the cc2 instances for HANA One use a bare-metal hypervisor that lets HANA talk much more directly to the hardware; I would expect the results to be better for all tests you might be doing).
    At SAP, we are aware of the performance implications of the AWS deployment - and that's why the EULA (End-User License Agreement) you had to accept to get access to those instances explicitly mentions that "You may not ... (c) publish any results of benchmark tests run on the Software or involving the Software,".
    I'm not telling you to "stop what you are doing, it violates your license" - I'm just asking you to take the results with the appropriate amount of salt.
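The aggregation test from the first bullet would, as a rough sketch, be something like this (first converting the table to column store; SCP0 is just an example grouping column):

   ALTER TABLE FORSELECTER ALTER TYPE COLUMN;

   SELECT SCP0, SUM(PRICE)
   FROM FORSELECTER
   GROUP BY SCP0;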

cheers

--Juergen

Former Member
0 Kudos

Oops... I didn't really read the whole EULA... thank you for pointing that out.

I'm testing cc2 instances as well, but it seems I cannot publish those results then.

Though what I've published before is not meant to give others a comparison, but to ask whether I forgot something like a setting, a command for enabling some kind of performance uplift, etc.

Thank you for your answer,

Cheers,

David.

Former Member
0 Kudos

Hi David,

I didn't want to scare you away. Feel free to continue with your tests and feel free to publish your results here. Nobody will go after you (maybe don't mention "benchmark"), and the information you generate is indeed helpful. When you experiment more with HANA, you will see that the way you ask your question can have a dramatic impact on response times...

--Juergen

former_member184768
Active Contributor
0 Kudos

Hi David,

In my opinion, SELECT * in a traditional database would be easier (and faster), as the entire block is read into the buffer. But if the row length exceeds the contiguous blocks, leading to row chaining / row migration, then performance will suffer.

In HANA, constructing the row for SELECT * might go through a similar operation to row chaining, where the row could be assembled by reading from memory locations that may not be contiguous due to the columnar storage. I am not completely sure how the data is actually written to memory blocks in HANA; maybe somebody from SAP can help us understand this.

Regarding indexes on HANA, my query performance test cases have shown improvements. SAP also recommends creating indexes if the search columns are not part of the primary key, and there is a default index on the primary key columns. So all in all, an index should not be a bad idea.
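For reference, a plain secondary index in HANA is created with standard SQL, for example (index name chosen arbitrarily here):

   CREATE INDEX IDX_FORSELECTER_CODE ON FORSELECTER (CODE);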

This discussion is really good, as it also raises the question of whether HANA can be used for traditional OLTP applications. I am sure there are use cases for that, but considering the cost, I think the usage may not be as wide as for existing databases.

Regards,

Ravi

Former Member
0 Kudos

Hi Juergen,

sorry, but I don't understand how HANA row store vs. SQL Server row store can be a good comparison. The default - unless it's absolutely clear that there will be no aggregates on the table - should be the column store for HANA. And as Daniel has pointed out, only the HANA column store will provide the performance, thanks to parallelizing the queries. It's SAP's job to compensate for the natural overhead of "SELECT *" on the column store. It should also be understood that there are use cases where we just can't do a good enough job and won't get to performance comparable to a disk-based row store. Why we get worse performance in this particular case, where only 111 of a million records are selected, is the real question here.

Cheers,

Michael

Former Member
0 Kudos

Hi David (Mandujano),

can you say more about the specific use case? I just find it odd to have a table with what looks like 30 placeholders of 4000 characters each. And why do you have a lookup on a substring?

The whole thing looks like a generic highly de-normalized database schema, so I'm sure you have a reason to design it that way. Just curious ...

Thanks,

Michael

Former Member
0 Kudos

Well, the DB we use is denormalized on purpose to speed up data retrieval.

Actually, the real tables we work on have Y*X columns, Y being the granularity of detail and X the number of different data items we are interested in. We lay them out this way so that we get all the requested data with fewer I/Os.

The queries here are just the very first testing ones. I wanted to break out of the index (CODE is the primary key), which is why I used the SUBSTRING: to see how a table scan was performing.
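In other words, a lookup like the first statement below should be able to use the primary key on CODE, while the second one (the tested case) forces a full scan because the predicate is a function of CODE (the CODE value is taken from the sample row above):

   SELECT * FROM FORSELECTER WHERE CODE = 'TC43440'

   SELECT * FROM FORSELECTER WHERE SUBSTRING(CODE, 3, 4) = '3991'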

Former Member
0 Kudos

Hi David,

ok, I think I understand your objective. A HANA column store table with parallelized queries against a long table scan on a disk-based row store is quite an interesting comparison.

If I calculate correctly, your database is about 33 fields x 4000 characters x 1m records, so about 132 GBytes. Sounds about right? HANA compression will probably bring it down significantly, but I guess you are pushing against the 60 GB limit of your Amazon instance.

In the work that we have done with startups, a lot of companies were quite happy with AWS, but some have preferred physical non-virtualized HANA instances because of large data sets. Obviously, for multi-core parallelized scans HANA uses awareness of the underlying hardware for optimizations, for example size of hash tables aligned with processor caches. The virtualization layer of Amazon tries to emulate the hardware, but obviously can't do a perfect job here.

Whether this is a situation where you push against today's architectural limits of Amazon virtualization technology is hard to say. It's certainly possible that a disk-based long table scan performs better because HANA just can't utilize the hardware optimizations like on a real physical non-virtualized HANA instance.

As I said, the best is to gradually reduce possible issues, and I would start with executing the query in HANA Studio. I doubt that ODBC performance is an issue, but it's worth checking. Next, we would have to look at the query performance itself. Daniel is very familiar with query performance optimizations, so he may be able to help more.

Thanks,

Michael

danielculp
Explorer
0 Kudos

Hi David,

In your concrete case column store should actually perform better, as it is well optimized for executing simple table scans in parallel.

Best Regards

Daniel

former_member184768
Active Contributor
0 Kudos

Hi David,

Can I recommend / ask for clarification on a few things:

1) Do you have a primary key defined for the table, and does the WHERE clause use the primary key columns? This information is needed to check whether the primary key index is available in HANA and MSSQL and how it performs.

2) In earlier versions of traditional databases, if a function was applied to the primary key column in the WHERE clause, the index search was skipped. I am not sure, but that may also be the case in HANA.

3) Can you please try some of the aggregation functions, like count(column_name), min(), max() or sum(), and compare the performance (see the sketch below)?

4) In my opinion, constructing the row, or decompressing the column store to form a row, might be an expensive operation. But the search on a particular column should be fast, provided it is indexed or the search is performed without any data conversion / data pattern search.

If you can try some of the tests mentioned above, it would be helpful for all of us.
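For point 3, a minimal sketch of the kind of statements meant (any column can be substituted):

   SELECT COUNT(CODE) FROM FORSELECTER

   SELECT MIN(PRICE), MAX(PRICE), SUM(PRICE) FROM FORSELECTER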

Regards,

Ravi

Former Member
0 Kudos

Hi Ravi,

1), 2) There is no primary key or index in this first run of tests. I will of course try with them, but in this first run I wanted to see how the two RDBMS dealt with table scans.

3) 4) Will do.

Thank you very much for your answer, I'll post the next test results.

Former Member
0 Kudos

Well - all databases work in memory. But that's a longer story.

It's a row store - I'd only consider using these in HANA for a small subset of tables. HANA really flies with the column store.

Suggest you try comparing the column store in SQL Server 2012 with HANA. I ran some tests of this against IQ. IQ won hands down.

Former Member
0 Kudos

Thank you for your answer

HANA is not just a columnar DB, it's also an in-memory DB, so all the data in users' tables is held in memory. Data is saved by the persistence layer at fixed points in time.

MSSQL, talking about users' tables, holds in RAM the "recent" data requested and the "nearby" data. If I clear the MSSQL cache by running DBCC DROPCLEANBUFFERS, I know that the next query will fetch data from disk.
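For reference, the standard way to force the next read from disk is along these lines (running a CHECKPOINT first writes out dirty pages so the clean buffers can actually be dropped):

   CHECKPOINT
   DBCC DROPCLEANBUFFERS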

I will certainly give the column store a try, but the column store is not always the best choice; it depends on the data you need to extract (all columns, few columns, 90% or 10% of the rows in a table, etc.).

former_member184768
Active Contributor
0 Kudos

Hi David,

My two cents:

The in-memory behavior of HANA is similar to the database buffer cache of any other database. The data is available in main memory, but after a restart of the HANA DB only the objects that are queried are loaded into main memory from persistent storage. Tables can also be "unloaded" from main memory.
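For column store tables this can even be done explicitly with SQL, for example (using FORSELECTER just as an example name, assuming a column store copy of it):

   LOAD FORSELECTER ALL
   UNLOAD FORSELECTER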

Row store or column store for an object depends on the use case. I think OLTP applications should use the row store, as the number of inserts/updates is very high. Also, the query access pattern of OLTP applications (a small number of records but ALL columns) is more suitable for row store objects.

Data warehousing applications, with bulk writes and massive reads on selective columns, are better suited to the column store.

Hence the comparison between different databases should be done with the same storage type, to have a level playing field.

Regards,

Ravi