
Problem with IMPORT FROM when the file uses the SOH character as field delimiter - created using AWS HIVE

Former Member

Hi All,

    I'm stuck on a nagging problem. I'm using AWS HIVE to output the results of a HIVE query to a file that I'm going to load into SAP HANA. The problem is that HIVE writes its output files using the SOH character (^A, \01, 0x01) as the field delimiter, and I have no idea how to specify this non-printable character in the IMPORT FROM statement. Here is an example:

IMPORT FROM CSV FILE '/wiki-data/year=2013/month=04/000000'
INTO "WIKIPEDIA"."pagecounts"
WITH RECORD DELIMITED BY '\n'
FIELD DELIMITED BY '\01';

I've tried '^A', '\a', '\01', and '0x01', and I'm stumped.
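
For reference, a quick Python check like the following confirms what the delimiter byte actually is in the raw file (a minimal sketch; the path is just the example file above):

# Dump the raw bytes of the first record of the Hive output file to
# confirm that the field delimiter really is the 0x01 (SOH) byte.
with open('/wiki-data/year=2013/month=04/000000', 'rb') as f:
    first_line = f.readline()

print(repr(first_line))                          # SOH shows up as \x01
print('SOH count:', first_line.count(b'\x01'))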

Any ideas?

Regards,

Bill

Accepted Solutions (0)

Answers (1)


Former Member

FWIW - I used the following sed command to replace the SOH characters with commas

sed "s/\x01/,/g" 000000 > 000000.csv

I have two challenges with this approach:

  1. It introduces another process into my "Big Data" flow using AWS HIVE.
  2. Not all of my records load! The file has 30,000 rows and only 2,000 load. To make matters worse, it's not even the first 2,000 rows that get loaded.

Needless to say, '\x01' didn't work directly in the IMPORT FROM statement either.

The reason I didn't get the expected result is that some of my data values occasionally contain a comma. The good news is that I could use the '|' pipe character instead, as there were no pipes in the source file.
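
Before settling on a replacement character, it's worth checking that the candidate delimiter never appears in the raw data. A rough Python sketch (the file name is the one from my example; the candidate list is arbitrary):

# Count how often each candidate delimiter already appears in the raw,
# SOH-delimited Hive output, so you can pick one that never occurs in the data.
candidates = [b',', b'|', b'\t', b';']
counts = {c: 0 for c in candidates}

with open('000000', 'rb') as f:
    for line in f:
        for c in candidates:
            counts[c] += line.count(c)

for c in candidates:
    verdict = 'safe to use' if counts[c] == 0 else 'appears %d times' % counts[c]
    print(repr(c), verdict)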

So, the updated sed command achieved the desired result:

sed "s/\x01/,/g" 000000 > 000000.csv

I then changed my IMPORT FROM command to the following:

IMPORT FROM CSV FILE '/wiki-data/year=2013/month=04/000000.csv'
INTO "WIKIPEDIA"."pagecounts"
WITH RECORD DELIMITED BY '\n'
FIELD DELIMITED BY '|';

As a result, I got all 30,000 records. I'd still like to be able to process the file directly - any help would be appreciated. At least I'm not blocked for now.
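
For anyone following along, the sed step could also be done in Python, which makes it easy to flag rows that look wrong before the import (that would have caught my comma problem immediately). A rough sketch only; the expected column count is a placeholder:

# Convert the SOH-delimited Hive output to a pipe-delimited CSV and warn
# about any row whose field count is unexpected or that already contains
# the replacement character.
EXPECTED_FIELDS = 4                      # placeholder: number of columns in the target table

with open('000000', 'rb') as src, open('000000.csv', 'wb') as dst:
    for lineno, line in enumerate(src, start=1):
        fields = line.rstrip(b'\n').split(b'\x01')
        if len(fields) != EXPECTED_FIELDS or any(b'|' in f for f in fields):
            print('suspect row %d' % lineno)
        dst.write(b'|'.join(fields) + b'\n')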

Regards,

Bill

Former Member

Hi Bill,

I've not tried this myself but perhaps you can try and define your HIVE table as EXTERNAL using something like:

CREATE EXTERNAL TABLE EXT_TABLE1 (COLA INT, COLB STRING)
ROW FORMAT DELIMITED FIELDS TERMINATED BY '|'
STORED AS TEXTFILE
LOCATION '/user/hive/warehouse';

[NOTE: THIS IS HIVE SQL NOT HANA SQL]

When you insert rows into this table they should be delimited by '|', so fingers crossed the file created in HDFS will import into HANA quite easily.

I've mostly been using HBASE and Impala instead of HIVE.

I use Python to connect to HANA and HBASE or IMPALA to transfer the data in chunks.

It's a bit more work to set up than just using CSV imports, but you have more control over the data flow (e.g. you can perform a MERGE DELTA along the way).

FYI: So far I've transferred up to 60 Million records into HANA using this method.
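
The transfer loop itself is nothing fancy; roughly along these lines (a sketch only, assuming an ODBC connection to HANA via pyodbc; fetch_batches() stands in for whatever reads from your source system, and the DSN, credentials, and table name are just placeholders):

# Chunked transfer into HANA: insert each batch, commit, and merge the
# delta store every few batches so it doesn't grow out of control.
# fetch_batches() is a hypothetical helper that yields lists of row tuples
# from the source (HBASE / Impala).
import pyodbc

hana = pyodbc.connect('DSN=HANA_ONE;UID=MYUSER;PWD=MYPASSWORD')
cur = hana.cursor()

for i, batch in enumerate(fetch_batches(size=10000), start=1):
    cur.executemany('INSERT INTO "MYSCHEMA"."STAGING" VALUES (?, ?)', batch)
    hana.commit()
    if i % 10 == 0:
        cur.execute('MERGE DELTA OF "MYSCHEMA"."STAGING"')
hana.commit()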

If you want to go down that route then this is a good place to start:

http://scn.sap.com/community/developer-center/hana/blog/2012/06/08/sap-hana-and-python-yes-sir

If I ever have enough free time I hope to see if it's possible to get SQOOP to manage transfers between HADOOP and HANA.

It's a shame BO Data Services isn't available to try on AWS; otherwise that'd be my first choice.


All the best

Aron

Former Member

Hi Aron,

    Thanks for your reply - I was going to try the external table approach as you suggested, with an S3 bucket for the file location. Right now, I'm OK with the sed approach - it took about 26 seconds to rip through 256 MB of Hive data and replace the SOH with '|' characters for 5.5 million rows.

    HANA One imported the resulting pipe-delimited file in 49 seconds - not bad. All told, I now have 523 MB loaded into my column table: 11.7 million rows taking up 203 MB of RAM, for approximately 2.6x compression in memory compared to on disk.

   The problem I have with SQOOP-based solutions on AWS is that it's expensive to keep the cluster up and running with the Hive database. In my current solution, my data sits on S3 storage and I spin up a 6-node cluster when I need to do some "quick" queries, as I can't load all the data into HANA One at one time. I then pump the results of my Hive query out to S3 storage and tear down the cluster as soon as I've checked that the output is as expected.

   However, what's really needed is for the IMPORT FROM statement to support non-printable characters as field delimiters; that would give the best flexibility.

Thanks,

Bill

Former Member

Hi Bill,

What you are doing makes sense. It's always good to keep the costs down.

AWS is fantastic for the flexibility it offers. It's just so easy to spin up nodes and terminate them as required (whether using AWS Elastic MapReduce or EC2 to run a HADOOP cluster).

I’m definitely hooked.    🙂

Sadly though it’s quite expensive if you need to leave these instances running.

Hopefully someone else will be able to give you an answer on the HANA import problem with non-printable characters.

In the meantime, other than the external table approach, you might also try calling the sed command at the end of your MapReduce task to avoid the subsequent manual step.

It'd also be very cool to have an optional step to check whether your AWS HANA One box is 'started'. If it is, you could automatically transfer the file and call the import; otherwise just store the results on S3 for later.
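
Something along these lines would do it, sketched here with boto3 (the region and instance ID are placeholders):

# Check whether the HANA One EC2 instance is running before deciding
# whether to transfer the file and kick off the import, or just leave
# the results on S3 for later.
import boto3

ec2 = boto3.client('ec2', region_name='us-east-1')
resp = ec2.describe_instances(InstanceIds=['i-0123456789abcdef0'])
state = resp['Reservations'][0]['Instances'][0]['State']['Name']

if state == 'running':
    print('HANA One is up: transfer the file and call IMPORT FROM')
else:
    print('HANA One is %s: store the results on S3 for later' % state)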

If you wanted to go even more over the top, you could even execute commands to start and stop your AWS HANA instance. 🙂

It doesn't sound like an issue for you yet, but if your uncompressed file gets too big you may find the import struggles. I've hit problems importing 7 GB files with my 17.1 GB AWS HANA box.

In this case I needed to partition the table in HANA to allow MERGE DELTAS to complete.
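
A complementary workaround (not what I did here) is simply to split the big uncompressed file into smaller pieces and run the import once per piece. A rough Python sketch, with placeholder file names and chunk size:

# Split a large pipe-delimited CSV into smaller files so that each
# IMPORT FROM call only has to deal with a manageable chunk.
LINES_PER_CHUNK = 1000000                # placeholder chunk size

with open('000000.csv', 'rb') as src:
    part, count = 0, 0
    dst = open('000000.part%03d.csv' % part, 'wb')
    for line in src:
        dst.write(line)
        count += 1
        if count == LINES_PER_CHUNK:
            dst.close()
            part, count = part + 1, 0
            dst = open('000000.part%03d.csv' % part, 'wb')
    dst.close()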

BTW: If you've ever been tempted to set up a permanent HADOOP cluster at home (or under your desk at work), you can get some pretty good, cheap hardware these days. I know a guy running a 2-node cluster for $720 ($30 p/month over 24 months), which operates at similar speeds to a 3-node cluster on AWS, at a tiny fraction of the cost. The more experimenting I do with HADOOP, the more I'm considering this option.

All the best on your AWS HIVE / HANA integration experiments.

Aron