Skip to Content
2

HANA Text Analysis configuration problem

Dec 09, 2016 at 02:55 PM

112

avatar image

Hello experts,

I am using SAP HANA Express Edition and try to setup the Text Analysis. It appears like none of standard configurations is working (like EXTRACTION_CORE etc.) even though they are present in sap.hana.ta.config package.

For example, this code returns nothing to the index:

CREATE FULLTEXT INDEX G_FULLTEXT ON "G"."UNSTRUCTURED" ("STRING")
TEXT ANALYSIS ON
CONFIGURATION 'EXTRACTION_CORE'

My custom-created configurations are not working, too.

What happen to work is indexConfigNER.xml which was mentioned here: https://archive.sap.com/discussions/thread/3657442

https://archive.sap.com/discussions/thread/3631568

This code works, but only for English:

CREATE FULLTEXT INDEX G_FULLTEXT ON "G"."UNSTRUCTURED" ("STRING")
TEXT ANALYSIS ON
CONFIGURATION 'indexConfigNER.xml'

Language detection is not working. Even for one language. So if I add something which should always return English:

LANGUAGE DETECTION ('EN')

to my code, my $TA table is empty again.

My questions are:

1) What is this indexConfigNER.xml? :) I wasn't able to find anything on it on the web or in Text Analysis guides. How is it connected to TA configurations (and which one?) and where can these dependencies be found?

2) How can I debug the configuration and language detection functionality?

3) Any additional info on configurations that you might share.

Many thanks.

Ivan

10 |10000 characters needed characters left characters exceeded
* Please Login or Register to Answer, Follow or Comment.

0 Answers