HANA Text Analysis configuration problem

Hello experts,

I am using SAP HANA Express Edition and trying to set up Text Analysis. It appears that none of the standard configurations (EXTRACTION_CORE etc.) work, even though they are present in the sap.hana.ta.config package.

For example, after running this statement the $TA table stays empty:

CREATE FULLTEXT INDEX G_FULLTEXT ON "G"."UNSTRUCTURED" ("STRING")
TEXT ANALYSIS ON
CONFIGURATION 'EXTRACTION_CORE'
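
For reference, this is a sketch of how the result can be checked. It assumes the default naming convention, in which the text analysis output is written to a table named $TA_<index name> in the same schema as the source table. The schema "G" and the index name come from the statement above. The column names are the standard $TA columns.

```sql
-- Count the rows that text analysis produced, grouped by the language
-- recorded for each token and by the extracted entity type.
-- An empty result means text analysis wrote nothing for this index.
SELECT TA_LANGUAGE, TA_TYPE, COUNT(*) AS TOKENS
FROM "G"."$TA_G_FULLTEXT"
GROUP BY TA_LANGUAGE, TA_TYPE
ORDER BY TOKENS DESC;
```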

My custom-created configurations do not work either.

What does happen to work is indexConfigNER.xml, which was mentioned in these threads: https://archive.sap.com/discussions/thread/3657442

https://archive.sap.com/discussions/thread/3631568

This code works, but only for English:

CREATE FULLTEXT INDEX G_FULLTEXT ON "G"."UNSTRUCTURED" ("STRING")
TEXT ANALYSIS ON
CONFIGURATION 'indexConfigNER.xml'

Language detection is not working either, even when restricted to a single language. If I add a clause that should always yield English:

LANGUAGE DETECTION ('EN')

to my code, my $TA table is empty again.
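
Put together, the failing variant looks roughly like the sketch below. The position of the LANGUAGE DETECTION clause after CONFIGURATION is an assumption, since the post does not say where it was added. The DROP statement is included only because the index name G_FULLTEXT is reused from the earlier attempts.

```sql
-- Remove the index left over from the previous attempt (same name is reused).
DROP FULLTEXT INDEX G_FULLTEXT;

-- Same statement as the working one, plus a language detection list
-- restricted to English only. Clause position is an assumption.
CREATE FULLTEXT INDEX G_FULLTEXT ON "G"."UNSTRUCTURED" ("STRING")
TEXT ANALYSIS ON
CONFIGURATION 'indexConfigNER.xml'
LANGUAGE DETECTION ('EN');
```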

My questions are:

1) What is this indexConfigNER.xml? 🙂 I wasn't able to find anything about it on the web or in the Text Analysis guides. How is it connected to the TA configurations (and to which one), and where are these dependencies documented?

2) How can I debug the configuration and language detection functionality?

3) Any additional information on configurations that you could share would be welcome.

Many thanks.

Ivan

Accepted Solutions (0)

Answers (0)