HANA Text Analysis configuration problem

Hello experts,

I am using SAP HANA Express Edition and am trying to set up Text Analysis. None of the standard configurations (EXTRACTION_CORE etc.) appear to work, even though they are present in the sap.hana.ta.config package.

For example, this statement produces no rows in the $TA table:

CREATE FULLTEXT INDEX G_FULLTEXT ON "G"."UNSTRUCTURED" ("STRING")
TEXT ANALYSIS ON
CONFIGURATION 'EXTRACTION_CORE'

My custom configurations do not work either.

What does happen to work is indexConfigNER.xml, which was mentioned in these threads:

https://archive.sap.com/discussions/thread/3657442

https://archive.sap.com/discussions/thread/3631568

This code works, but only for English:

CREATE FULLTEXT INDEX G_FULLTEXT ON "G"."UNSTRUCTURED" ("STRING")
TEXT ANALYSIS ON
CONFIGURATION 'indexConfigNER.xml'

Language detection is not working either, even for a single language. If I add a clause that should always resolve to English:

LANGUAGE DETECTION ('EN')

to my statement, the $TA table is empty again.
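
For reference, a minimal sketch of what the combined statement and a follow-up check on the result might look like. The position of the LANGUAGE DETECTION clause and the assumption that the result table sits in schema "G" under the name $TA_G_FULLTEXT (the usual $TA_<index name> convention) are not taken from the statements above:

```sql
-- Note: an existing full-text index with the same name would have to be
-- dropped before this statement can be re-run.

-- Same statement as the working one, with the language detection clause added.
-- Assumption: the clause is placed after the configuration.
CREATE FULLTEXT INDEX G_FULLTEXT ON "G"."UNSTRUCTURED" ("STRING")
TEXT ANALYSIS ON
CONFIGURATION 'indexConfigNER.xml'
LANGUAGE DETECTION ('EN');

-- Count the rows text analysis produced, grouped by the language it assigned.
-- Assumption: the $TA table is created in the same schema as the source table.
SELECT TA_LANGUAGE, COUNT(*) AS CNT
FROM "G"."$TA_G_FULLTEXT"
GROUP BY TA_LANGUAGE;
```

An empty result from the SELECT corresponds to the "empty $TA table" symptom described here.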

My questions are:

1) What is this indexConfigNER.xml? :) I wasn't able to find anything about it on the web or in the Text Analysis guides. How is it related to the TA configurations (and to which one), and where are these dependencies documented?

2) How can I debug the configuration and language detection functionality?

3) Any additional information on configurations that you can share.

Many thanks.

Ivan


0 Answers