Skip to Content
author's profile photo Former Member
Former Member

TREX and foreign characters


When indexing a document with the swedish word "älg" in it you can find it by searching for "älg" (of course). The word is also indexed in the form of "aelg" for users without the possibility to use the ä-character.

Is it possible to change this behaviour so that the word also is indexed in the form of "alg"?

The thing is that we have a repository manager developed for users being stored in an LDAP-server and when searching for swedish users it would be nice if international users could use a- and o-characters instead of ä-, å- and ö-characters.



Add a comment
10|10000 characters needed characters exceeded

Related questions

1 Answer

  • Posted on Oct 26, 2004 at 11:35 AM

    Hi Pierre,

    you cannot change this by easy configuration. It might be possible via a Python extension to TREX, but is probably not worth the effort. Especially, as many of these cases should be caught by the TREX fuzzy search algorithm, if you do create a fuzzy index for your user search. Although your exmample may be to short a string for fuzziness to have the desired effect. For those cases (fuzziness does not work) you could use the inclusion of synonyms in your search as described here:

    Or here in SDN, where it had vanished for a while: to enable semantic search or search for synonyms in trex.pdf

    You would then have to maintain each name though. ä=a will not do it there, the XTM file used (see link) would have to hold the info that älg=alg=aelg.



    Add a comment
    10|10000 characters needed characters exceeded

Before answering

You should only submit an answer when you are proposing a solution to the poster's problem. If you want the poster to clarify the question or provide more information, please leave a comment instead, requesting additional details. When answering, please include specifics, such as step-by-step instructions, context for the solution, and links to useful resources. Also, please make sure that you answer complies with our Rules of Engagement.
You must be Logged in to submit an answer.

Up to 10 attachments (including images) can be used with a maximum of 1.0 MB each and 10.5 MB total.