Skip to Content

Fuzzy search typo: o insted of 0

Dear readers,

I'm currently trying to find duplicate invoices in the system based on the reference number (field xblnr).

The following dataset is used:
BELNR XBLNR
88669911 1230678
88669923 123O678

To find the duplicates based on field XBLNR i'm using the following SQL:
select belnr, xblnr, score() as score
from "FUZZYTEST"
where contains (xblnr, '1230678', fuzzy(0.70))

This gives me the following output:
BELNR XBLNR SCORE
88669911 1230678 1
88669923 123O678 0.8911111

I know that the 2nd record has a typo, they used the character "O" instead of the 0 (zero). To my opinion the score of 0.891111 is much to low! It's really important that this fuzzy search delivers a higher score. I tried many things with fuzzy search to add parameters etc. but nothing did work. Does anyone have an idea to make the score higher when a typo has been made?

Add a comment
10|10000 characters needed characters exceeded

Assigned Tags

Related questions

0 Answers

Before answering

You should only submit an answer when you are proposing a solution to the poster's problem. If you want the poster to clarify the question or provide more information, please leave a comment instead, requesting additional details. When answering, please include specifics, such as step-by-step instructions, context for the solution, and links to useful resources. Also, please make sure that you answer complies with our Rules of Engagement.
You must be Logged in to submit an answer.

Up to 10 attachments (including images) can be used with a maximum of 1.0 MB each and 10.5 MB total.