"tolerant" search problem [message #563781] |
Fri, 20 February 2009 10:07 |
Andreas Weber Messages: 26 Registered: July 2009 |
Junior Member |
Hi again,
I tried to use a search configuration (DataDictionary.xml) which uses
Tolerance="tolerant".
But this delivers a strange result order: documents that contain the
exact search term are scored lower than documents containing a differently
spelled term.
I verified this by indexing 75 text documents, each containing only one word:
- 50 docs containing the word "hallo"
- 22 docs containing the word "hillo"
- 3 docs containing the word "hello"
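In case someone wants to reproduce this, the test corpus can be generated with a few lines of Python. This is only a sketch: the file naming scheme and the target directory are assumptions of mine, and the indexing step itself is not shown.

```python
import tempfile
from collections import Counter
from pathlib import Path

# Word -> number of single-word documents, taken from the list above.
CORPUS = {"hallo": 50, "hillo": 22, "hello": 3}


def write_corpus(target_dir):
    """Write one single-word .txt file per document and return the paths."""
    target = Path(target_dir)
    target.mkdir(parents=True, exist_ok=True)
    paths = []
    for word, count in CORPUS.items():
        for i in range(count):
            # Hypothetical naming scheme, e.g. hallo_00.txt
            path = target / f"{word}_{i:02d}.txt"
            path.write_text(word, encoding="utf-8")
            paths.append(path)
    return paths


if __name__ == "__main__":
    # A temporary directory is used here; point this at the crawled folder instead.
    with tempfile.TemporaryDirectory() as tmp:
        files = write_corpus(tmp)
        counts = Counter(p.read_text(encoding="utf-8") for p in files)
        print(len(files), dict(counts))
```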
I made a (tolerant) search for "hillo".
But the first three hits were the docs with "hello", with a score of 15%.
They were followed by the "hillo" hits (8%).
So why do the "hello" hits score higher than the "hillo" hits?
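To make my expectation explicit: if the ranking were based purely on edit distance to the query, the "hillo" docs should come first. The sketch below computes plain Levenshtein distance. It is only an illustration of that expectation, not the scoring the search engine actually uses, which I do not know.

```python
def levenshtein(a, b):
    """Return the Levenshtein (edit) distance between strings a and b."""
    # prev[j] holds the distance between the processed prefix of a and b[:j].
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


QUERY = "hillo"
distances = {word: levenshtein(QUERY, word) for word in ("hallo", "hillo", "hello")}

if __name__ == "__main__":
    # Sort by distance: the exact match should be listed first.
    for word, dist in sorted(distances.items(), key=lambda item: item[1]):
        print(word, dist)
```

Under this measure "hillo" has distance 0, while "hallo" and "hello" both have distance 1, so I would expect the exact matches to rank at least as high as the others.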
Best regards,
Andreas