Eclipse Community Forums
Forum Search:

Search      Help    Register    Login    Home
Home » Eclipse Projects » SeMantic Information Logistics Architecture (SMILA) » "tolerant" search problem
"tolerant" search problem [message #563781] Fri, 20 February 2009 10:07
Andreas Weber is currently offline Andreas Weber
Messages: 24
Registered: July 2009
Junior Member
Hi again,

I tried to use a search configuration (DataDictionary.xml) which uses
Tolerance="tolerant".

But this delivers strange result orders, e.g. the documents which contain
the exact term are less equal(scored) than those containing the misspelled
term.

I proved that by indexing 75 text documents containing only one word:
- 50 docs containing the word "hallo"
- 22 docs containing the word "hillo"
- 3 docs containing the word "hello"

I made a (tolerant) search for "hillo".
But the first three hits were the docs with "hello" with a score of 15%.
Than followed by "hillo" hits (8%).
So - why are the "hello" hits better scored than the "hillo" hits?

Best regards,
Andreas
Previous Topic:It's me again :)
Next Topic:"tolerant" search problem
Goto Forum:
  


Current Time: Tue Sep 23 04:23:12 GMT 2014

Powered by FUDForum. Page generated in 0.01455 seconds
.:: Contact :: Home ::.

Powered by: FUDforum 3.0.2.
Copyright ©2001-2010 FUDforum Bulletin Board Software