Re: Help Wanted: Ranking Strategy for Subwords Completion Engine [message #690420 is a reply to message #689209]
Wed, 29 June 2011 09:49
Johannes Lerch
Registered: February 2011
I just integrated your latest contribution. You might have to search a little to find it, as I refactored a lot of code. I also added some tests and found some bugs. For example, I think q-gram similarity should be symmetric (comparing a to b should give the same score as comparing b to a), and your implementation was not. I fixed those issues, but I would appreciate it if you took a look to verify that I didn't break anything.
The main classes of interest are now SubwordsRelevanceCalculator and QGramSimilarity, along with the corresponding test classes SubwordsRelevanceCalculatorTest, QGramSimilarityTest and ExpectedScoringsTest. As you can see, your current approach matches my expectations. Nice job!
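To illustrate what I mean by symmetric, here is a minimal sketch of one common way to get a symmetric q-gram score: the Dice coefficient over the q-gram multisets of both strings. This is not the code in QGramSimilarity; the class name QGramSketch, the method names and the choice of the Dice coefficient are assumptions made only for this example.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical illustration class, not part of the actual project.
class QGramSketch {

    // Counts every substring of length q in s (a multiset of q-grams).
    static Map<String, Integer> qgrams(String s, int q) {
        Map<String, Integer> counts = new HashMap<>();
        for (int i = 0; i + q <= s.length(); i++) {
            counts.merge(s.substring(i, i + q), 1, Integer::sum);
        }
        return counts;
    }

    // Dice coefficient: 2 * shared q-grams / (q-grams in a + q-grams in b).
    // Swapping a and b changes neither the numerator nor the denominator,
    // so the score is symmetric by construction.
    static double similarity(String a, String b, int q) {
        Map<String, Integer> ga = qgrams(a, q);
        Map<String, Integer> gb = qgrams(b, q);

        int sizeA = Math.max(0, a.length() - q + 1);
        int sizeB = Math.max(0, b.length() - q + 1);
        if (sizeA + sizeB == 0) {
            return 0.0; // assumption: strings shorter than q score 0
        }

        int shared = 0;
        for (Map.Entry<String, Integer> e : ga.entrySet()) {
            // A q-gram is shared as often as it occurs in both strings.
            shared += Math.min(e.getValue(), gb.getOrDefault(e.getKey(), 0));
        }
        return 2.0 * shared / (sizeA + sizeB);
    }
}
```

A test in the style of QGramSimilarityTest could then simply assert that similarity(a, b, q) equals similarity(b, a, q) for a handful of string pairs.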
But can you give me more details on the weighting function? What was the intent of the factor 20, and why is the difference in length between the two strings relevant?
For everyone who wants to test the current state: get it from the head channel. I turned off the default Java proposals and will check what it feels like to have only the subwords completion.
[Updated on: Thu, 30 June 2011 03:10]