[EMF Compare] Innacuracies while comparing lists of very similar elements [message #619071] |
Thu, 29 May 2008 12:49 |
Eclipse User |
|
|
|
Sorry to bother again about EMF Compare. I hope to help while reporting
my findings.
Try to imagine a multivalued referenced with lots of referenced
children. These elements are simple named elements. Take a look at the
following example (I'll put just 2 lists of names of these elements):
ELEMENT01 -- ELEMENT01
ELEMENT02 -- ELEMENT02
ELEMENT03 -- ELEMENT03
ELEMENT04 -- ELEMENT04
ELEMENT05 -- ELEMENT05
ELEMENT06 -- ELEMENT06
ELEMENT07 -- ELEMENT07
If I remove one element from the second EReference:
ELEMENT01 -- ELEMENT01
ELEMENT02 -- ELEMENT02
ELEMENT03 -- .........
ELEMENT04 -- ELEMENT04
ELEMENT05 -- ELEMENT05
ELEMENT06 -- ELEMENT06
ELEMENT07 -- ELEMENT07
The comparison will result in (its an example, but happens most of the
times):
ELEMENT01 -- ELEMENT01 <match>
ELEMENT02 -- ELEMENT02 <match>
ELEMENT03 -- ......... <deleted>
ELEMENT04 -- ELEMENT05 <modified>
ELEMENT05 -- ELEMENT06 <modified>
ELEMENT06 -- ELEMENT04 <modified>
ELEMENT07 -- ELEMENT07 <match>
This is just an example situation, but as you can see, the engine tries
to match elements in the same position, and gets a high similarity score
(since these elements are very similar, even in the name).
In my opinion, the engine should take a look in the neighborhood to
search for a perfect match. In case there is no perfect match, then it
could match those with the highest similarity score.
Cheers!
Víctor.
|
|
|
Powered by
FUDForum. Page generated in 0.03260 seconds