Re: [geowave-dev] Determine Tiernumber for Polygons

Hey Marcel,

There's a pretty straightforward ingest and query example in the following slides. Let me know if that helps you understand what's going on. Generally, I think you're on the right track. In this case, you can think of duplicates as being synonymous with entries. That is to say, if there are 4 entries for a particular polygon, we consider that to be 4 duplicates.
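
To make the counting concrete, here is a minimal Python sketch using the entry counts for the red polygon from your message below. Treat it as an illustration only, not GeoWave's actual code: the function name, the "finest tier within a cap" selection rule, and the cap values are all assumptions, chosen because a cap of 4 reproduces the tier 3 placement shown on the slide.

```python
# Entry counts per tier for the red polygon, copied from the quoted message.
ENTRIES_PER_TIER = {0: 1, 1: 2, 2: 2, 3: 4}


def finest_tier_within(entries_per_tier, max_duplicates):
    """Return the finest (highest-numbered) tier whose entry count fits the cap.

    Assumption: every entry counts as one duplicate, so 4 entries means
    4 duplicates. The selection rule itself is hypothetical.
    """
    candidates = [
        tier
        for tier, entries in entries_per_tier.items()
        if entries <= max_duplicates
    ]
    return max(candidates) if candidates else None


if __name__ == "__main__":
    # Hypothetical caps, chosen only to show how the chosen tier moves.
    for cap in (1, 2, 4):
        print(cap, finest_tier_within(ENTRIES_PER_TIER, cap))
```

Under these assumptions, a cap of 1 keeps the polygon in tier 0, a cap of 2 moves it to tier 2, and a cap of 4 moves it to tier 3.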

Anyway, check out these slides, and let me know if that helps.

Whitney

On Thu, Oct 15, 2015 at 4:57 PM, Marcel wrote:
Hello,
I have another theoretical question:
http://accumulosummit.com/program/talks/geowave-geospatial-and-geotemporal-data-storage-and-retrieval-in-accumulo/
Please have a look at slide 18. The blue polygon will be indexed in tier 2. This is okay for me.
You say that the red polygon is stored in tier 3. That's not quite clear from the information given on the slide.
Consider the following for red polygon:
tier 0 (all) -> 1 entry
tier 1 (blue) -> 2 entries -> 1 duplicate
tier 2 (red) -> 2 entries -> 1 duplicate
tier 3 (yellow green) -> 4 entries -> 3 duplicates

So to minimize the number of duplicates, it should be stored in tier 0. If one duplicate were acceptable (for whatever reason), it should be stored in tier 2. But why tier 3? Is the choice of a finer-grained tier related to the number of empty cells?

Is there another presentation, with some more theoretical details which you could provide? Maybe something from internal discussions or something like that.

Best regards,
Marcel Jacob.
