Re: [geomesa-users] GDELT - Results incomplete?
Hi Marcel,
Yes, Filter.INCLUDE should return all of your features.  There are a few 
quick reasons you should not expect to see a direct match-up between the 
number of features returned and the number of entries in Accumulo.
First, in order to better support different query patterns, GeoMesa uses 
a number of different table structures.  The original spatio-temporal 
table has two entries per SimpleFeature.  If you ingested 5.4M records, 
you should see 10.8M records in a table ending in st_idx.
We introduced a new table design using a composite space-filling curve.  
That table should end in z3.  If the SimpleFeatureType has a date 
field and the default geometry is a point, then there should be one 
entry in the Z3 table per record.
To support attribute-based queries, there is an attribute table (ending 
in attr_idx).  For each attribute indexed, there should be 0 or 1 
entries per record ingested.  (If I recall, features which have a null 
value for an indexed attribute are not indexed.)
Finally, there is a records table which contains a copy of each 
feature.  This table is used in conjunction with the attribute table.
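As a rough sanity check on the arithmetic above, here is a minimal Python sketch that adds up the expected entries per table.  The function name, the choice of one indexed attribute, and the non-null fraction are illustrative assumptions, not values taken from your schema; the per-table multipliers are the ones described above.

```python
def expected_entries(features, indexed_attributes=0, non_null_fraction=1.0):
    """Estimate Accumulo entries per table for a number of ingested features.

    indexed_attributes and non_null_fraction are assumptions to adjust
    for your own SimpleFeatureType.
    """
    counts = {
        # Original spatio-temporal table: two entries per SimpleFeature.
        "st_idx": 2 * features,
        # Z3 table: one entry per record (date field + point geometry).
        "z3": features,
        # Records table: one copy of each feature.
        "records": features,
        # Attribute table: 0 or 1 entry per indexed attribute per record;
        # null values are assumed not to be indexed.
        "attr_idx": round(features * indexed_attributes * non_null_fraction),
    }
    counts["total"] = sum(counts.values())
    return counts


# 5.4M features, assuming (hypothetically) one indexed attribute, never null.
estimate = expected_entries(5_400_000, indexed_attributes=1)
for table, count in estimate.items():
    print(f"{table}: {count:,}")
```

Under those assumptions the total comes to 27M entries, which is in the same range as the roughly 30M the monitor estimates and well above the 5.4M features a query returns.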
I hope that helps explain what's going on.  I'd suggest checking out the 
various table sizes (available via the 'Master Server' link on the 
left of the Accumulo monitor page).
Cheers,
Jim
On 08/03/2015 05:46 AM, Marcel wrote:
Hello,
it's me again. I created a 10GB GDELT file and ingested it into 
Accumulo. Looking at the web console, it estimates that there are 
around 30M entries.
Writing a Query with a Filter.INCLUDE statement should return all of 
my events, right? When calling the size() method, the output is only 
5.4M. What could be the cause of this?
Is there a problem with empty values?
Thanks.
Marcel Jacob.