Hi Chris,
What stats are you looking for? In general, we don't expose accumulo
methods directly, as we try to do all our operations through the
geotools API. If you're looking for a min/max value, you could use a
Min or MaxVisitor on a feature collection:
FeatureCollection features =
dataStore.getFeatureSource("gdelt").getFeatures(filter);
MinVisitor visitor = new MinVisitor("dtg");
features.accept(visitor, null);
Date minValue = visitor.getMin();
For counts or histograms, we have a custom visitor that can also be
run through a WPS process. See
https://github.com/locationtech/geomesa/blob/master/geomesa-accumulo/geomesa-accumulo-datastore/src/main/scala/org/locationtech/geomesa/accumulo/process/unique/UniqueProcess.scala
with an example request here:
https://github.com/locationtech/geomesa/blob/master/geomesa-accumulo/geomesa-accumulo-datastore/src/test/resources/process/unique/geomesa-unique.xml
We've been interested in writing combiners to help speed up this
type of query, but we haven't gotten around to it yet. If you'd like
to cook something up, we'd be glad to provide support and help get
it integrated!
Thanks,
Emilio
On 08/24/2015 11:28 AM, Chris Snider
wrote:
Hi,
Are there any iterators/combiners etc. that
I can run in/against the Accumulo shell to determine some
stats for my GDELT ingestion?
Something along the lines of the combiner
example at
http://accumulo.apache.org/1.5/examples/combiner.html
Thanks,
Chris Snider
Senior Software
Engineer
Intelligent
Software Solutions, Inc.
Direct (719) 452-7257

_______________________________________________
geomesa-users mailing list
geomesa-users@xxxxxxxxxxxxxxxx
To change your delivery options, retrieve your password, or unsubscribe from this list, visit
http://www.locationtech.org/mailman/listinfo/geomesa-users
|