Emilio,
Thanks for the information.
To answer your question:
I loaded some GDELT-FULL data that I downloaded from the GDELT site last week, but I didn’t record the last file I processed. Since I was running the files in sequence, I hoped to run a quick Accumulo shell “query” to find the max date loaded.
I’ll look into the WPS calls since I need to test our integrated security on that anyway.
Chris Snider
Senior Software Engineer
Intelligent Software Solutions, Inc.

From: geomesa-users-bounces@xxxxxxxxxxxxxxxx [mailto:geomesa-users-bounces@xxxxxxxxxxxxxxxx]
On Behalf Of Emilio Lahr-Vivaz
Sent: Monday, August 24, 2015 9:52 AM
To: Geomesa User discussions <geomesa-users@xxxxxxxxxxxxxxxx>
Subject: Re: [geomesa-users] Accumulo Shell commands for Geomesa
Hi Chris,
What stats are you looking for? In general, we don't expose Accumulo methods directly, as we try to do all our operations through the GeoTools API. If you're looking for a min/max value, you could use a MinVisitor or MaxVisitor on a feature collection:
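FeatureCollection features = dataStore.getFeatureSource("gdelt").getFeatures(filter);
MinVisitor visitor = new MinVisitor("dtg"); // tracks the smallest "dtg" value visited
features.accept(visitor, null); // null: no progress listener
Date minValue = (Date) visitor.getMin(); // getMin() returns a Comparable, so cast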
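Since the question in this thread is about the latest date loaded, the maximum is the value of interest rather than the minimum. The sketch below is plain Java with no GeoTools or GeoMesa dependency; it only illustrates the visit-and-keep-the-extreme logic that a max-style visitor performs over the "dtg" values. The class MaxDateTracker, its methods, and the sample dates are hypothetical and exist only for illustration.

```java
import java.time.Instant;
import java.util.Arrays;
import java.util.Date;
import java.util.List;

// Hypothetical helper, not a GeoTools or GeoMesa class:
// remembers the largest Date it has been shown.
public class MaxDateTracker {
    private Date max; // stays null until a non-null value is visited

    // Call once per feature, passing that feature's "dtg" value.
    public void visit(Date value) {
        if (value != null && (max == null || value.after(max))) {
            max = value;
        }
    }

    // Largest value seen so far, or null if nothing has been visited.
    public Date getMax() {
        return max;
    }

    public static void main(String[] args) {
        // Made-up sample dates standing in for the "dtg" attribute of each feature.
        List<Date> dtgs = Arrays.asList(
            Date.from(Instant.parse("2015-08-01T00:00:00Z")),
            Date.from(Instant.parse("2015-08-17T00:00:00Z")),
            Date.from(Instant.parse("2015-08-09T00:00:00Z")));

        MaxDateTracker tracker = new MaxDateTracker();
        for (Date d : dtgs) {
            tracker.visit(d);
        }
        System.out.println(tracker.getMax().toInstant());
    }
}
```

With a real data store, the same role would be played by a MaxVisitor passed to features.accept(...), so the tracking happens while the feature collection is iterated rather than in a hand-written loop.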
For counts or histograms, we have a custom visitor that can also be run through a WPS process. See
https://github.com/locationtech/geomesa/blob/master/geomesa-accumulo/geomesa-accumulo-datastore/src/main/scala/org/locationtech/geomesa/accumulo/process/unique/UniqueProcess.scala
with an example request here:
https://github.com/locationtech/geomesa/blob/master/geomesa-accumulo/geomesa-accumulo-datastore/src/test/resources/process/unique/geomesa-unique.xml
We've been interested in writing combiners to help speed up this type of query, but we haven't gotten around to it yet. If you'd like to cook something up, we'd be glad to provide support and help get it integrated!
Thanks,
Emilio
On 08/24/2015 11:28 AM, Chris Snider wrote:
Hi,
Are there any iterators/combiners etc. that I can run in/against the Accumulo shell to determine some stats for my GDELT ingestion?
Something along the lines of the combiner example at
http://accumulo.apache.org/1.5/examples/combiner.html
Thanks,
Chris Snider
Senior Software Engineer
Intelligent Software Solutions, Inc.
Direct (719) 452-7257
