First of all, thannk you for your reply.
I've found the possible reason after multiple tests:
When HBase stores only a small amount of data, the data exists only in the MemStore and hasn't been flushed to the StoreFile yet. In this case, when using Spark to query through spatialRDDProvider.rdd,
no data will be obtained. However, once the data has been flushed to the StoreFile, the query results will be normal.
I'm not sure whether this is a bug of GeoMesa or not.
Best Regards,
Mike
发件人: geomesa-users <geomesa-users-bounces@xxxxxxxxxxx> 代表 Lahr-Vivaz, Emilio via geomesa-users <geomesa-users@xxxxxxxxxxx>
发送时间: 2025年1月2日 21:33
收件人: Geomesa project user mailing list <geomesa-users@xxxxxxxxxxx>
抄送: Lahr-Vivaz, Emilio <Emilio.Lahr-Vivaz@xxxxxxxxxxx>
主题: Re: [geomesa-users] -EXT- problem with query data by spark
Thanks,
Emilio Lahr-Vivaz
General Atomics, CCRi
From: geomesa-users <geomesa-users-bounces@xxxxxxxxxxx> on behalf of zhou lihuang via geomesa-users <geomesa-users@xxxxxxxxxxx>
Sent: Wednesday, December 25, 2024 4:31 AM
To: geomesa-users@xxxxxxxxxxx <geomesa-users@xxxxxxxxxxx>
Cc: zhou lihuang <zlh_0923@xxxxxxxxxxx>
Subject: -EXT-[geomesa-users] problem with query data by spark
WARNING: This message is from an external source. Evaluate the message carefully BEFORE clicking on links or
opening attachments.
hello everyone:
I used RDD Provider to query data, but retrieve 0 data (there are 2 features).
And I used DataStore created by DataStoreFinder.getDataStore,It’s successfully get 2 features.
code is as follows:
The env is :
geomesa: 4.0.5
spark: 3.3.0
hbase: 2.2.0
I've tried modify geomesa version and dependencies version, but it didn't work.
How can I fix this problem now?
Thank you everyone.
Best Regards,
Mike