How does geospatial re-ranking improve agricultural RAG systems, and what is the optimal balance between semantic relevance and geographic proximity for region-specific advisory?
Existing agricultural RAG systems (AgriGPT, standard RAG pipelines) rely solely on semantic similarity for retrieval. They don't account for the fact that agricultural advice is inherently local — planting dates, pest pressures, soil types, and regulations vary by region. A recommendation valid in California can be harmful in North Carolina.
No prior work combines spatial distance decay with semantic retrieval scoring for agricultural knowledge systems. The closest work (Spatial-RAG, 2025) addresses general geospatial QA but not the domain-specific challenges of agriculture.
✅ AgriRegion framework — complete and deployed
✅ 70K+ document corpus indexed in OpenSearch (S3 + vector k-NN)
✅ Spatial-semantic scoring function implemented
✅ AgriRegion-Eval benchmark (160 questions, 12 subfields)
✅ Baseline comparisons vs. GPT-4-Turbo, Claude, Gemini, Mistral
✅ Paper draft exists (11 pages, IEEE Access format)
The desk rejection was likely due to missing baselines and underexplored core contribution. Three additions fix this:
Same corpus, same LLM, same top-k — but α=0 (pure semantic, no distance decay). This isolates the geo contribution. If AgriRegion beats it by even 5% F1, the novelty is undeniable.
Run α from 0.0 to 1.0 in 0.1 steps. Plot F1, EM, BERTScore vs. α. Show the sweet spot. Show that both extremes (pure semantic, pure distance) underperform. This is one experiment, one figure, massive reviewer satisfaction.
Compare inverse (current), Gaussian, step function, and power law. Show which works best. Even if inverse wins, the comparison demonstrates rigor.
2-3 weeks — the paper exists. Run the three experiments, add the figures, fix the template, resubmit to a better-fit venue.
Computers & Electronics in Agriculture (IF: 8.3) — exact domain match, loves applied AI systems with evaluation. They publish RAG papers, they publish NC agriculture papers. Much better fit than IEEE Access for this work.