paper_type.json (256B)
1 { 2 "paper_type": "benchmark-creation", 3 "reason": "Introduces GeoAnalystBench, a new 50-task Python-based GIS benchmark for evaluating LLMs on spatial analysis; while baselines are evaluated, the primary contribution is the benchmark framework itself." 4 }