paper_type.json (237B)
1 { 2 "paper_type": "benchmark-creation", 3 "reason": "Primary contribution is GDPval, a new evaluation framework with 1,320 real-world economically valuable tasks; model performance results are baseline demonstrations of the benchmark." 4 }