paper_type.json (241B)
{
  "paper_type": "benchmark-creation",
  "reason": "The primary contribution is WebApp1K, a new 1000-problem benchmark for code generation; the empirical evaluations of 11 models serve to validate and demonstrate the benchmark's utility."
}