paper_type.json (280B)
1 { 2 "paper_type": "benchmark-creation", 3 "reason": "The primary contribution is NL2Repo-Bench, a new evaluation framework for coding agents on repository generation tasks; the experimental results validate the benchmark rather than constituting independent empirical findings." 4 }