paper_type.json (235B)
{
  "paper_type": "empirical",
  "reason": "Runs experiments across multiple code generation benchmarks with six LLMs, analyzes failure patterns quantitatively, and reports empirical findings about task difficulty and failure rates."
}