paper_type.json (261B)
1 { 2 "paper_type": "empirical", 3 "reason": "Systematically evaluates multi-agent and debugging approaches on HumanEval/HumanEval+ across 19 models, reporting quantitative accuracy improvements and comparative performance findings as the primary contribution." 4 }