paper_type.json (288B)
1 { 2 "paper_type": "empirical", 3 "reason": "Runs experiments on existing benchmarks (GSM8K, MMLU, arithmetic, biographies) and reports quantitative improvements from multiagent debate, with primary contribution being the experimental findings on performance gains and scaling behavior." 4 }