paper_type.json (301B)
{
  "paper_type": "empirical",
  "reason": "The paper proposes a multi-agent Tree-of-Thought method and reports quantitative experimental results (0.6-8.8pp improvements) on the GSM8K benchmark, making the primary contribution empirical findings rather than a new benchmark or theoretical analysis."
}