paper_type.json (343B)
{
  "paper_type": "empirical",
  "reason": "The paper conducts experiments on multiple benchmarks (GSM8K, arithmetic, commonsense, symbolic reasoning), reports quantitative results (e.g., 56.9% on GSM8K), and includes ablation studies, with the primary contribution being empirical findings about chain-of-thought prompting's effectiveness."
}