paper_type.json (231B)
{
  "paper_type": "empirical",
  "reason": "Proposes VerifierQ method and validates it experimentally on GSM8K and MATH benchmarks, comparing quantitative results against baselines like Process Reward Models and Majority Voting."
}