paper_type.json (325B)
{
  "paper_type": "empirical",
  "reason": "The paper runs experiments evaluating existing GPT-OSS models on established benchmarks (MMLU, SciQ, C-Eval) and reports quantitative comparative results with statistical significance testing; the primary contribution is the experimental findings, not the benchmarks themselves."
}