paper_type.json (239B)
1 { 2 "paper_type": "survey", 3 "reason": "Systematic review and meta-analysis of 445 LLM benchmark papers, synthesizing construct validity weaknesses across the field rather than introducing new experiments, benchmarks, or formal theory." 4 }