paper_type.json (334B)
{
  "paper_type": "empirical",
  "reason": "The paper proposes and empirically validates two methods (retrieval-based search and TS-Guessing protocol) for detecting data contamination in existing benchmarks, reporting quantitative findings (52% exact match on MMLU, fine-tuning validation) that constitute the primary contribution."
}