paper_type.json (302B)
1 { 2 "paper_type": "benchmark-creation", 3 "reason": "Introduces OBFUSEVAL, a novel code-obfuscation-based benchmark with 1,354 C functions designed to evaluate LLM code generation capabilities; while experiments are run, the primary contribution is the benchmark itself and its evaluation framework." 4 }