paper_type.json (301B)
1 { 2 "paper_type": "benchmark-creation", 3 "reason": "Introduces DynaCode, a dynamic benchmark generating ~189M Python problems with varying complexity levels, with experiments demonstrating the benchmark's effectiveness at resisting memorization rather than reporting primary experimental findings." 4 }