paper_type.json (345B)
1 { 2 "paper_type": "benchmark-creation", 3 "reason": "The primary contribution is introducing HumanEval-XL, a new multilingual code generation benchmark spanning 23 natural languages and 12 programming languages; while the paper includes experimental results from running models on the benchmark, the benchmark itself is the main contribution." 4 }