paper_type.json (257B)
{
  "paper_type": "benchmark-creation",
  "reason": "Introduces SAFIM, a new 17,720-example syntax-aware fill-in-the-middle benchmark across four languages; empirical results on model evaluations serve to validate and demonstrate the benchmark's utility."
}