paper_type.json (290B)
{
  "paper_type": "empirical",
  "reason": "The paper runs experiments across multiple models and benchmarks, reporting quantitative findings about mutating action failures (14-18% of steps, 55-96% odds reduction) and validating the SABER safeguard with empirical improvements (+19.7pp)."
}