paper_type.json (253B)
1 { 2 "paper_type": "empirical", 3 "reason": "Proposes FIZLE methodology and experimentally evaluates LLM-guided counterfactual generation across three NLP benchmark datasets, reporting quantitative metrics on label flip rates and model accuracy drops." 4 }