paper_type.json (298B)
{
  "paper_type": "empirical",
  "reason": "The paper proposes a framework and runs controlled experiments comparing model performance (O1 vs others, GPT-4o vs Qwen3-1.7B) on policy extraction and enforcement tasks, reporting quantitative results (F1, accuracy metrics) as primary contributions."
}