paper_type.json (250B)
1 { 2 "paper_type": "empirical", 3 "reason": "Proposes MARSHAL RL framework and validates it through quantitative experiments showing 28.7% improvement on held-out games and 10% gains on AIME, with ablation studies demonstrating critical components." 4 }