paper_type.json (315B)
{
  "paper_type": "empirical",
  "reason": "Runs experiments fine-tuning language models on dialog tasks and reports quantitative results (73.2% groundedness, 65% citation accuracy, safety improvements), with primary contribution being experimental findings on how supervised fine-tuning improves dialog quality."
}