paper_type.json (298B)
1 { 2 "paper_type": "empirical", 3 "reason": "Conducts controlled experiments with ablation studies and attribution patching on existing benchmarks (VSR, VQA), reporting quantitative performance impacts (9-16pp accuracy reduction) to understand how multimodal fine-tuning encodes spatial features." 4 }