paper_type.json (269B)
1 { 2 "paper_type": "empirical", 3 "reason": "Proposes new evaluation metrics and algorithms (GroupMatch, SimpleMatch, TTM), then validates them through experiments on Winoground benchmark, demonstrating improved multimodal model performance with quantitative results." 4 }