paper_type.json (240B)
{
  "paper_type": "theoretical",
  "reason": "Derives closed-form mathematical solutions for optimal attention temperature in linearized Transformers under distribution shift, with experiments serving to validate the theoretical results."
}