paper_type.json (318B)
1 { 2 "paper_type": "empirical", 3 "reason": "Trains linear probes on model activations and reports quantitative experimental results (AUROC > 0.7) evaluating success prediction on math and coding benchmarks; primary contribution is the experimental finding that pre-generation activations encode failure information." 4 }