.last-run.json (45B)
{
  "status": "passed",
  "failedTests": []
}
loop-benchmarking: Controlled experiments across agentic coding configurations. Same task, one variable, what actually works.
git clone https://git.shiptheloop.com/loop-benchmarking.git