loop-benchmarking

Controlled experiments across agentic coding configurations. Same task, one variable, what actually works.
git clone https://git.shiptheloop.com/loop-benchmarking.git

.last-run.json (45B)


{
  "status": "failed",
  "failedTests": []
}
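
The record above reports a failed run but lists no individual failed tests. As a rough sketch of how a script might read such a file and flag that combination: the helper name `summarize_last_run` is hypothetical, and the reading of the fields (`status` as the overall outcome, `failedTests` as a list of test identifiers) is an assumption, not something the repository states.

```python
import json

# Contents of .last-run.json as shown above, inlined so the sketch is self-contained.
LAST_RUN_TEXT = '{"status": "failed", "failedTests": []}'


def summarize_last_run(text: str) -> str:
    """Return a one-line summary of a last-run record (hypothetical helper)."""
    record = json.loads(text)
    # Assumed field meanings: overall outcome and a list of failed test ids.
    status = record.get("status", "unknown")
    failed = record.get("failedTests", [])
    if status == "failed" and not failed:
        # The run failed without any test being recorded as failed.
        return "failed, but no individual failed tests recorded"
    return f"{status}, {len(failed)} failed test(s) recorded"


summary = summarize_last_run(LAST_RUN_TEXT)
print(summary)
```

A failed status with an empty list may mean the run broke before any test finished, but that is a guess; the file alone does not say why.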
