loop-benchmarking

Controlled experiments across agentic coding configurations. Same task, one variable, what actually works.
git clone https://git.shiptheloop.com/loop-benchmarking.git
Log | Files | Refs | README

commit e52e85189d28eaa9be6271e17b363b5321c9ba16
parent 19603805e8fb248250f450fc3acd810b42dcc389
Author: Brian Graham <brian@buildingbetterteams.de>
Date:   Mon, 13 Apr 2026 23:11:31 +0200

Remove 68 zero-cost GLM-5.1 runs (auth failures)

Z.AI API key was expired/invalid during these runs, resulting in
0 turns and 0 cost. All 68 were glm-5.1 model.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Diff is too large, output suppressed.

Impressum · Datenschutz