loop-benchmarking

Controlled experiments across agentic coding configurations. Same task, one variable, what actually works.
git clone https://git.shiptheloop.com/loop-benchmarking.git
Log | Files | Refs | README

commit 238f1a535996225cdf2c4e054730278cfae58f0f
parent 7418a1208c757383b4559e0bc32b40526ac75cb7
Author: Brian Graham <brian@buildingbetterteams.de>
Date:   Tue, 14 Apr 2026 22:42:18 +0200

Fix Z.AI auth: skip apiKeyHelper for non-anthropic providers

apiKeyHelper in --settings returned an Anthropic OAuth token and
overrode the ANTHROPIC_AUTH_TOKEN env var, so every zai (and
openrouter) request authenticated with the wrong credential. Z.AI
responded with 429 on the first turn, Claude CLI retried 10x, and
the run died after ~200s with zero useful work. Now apiKeyHelper
is only set when provider has no base_url override, so env-var
auth flows through for zai/openrouter.

Also commits ~30 new glm-5.1 runs from the main_effects sweep
that completed cleanly after the fix, minus 5 purged invalid
runs (429/aborted/zero-turn) captured before the fix landed.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Diff is too large, output suppressed.

Impressum · Datenschutz