loop-benchmarking

Controlled experiments across agentic coding configurations. Same task, one variable, what actually works.
git clone https://git.shiptheloop.com/loop-benchmarking.git
Log | Files | Refs | README

commit 8daeef92ba6951d8f87f6476d03c4fafe25a57d2
parent d07dba794c0abd20688958f4185daf3447786621
Author: Brian Graham <brian@buildingbetterteams.de>
Date:   Mon, 13 Apr 2026 15:28:24 +0200

Re-eval all 390 runs with V2 bot on GPU machine

GPU canvas readback works (getImageData returns real pixels),
unlocking 148 canvas games that previously failed on non-GPU machine.
Fix reeval.py artifact path (was dashboard/public/artifacts/, now artifacts/).
Clean up SonarQube .scannerwork temp files from artifacts.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Diff is too large, output suppressed.

Impressum · Datenschutz