Full reeval on GPU machine: V2 bot + SonarQube - loop-benchmarking - Controlled experiments across agentic coding configurations. Same task, one variable, what actually works.

commit 03f7652cb15c203683d9239f08dc22efbb51b1b5
parent b499a01fb7df37b81f26449bd66bfd4cf68de116
Author: Brian Graham <brian@buildingbetterteams.de>
Date:   Thu, 16 Apr 2026 15:50:27 +0200

Full reeval on GPU machine: V2 bot + SonarQube

All 510 runs were re-evaluated at -j 20. SonarQube Community 9.9.8 was
started locally for the scan; sonarqube-scan.py had already been switched
from sonar.token to sonar.login for compatibility with that version.
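
For context, a minimal sketch of the kind of switch described here, assuming
the scanner is invoked with -D properties and that sonar.token is only
understood from SonarQube 10.0 onward. The helper names, the default host URL,
and the version cutoff handling are hypothetical and are not taken from
sonarqube-scan.py itself.

```python
from typing import List


def auth_property(server_version: str) -> str:
    """Pick the scanner property used to pass an auth token.

    Assumption: servers older than 10.0 (such as 9.9.8) expect
    sonar.login, while 10.0 and later accept sonar.token.
    """
    major = int(server_version.split(".")[0])
    return "sonar.token" if major >= 10 else "sonar.login"


def scanner_args(
    server_version: str,
    token: str,
    host_url: str = "http://localhost:9000",  # assumed local default
) -> List[str]:
    """Build the -D arguments for a scanner invocation (hypothetical helper)."""
    return [
        f"-Dsonar.host.url={host_url}",
        f"-D{auth_property(server_version)}={token}",
    ]


if __name__ == "__main__":
    # Placeholder token; a real one would come from the local server.
    print(scanner_args("9.9.8", "example-token"))
```

Deriving the property name from the server version keeps one script usable
against both an older local instance and a newer one.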

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Diff is too large, output suppressed.

	git clone https://git.shiptheloop.com/loop-benchmarking.git