loop-benchmarking

Controlled experiments across agentic coding configurations. Same task, one variable, what actually works.
git clone https://git.shiptheloop.com/loop-benchmarking.git
Log | Files | Refs | README

commit 1afd77baba3aa07602ce3136076e22fa72f3c63a
parent 8d0ed726f6c4b41847e16603556b64e3d7482bda
Author: Brian Graham <brian@buildingbetterteams.de>
Date:   Sat,  4 Apr 2026 07:29:51 +0200

Increase timeout to 1200s (20 min) for larger models

Sonnet was timing out at 600s. 20 minutes should be enough for
most configurations.

Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Diffstat:
Mgrid.yaml | 2+-
1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/grid.yaml b/grid.yaml @@ -2,7 +2,7 @@ version: 1 defaults: runs_per_cell: 3 - timeout_seconds: 600 + timeout_seconds: 1200 budget: low: 0.50 high: 5.00

Impressum · Datenschutz