huskies

Author	SHA1	Message	Date
dave	6e4fb7fd4b	huskies: merge 1113 story [huskies-server repo] Convert static website to Next.js with static rendering	2026-05-17 15:51:37 +00:00
dave	6a2f81e873	huskies: regen source-map.json	2026-05-16 23:01:49 +00:00
dave	b6df89d24c	huskies: regen source-map.json	2026-05-16 22:39:20 +00:00
Timmy	5c63618b30	docs: chat-driven project bootstrap design overview Captures the architecture for going from "new project" chat command to a running, container-isolated, editor-accessible huskies project. Covers the three personas (chat-only / editor-using / multi-project), the container template (base + stack overlay + project bind mount), build sandbox model (host stays clean, all dep-code in container), editor-agnostic SSH access, git integration, and a 5-phase rollout. Source for upcoming bootstrap stories. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-16 22:40:54 +01:00
dave	d59efa0b5c	huskies: regen source-map.json	2026-05-15 20:24:31 +00:00
dave	9f4f493486	huskies: regen source-map.json	2026-05-15 19:05:56 +00:00
dave	398a5806e7	huskies: regen source-map.json	2026-05-15 18:25:25 +00:00
dave	0ae6dfd565	huskies: regen source-map.json	2026-05-15 12:40:17 +00:00
dave	ce13c00ebd	huskies: regen source-map.json	2026-05-15 12:27:48 +00:00
dave	d944885ce9	huskies: regen source-map.json	2026-05-15 12:10:11 +00:00
dave	46556d308a	huskies: regen source-map.json	2026-05-15 12:03:09 +00:00
dave	fb1311cdae	huskies: regen source-map.json	2026-05-15 11:16:16 +00:00
Timmy	8446ab1c71	chore: gitignore .huskies/double_timmy_log.md Local-only scratchpad for tracking suspected duplicate-Timmy / duplicate-create_story incidents while we hunt the cause. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-15 10:06:37 +01:00
dave	b5054b08d3	huskies: regen source-map.json	2026-05-15 08:47:38 +00:00
dave	60fceee204	huskies: regen source-map.json	2026-05-15 02:03:30 +00:00
dave	f7413cc711	huskies: regen source-map.json	2026-05-15 01:38:05 +00:00
dave	a06bf6778b	huskies: regen source-map.json	2026-05-15 01:27:25 +00:00
dave	ae69cd50b1	huskies: regen source-map.json	2026-05-15 00:58:57 +00:00
dave	5eb8f2f8a7	huskies: regen source-map.json	2026-05-15 00:37:01 +00:00
dave	f04bdd1f14	huskies: regen source-map.json	2026-05-14 23:45:53 +00:00
dave	bf813d910b	huskies: regen source-map.json	2026-05-14 23:29:32 +00:00
Timmy	556d335997	chore: refresh source-map.json before 0.11 release Catches up master with entries added by stories that merged in a binary predating 1065 (merge-pipeline source-map regen): ErrorBoundary, WsConnectivity, transition_merge_failure_to_retry, and others. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-14 23:28:47 +01:00
dave	bb5abcd042	huskies: merge 811	2026-05-14 18:32:37 +00:00
dave	1f9f34ab58	huskies: merge 1038	2026-05-14 17:06:50 +00:00
dave	311883f45d	huskies: merge 1039	2026-05-14 16:33:47 +00:00
dave	0b3a33a63c	huskies: merge 1037	2026-05-14 15:54:17 +00:00
Timmy	b0090aba84	Adding baseline source-map	2026-05-14 16:35:08 +01:00
dave	14a39b6205	huskies: merge 980	2026-05-13 14:44:17 +00:00
dave	c89a5c2da6	huskies: merge 966	2026-05-13 12:21:43 +00:00
dave	3c9851d17d	docs(AGENT.md): forceful "no exceptions" doc-comment rule Two stories today (961, 962) passed every other gate and got bounced at the merge step on a single missing `///` on a `pub mod` line. Sonnet keeps treating the doc comment as optional when the rule says "add doc comments to new modules and pub functions/structs/enums." Promote the rule to its own loud section with no-exceptions wording and a concrete reminder to run source-map-check before committing. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-13 12:08:54 +00:00
Timmy	78b1ecdc3c	docs(AGENT): require PLAN.md update on every wip + final commit The "living document" rule was soft and got ignored — coders wrote PLAN.md once at session start and then drifted away from it. Tie the update to a trigger they already do (the wip/final commit), and call out stale "Current state" as a process failure. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-13 11:57:51 +01:00
dave	4a0fbcaa95	huskies: merge 949	2026-05-13 07:14:50 +00:00
dave	9ccbdff19f	huskies: merge 952	2026-05-13 05:43:22 +00:00
dave	8e9112066f	huskies: merge 935	2026-05-12 22:03:15 +00:00
dave	c3144b7937	huskies: merge 900	2026-05-12 16:46:33 +00:00
Timmy	6feb68f3e3	fix(923): watchdog counts only tool-using turns; narration-only turns no longer burn budget Observed: stories 917, 918, 920, 910 all turn-limit-killed despite producing real commits. Tally across their session logs shows 30–55% of assistant turns were pure narration ("I'll read X next", "Now let me check Y") with no tool_use. At 80 max_turns the effective work budget was ~44 tool calls, not enough for a typical bug fix's edit + test + check_criterion cycle. Changes: - New optional AgentConfig field max_tool_turns. When set the watchdog uses it instead of max_turns; only assistant messages whose data.message.content has at least one tool_use block count. - count_turns_in_log in agents/pool/auto_assign/watchdog/limits.rs filters on tool_use. Existing test helper write_fake_session_log now emits tool_use blocks; added write_fake_mixed_session_log for the narration regression test. - agents.toml: coders/coder-opus get max_turns=200 (claude-code's own --max-turns cap, sized to never bite before the watchdog) and max_tool_turns=80. qa: 120 / 40. mergemaster: 250 / 100. Budgets unchanged — the dollar cap remains the runaway-loop backstop, with ~$3-5 worst-case waste if an agent narrates indefinitely. - Two new regression tests: * watchdog_does_not_count_narration_only_turns: 5 tool + 30 narration under max_tool_turns=10 stays Running. * watchdog_max_tool_turns_overrides_max_turns: 4 tool turns at max_tool_turns=3 / max_turns=200 still terminates with TurnLimit. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-12 17:25:11 +01:00
Timmy	bb845d17cf	docs(904): drop run_tests retry-on-timeout clause from coder prompts Bug 903 (run_tests attach instead of respawn) + 904 (MCP progress notifications + SSE) together eliminate the transport-timeout error mode from the agent's point of view: long test runs complete without the MCP client ever observing a tool-call error. Production verification (see `d64f1e94` / `ddc4228b` deploy at 14:30 UTC today) confirmed 78s and 65s test runs completing in single processes with no respawn churn and no retry needed. The "If run_tests errors with a transport timeout, call it again" sentence in coder-1/2/3/opus system_prompts (added belt-and-braces in `a97a10fb`) is now redundant. Removing it tightens the agent's mental model down to: call run_tests, wait for the result. No error-handling branch, no retry semantics to internalise. This closes the last open AC on story 904. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-12 15:36:53 +01:00
Timmy	a97a10fba2	docs(903): coder system_prompts — clarify run_tests retry contract Pre-d64f1e94 the "call run_tests again — it attaches" guidance was a lie (every call killed the prior job and spawned a fresh one). With the attach fix in place, the contract is now real and safe to depend on. Tighten the wording so agents see exactly what to do: OLD: "Do not use ScheduleWakeup to wait for run_tests; if run_tests appears to time out, call run_tests again — it attaches to the in-flight test job and blocks until completion." NEW: "If run_tests errors with a transport timeout, call it again — it's idempotent and attaches to the same in-flight test job, so retries are safe and eventually return a pass/fail result." Improvements: - "errors with a transport timeout" matches what the agent literally observes (a tool-call error), not the vague "appears to time out". - Explicit on idempotency so agents understand why retry is safe and don't worry about double-running the suite. - Drops the ScheduleWakeup clause — already enforced via the `disallowed_tools` setting on coder-1/2/3/opus, so the prompt reminder was redundant. Applied uniformly across coder-1, coder-2, coder-3, coder-opus. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-12 14:54:34 +01:00
Timmy	e955250474	fix(902): coder system_prompts steer to get_story_todos for story content Bug 902: the Step 0 "resume from worktree state" instruction told coders to call git_status / git_log / git_diff to discover prior session work, which they then extended into hunting for the story `.md` file on disk via find / ls — pointless post-865, since story content lives only in the CRDT. Update Step 0 in coder-1, coder-2, coder-3, and coder-opus to add an explicit instruction: "To read story content, ACs, or description, call the `get_story_todos` MCP tool — do NOT search for a story `.md` file on disk; story content is CRDT-only." Single substring replacement covers all four agents (identical Step 0 across them). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-12 13:13:08 +01:00
dave	fac4442969	fix(896): disallow ScheduleWakeup for coder agents; add run_tests retry guidance - Add `disallowed_tools` field to `AgentConfig` and render it as `--disallowedTools` CLI flag in `render_agent_args` - Set `disallowed_tools = ["ScheduleWakeup"]` on all four coder agents (coder-1, coder-2, coder-3, coder-opus); QA and mergemaster unaffected - Append instruction to all coder `system_prompt`s: do not use ScheduleWakeup to wait for run_tests; if run_tests appears to time out, call run_tests again — it attaches to the in-flight job and blocks - Add tests: `render_agent_args_disallowed_tools` and `coder_agents_disallow_schedule_wakeup`	2026-05-08 15:28:48 +01:00
dave	c50a04445c	spike(814): add gateway update command design doc Documents chat-driven `update` bot command for multi-project gateway: command surface, auth (room+role guard, future Ed25519), Docker-managed rollout sequence, automatic and manual rollback, open questions, and dependencies.	2026-04-29 18:17:19 +00:00
dave	cf35027b5a	config(coders): step 0 — resume prior-session work via git_status + git_log/diff against master..HEAD	2026-04-29 16:03:03 +00:00
dave	b4854cf693	huskies: merge 862	2026-04-29 13:28:37 +00:00
dave	9979ff2cf9	huskies: merge 859	2026-04-29 10:18:37 +00:00
dave	8802e1fe59	huskies: merge 853	2026-04-29 09:08:28 +00:00
dave	549a9defc4	huskies: merge 851	2026-04-29 08:42:28 +00:00
dave	3ce34c34e9	huskies: merge 850	2026-04-29 08:27:05 +00:00
dave	b698cee284	huskies: merge 821	2026-04-28 21:06:54 +00:00
dave	32a3465fc4	fix: tell the truth about run_tests being blocking `tool_run_tests` in `server/src/http/mcp/shell_tools/script.rs` is fully blocking server-side: it spawns the test child, polls every 1s server-side until exit (or `TEST_TIMEOUT_SECS = 1200s`), and returns the full {passed, exit_code, output} directly. There is NO async/started-status return path. But two places told agents the wrong story: 1. `tools_list/system_tools.rs` description claimed "Returns immediately with status: started. Poll get_test_result..." — agents read tool descriptions for protocol semantics, so they followed this and burned turns polling get_test_result. 2. `agents.toml` had been correctly saying it blocks, but my last commit (`776aad38`) "fixed" it the wrong way based on a misread of the code. Now both say: run_tests blocks server-side, returns the full result, do not poll get_test_result. get_test_result remains for external observers (UI checking on a job another caller started). Reverts the prompt change in `776aad38` with the correct text.	2026-04-28 15:59:06 +00:00
dave	776aad3877	fix: agent prompts honest about run_tests being async Pre-f958f57e, run_tests blocked until completion. After that fix it became a background-job starter, with get_test_result polling. The agent prompts were never updated, so they still said "run_tests blocks until complete" — and agents then waste turns polling. Updated coder-1/2/3, coder-opus, and qa prompts to describe the actual flow: run_tests is async, get_test_result blocks for up to 20s per call, test suites typically take 1-5 minutes so expect a few polls. Companion bug filed for bumping TEST_POLL_BLOCK_SECS so one poll covers most test runs (root-cause fix; this commit is the prompt half).	2026-04-28 15:55:15 +00:00
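
The turn-counting rule described in commit 6feb68f3e3 (fix 923) can be sketched roughly as below. This is a minimal illustration, not the code in `agents/pool/auto_assign/watchdog/limits.rs`: the `Block`, `Turn`, and `Verdict` types, the helper constructors, and the function names `count_tool_turns` and `check_turn_limit` are all hypothetical, and the log is modeled as in-memory structs rather than parsed session-log JSON. The `>=` comparison at the limit is also an assumption; the commit message only states the two regression scenarios, which this sketch reproduces.

```rust
/// One content block of an assistant message (hypothetical, simplified model).
#[allow(dead_code)]
#[derive(Debug, Clone, PartialEq)]
enum Block {
    Text(String),
    ToolUse { name: String },
}

/// One assistant turn taken from a session log (hypothetical type).
#[derive(Debug, Clone, PartialEq)]
struct Turn {
    content: Vec<Block>,
}

/// Watchdog decision for an agent session (hypothetical type).
#[derive(Debug, Clone, Copy, PartialEq)]
enum Verdict {
    Running,
    TurnLimit,
}

/// Helper: a turn that contains a single tool_use block.
fn tool_turn(tool: &str) -> Turn {
    Turn { content: vec![Block::ToolUse { name: tool.to_string() }] }
}

/// Helper: a narration-only turn (text, no tool_use).
fn narration_turn(text: &str) -> Turn {
    Turn { content: vec![Block::Text(text.to_string())] }
}

/// Count only the turns with at least one tool_use block.
fn count_tool_turns(turns: &[Turn]) -> usize {
    turns
        .iter()
        .filter(|t| t.content.iter().any(|b| matches!(b, Block::ToolUse { .. })))
        .count()
}

/// When `max_tool_turns` is set it replaces `max_turns` and only
/// tool-using turns are counted; otherwise every turn counts.
/// Assumption: the limit trips once the count reaches the limit (`>=`).
fn check_turn_limit(turns: &[Turn], max_turns: usize, max_tool_turns: Option<usize>) -> Verdict {
    let (used, limit) = match max_tool_turns {
        Some(limit) => (count_tool_turns(turns), limit),
        None => (turns.len(), max_turns),
    };
    if used >= limit {
        Verdict::TurnLimit
    } else {
        Verdict::Running
    }
}
```

Under these assumptions, a log of 5 tool turns plus 30 narration turns with `max_tool_turns = Some(10)` stays `Running`, while 4 tool turns with `max_tool_turns = Some(3)` and `max_turns = 200` yields `TurnLimit`, mirroring the two regression tests named in the commit message.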

219 Commits