huskies

Author	SHA1	Message	Date
dave	283bbc8658	huskies: merge 832	2026-04-29 00:34:21 +00:00
dave	f3bb0a6f4b	huskies: merge 828	2026-04-29 00:29:26 +00:00
dave	6f30815b64	huskies: merge 826	2026-04-29 00:21:30 +00:00
dave	89bf4ae0cf	huskies: merge 831	2026-04-29 00:16:18 +00:00
dave	42b576d285	huskies: merge 817	2026-04-29 00:11:48 +00:00
dave	39bdbbd095	huskies: merge 809	2026-04-29 00:06:42 +00:00
dave	a8d45dcdff	huskies: merge 800	2026-04-28 23:36:40 +00:00
dave	3ac10b1e1c	huskies: merge 825	2026-04-28 23:30:39 +00:00
dave	ab01a62bd1	huskies: merge 808	2026-04-28 23:18:57 +00:00
dave	6092f7efbb	huskies: merge 822	2026-04-28 23:12:25 +00:00
dave	b698cee284	huskies: merge 821	2026-04-28 21:06:54 +00:00
dave	dd35a8a530	huskies: merge 799	2026-04-28 21:01:55 +00:00
dave	97b9eaa39d	huskies: merge 807	2026-04-28 20:56:17 +00:00
dave	2a77f73ba4	fix(merge): use server-start-time, not pid, for stale-merge detection The merge_jobs cleanup encoded the server's pid in the CRDT and checked `kill(pid, 0)` to decide whether a "running" entry was stale. Two problems: 1. The cleanup runs inside the server, so checking whether the server's own pid is alive is tautological — kill(self_pid, 0) always succeeds. 2. `rebuild_and_restart` does an `execve()` re-exec, which keeps the same pid. After re-exec, merge_jobs from the previous server instance still encode "the current pid" — so the cleanup never fires, and stories like 799/800 sit forever with status="running" while no actual merge runs. Switch to a per-process server-start-time captured lazily in a `OnceLock<f64>` (reset by execve, so the new instance sees a fresh boot-time). A merge_job's recorded start-time < current boot-time means it came from a previous instance: stale, delete it. Legacy pid-encoded entries decode to None and are also treated as stale. MergeJob.pid → MergeJob.server_start_time. Tests updated.	2026-04-28 20:41:32 +00:00
dave	8f392f4fc7	huskies: merge 789	2026-04-28 20:38:12 +00:00
dave	f5ab75ecaa	huskies: merge 819	2026-04-28 20:28:35 +00:00
dave	b060d8fc88	fix(pty): always pass -p on resume so --include-partial-messages works claude CLI 2.1.97 strictly enforces that --include-partial-messages requires --print/-p to be set. The resume path skipped -p when the prompt was empty (which is the common case on respawns when there's no fresh failure context to inject), so the spawned claude process saw `--resume <sid> ... --include-partial-messages` without -p and exited with code 1: "include-partial-messages requires --print and --output-format=stream-json". Net effect: every coder respawn with prior_sessions > 0 and empty prompt was failing immediately, looking exactly like a rate-limit (empty agent log, zero tool calls). 819 hit retry-limit (4/3) and got marked blocked because of this — not because of any actual code or rate-limit issue. Fix: always pass `-p <prompt>` on resume, even with empty prompt.	2026-04-28 20:14:32 +00:00
dave	20e1210818	huskies: merge 830	2026-04-28 19:30:08 +00:00
dave	46b1e84629	huskies: merge 791	2026-04-28 19:18:12 +00:00
dave	e4af2d5c08	huskies: merge 803	2026-04-28 19:10:41 +00:00
dave	fa54451ba6	huskies: merge 802	2026-04-28 19:05:06 +00:00
dave	5d1e75f7e0	huskies: merge 806	2026-04-28 17:39:29 +00:00
dave	efc15c48da	huskies: merge 804	2026-04-28 17:34:55 +00:00
dave	691f34348e	huskies: merge 805	2026-04-28 17:28:47 +00:00
dave	2f6a221f09	fix(test): stub WebSocket in setupTests so rpcCall fails fast 770's HTTP→read-RPC migration replaced fetch-based agent calls with rpcCall over WebSocket. setupTests.ts only mocks fetch though, so the real jsdom WebSocket runs — and jsdom's WebSocket implementation asynchronously fires its connection-failure error after ~9 seconds (via internal Timeout._onTimeout). Tests that await component-mount state (like App.test.tsx > calls getCurrentProject() on mount) hang on that pending promise and time out at 10s. This silently broke 1 test on master (App.test.tsx getCurrentProject on mount) and cascaded into 16 failures in any worktree where the test file count changed and timing shifted (804, 806). Fix: replace the global WebSocket constructor with a stub that immediately fires onerror + onclose on the next microtask. rpcCall sees the error and rejects synchronously; components catch and continue rendering with empty state. Tests pass without flake. Verified locally: vitest run → 349/349 pass.	2026-04-28 17:05:53 +00:00
dave	619bdd9c82	huskies: merge 801	2026-04-28 16:43:04 +00:00
dave	3ff361bfe8	chore: silence git init defaultBranch hints in script/test Tests that create temp repos call `git init` without specifying a branch. Git then prints a 12-line "hint: Using 'master' as the name for the initial branch..." block — every test, every run. The output drowns out actual failures. Set init.defaultBranch=master via GIT_CONFIG_COUNT/KEY/VALUE env vars in script/test. This affects only subprocesses spawned by the test runner; no change to the user's real git config. Verified: cargo test --bin huskies emits 0 `hint:` lines after this change, all 2732 tests still pass.	2026-04-28 16:38:43 +00:00
dave	30dd4b3a0a	huskies: merge 796	2026-04-28 16:34:10 +00:00
dave	a65cd86c8f	huskies: merge 798	2026-04-28 16:25:33 +00:00
dave	1e40215c3e	huskies: merge 797	2026-04-28 16:06:50 +00:00
dave	d0b7db6765	huskies: merge 785	2026-04-28 15:59:50 +00:00
dave	32a3465fc4	fix: tell the truth about run_tests being blocking `tool_run_tests` in `server/src/http/mcp/shell_tools/script.rs` is fully blocking server-side: it spawns the test child, polls every 1s server-side until exit (or `TEST_TIMEOUT_SECS = 1200s`), and returns the full {passed, exit_code, output} directly. There is NO async/started-status return path. But two places told agents the wrong story: 1. `tools_list/system_tools.rs` description claimed "Returns immediately with status: started. Poll get_test_result..." — agents read tool descriptions for protocol semantics, so they followed this and burned turns polling get_test_result. 2. `agents.toml` had been correctly saying it blocks, but my last commit (`776aad38`) "fixed" it the wrong way based on a misread of the code. Now both say: run_tests blocks server-side, returns the full result, do not poll get_test_result. get_test_result remains for external observers (UI checking on a job another caller started). Reverts the prompt change in `776aad38` with the correct text.	2026-04-28 15:59:06 +00:00
dave	776aad3877	fix: agent prompts honest about run_tests being async Pre-f958f57e, run_tests blocked until completion. After that fix it became a background-job starter, with get_test_result polling. The agent prompts were never updated, so they still said "run_tests blocks until complete" — and agents then waste turns polling. Updated coder-1/2/3, coder-opus, and qa prompts to describe the actual flow: run_tests is async, get_test_result blocks for up to 20s per call, test suites typically take 1-5 minutes so expect a few polls. Companion bug filed for bumping TEST_POLL_BLOCK_SECS so one poll covers most test runs (root-cause fix; this commit is the prompt half).	2026-04-28 15:55:15 +00:00
dave	309d330734	huskies: merge 794	2026-04-28 15:53:33 +00:00
dave	bb779a0b0f	chore: regenerate STACK.md source map from module doc-comments Walked server/src/, frontend/src/, and crates/, extracting each module's //! doc-comment to build a directory-level source map. One row per directory + one row per top-level file. Replaces the hand-written stopgap from `5d6757dd` with content auto-derived from the codebase, so it stays useful as decomposes happen — the descriptions come from mod.rs, not from my recollection of where things live. Still a stopgap until 819 (auto-generated source-map-gen) lands and gets wired into the agent spawn path, but the content is closer to what 819 will produce.	2026-04-28 15:50:29 +00:00
dave	bf17fc14af	huskies: merge 795	2026-04-28 15:45:10 +00:00
dave	5d6757dd65	chore: restore source map in STACK.md (stopgap for 818 regression) 818 stripped the source map because it had stale paths. Empirically that made coder agents far slower — they spent most of each session re-discovering the codebase via Read/Grep before reaching any Edit, and ran out of turn budget without committing. Restoring a fresh source map keyed off current master. Uses directories where possible so it stays useful through future decomposes, plus a "Canonical examples" section pointing at the patterns to copy when adding new CRDT collections, RPC handlers, services, chat commands, etc. This is a stopgap until 819 (auto-generated source-map-gen) lands.	2026-04-28 15:43:44 +00:00
dave	f63464852b	huskies: merge 770	2026-04-28 15:38:34 +00:00
dave	1946709681	huskies: merge 788	2026-04-28 15:28:31 +00:00
dave	f62012ee9c	huskies: merge 793	2026-04-28 15:21:51 +00:00
dave	c1a50eab8e	huskies: merge 790	2026-04-28 15:16:01 +00:00
dave	7cd9706c0f	huskies: merge 813	2026-04-28 14:22:19 +00:00
dave	3772c0d03c	huskies: merge 784	2026-04-28 14:07:42 +00:00
dave	cf470f5048	huskies: merge 776	2026-04-28 14:01:18 +00:00
dave	8f23d13ac8	huskies: merge 779	2026-04-28 13:48:40 +00:00
dave	aed29b952c	huskies: merge 769	2026-04-28 13:42:47 +00:00
dave	36ca8d5e3b	huskies: merge 827	2026-04-28 13:01:48 +00:00
Timmy	1bd01eb9d4	Ignoring node identity	2026-04-28 13:27:25 +01:00
Timmy	6265fa534e	Updated a pile of deps	2026-04-28 13:27:07 +01:00
dave	b7db6d6aae	huskies: merge 775	2026-04-28 12:25:59 +00:00

1 2 3 4 5 ...

3461 Commits