huskies

Author	SHA1	Message	Date
Timmy	7d7ab85994	feat(933): add item_type + epic CRDT registers + migrate epic mechanism Replaces the YAML-only `type: epic` / `epic: <id>` front-matter fields with typed CRDT registers on PipelineItemCrdt. The epic-mechanism MCP tools (`tool_list_epics`, `tool_show_epic`), the epic-context injection in agent spawn, and the type-classifier helpers (`item_type_from_id`, `is_bug_item`, `is_refactor_item`) now all read from the CRDT. Schema: - PipelineItemCrdt: `item_type: LwwRegisterCrdt<String>` and `epic: LwwRegisterCrdt<String>` registers. - WorkItem: typed `item_type()` and `epic()` accessors returning `Option<&str>`. - crdt_state::set_item_type(story_id, Option<&str>) and crdt_state::set_epic(story_id, Option<&str>) typed setters. Write paths populate the new registers: - create_story_file / create_bug_file / create_spike_file / create_refactor_file / create_epic_file — each calls set_item_type after write_story_content. - tool_update_story intercepts `epic` and `type` fields and routes them to the typed setters (same pattern as qa / depends_on). Read paths migrated off yaml_legacy: - http/mcp/story_tools/epic.rs: tool_list_epics + tool_show_epic. - agents/lifecycle.rs::item_type_from_id (numeric-only IDs). - agents/pool/start/spawn.rs epic-context injection. - http/workflow/bug_ops/bug.rs::is_bug_item, refactor.rs::is_refactor_item. - http/workflow/pipeline.rs::load_pipeline_state — review_hold/qa/epic_id all come from the CRDT now; only merge_failure is still YAML (sweep in 929 stage 10). All `yaml_residue(...)` wraps for item_type / epic are removed; the remaining residue marker doc no longer references 933. cargo fmt --check, clippy --all-targets -- -D warnings, and the 2857-test suite all pass. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-12 19:58:43 +01:00
Timmy	aadbb1b2af	feat(932): add review_hold CRDT register + migrate callers off yaml_legacy review_hold is now a typed bool register on PipelineItemCrdt alongside blocked / mergemaster_attempted. Exposed via the typed setter `crdt_state::set_review_hold(story_id, value)` and the `WorkItem::review_hold()` accessor. Replaces the legacy `review_hold: true` YAML front-matter field. Migrated callers: - http/mcp/qa_tools.rs::tool_approve_qa — clear via set_review_hold(false) - agents/lifecycle.rs::reject_story_from_qa — clear via set_review_hold(false) - agents/pool/pipeline/advance/helpers.rs::write_review_hold_to_store — set via set_review_hold(true), no more content rewrite - agents/pool/auto_assign/reconcile.rs (two callsites) — set via set_review_hold(true) instead of FS YAML write - agents/pool/auto_assign/story_checks.rs::has_review_hold — reads the typed register instead of conflating with Stage::Frozen (real bug fix: the legacy implementation returned `stage.is_frozen()`, which made the auto-assigner treat every held-for-review item as frozen even when it wasn't actually parked at the freeze stage). Dead yaml_legacy helpers removed: - write_review_hold(path), write_review_hold_in_content(content) - clear_front_matter_field(path) — last caller was the qa_tools wrap The yaml_residue marker doc now only mentions 933; the 932 line is gone. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-12 19:49:36 +01:00
Timmy	7660a460a5	wip(929): stage 9 — drop FS-archived-deps scan; story_tools/story/create.rs reads CRDT io/watcher and io/watcher/sweep were already CRDT-only — the watcher only watches .huskies/{project,agents}.toml, work-item events come from CRDT subscribe — so the remaining FS shadow reader was the bug-503 archived-dep warning in story_tools/story/create.rs (via check_archived_deps_from_list, which scanned .huskies/work/6_archived/). Migrate that call to the CRDT-direct `dep_is_archived_crdt`. Drop the now-unused helper and the four dead imports in bug/spike/refactor/criteria.rs that referenced it. io/story_metadata/deps.rs is reduced to a module-level comment pointing callers at the crdt_state helpers; nothing in io/ now scans the FS shadow tree. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-12 19:25:47 +01:00
Timmy	6e704a33b7	wip(929): stage 5 — drop FS-based dep checks and qa-mode parser from io/story_metadata Migrate the last three callers of the FS-scanning dependency helpers to the CRDT-direct equivalents and delete the dead helpers: - agents/pool/auto_assign/story_checks.rs: has_unmet_dependencies and check_archived_dependencies now wrap check_unmet_deps_crdt / check_archived_deps_crdt directly. Tests rewritten to seed the CRDT. - http/mcp/story_tools/story/update.rs: bug-503 archived-dep warning now reads from CRDT instead of scanning 6_archived. - agents/pool/pipeline/advance/helpers.rs: resolve_qa_mode_from_store is CRDT-only (the FS fallback for content-store-empty stories is gone). - io/story_metadata/parser.rs: resolve_qa_mode_from_content removed. - io/story_metadata/deps.rs: check_unmet_deps and dep_is_done deleted, along with the unused check_unmet_deps_from_list helper. - io/story_metadata/mod.rs: re-exports trimmed accordingly. check_archived_deps_from_list survives because story-creation still calls it before the CRDT entry exists (used from story_tools/story/create.rs). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-12 19:14:54 +01:00
dave	03a99b3cf1	huskies: merge 927	2026-05-12 17:55:12 +00:00
Timmy	b8945654bf	wip(929): stage 3 — migrate http/mcp/* off yaml_legacy + introduce yaml_residue marker Three MCP files touched: - status_tools.rs (story-status JSON dump): every field with a CRDT equivalent now reads from WorkItem (name, agent, blocked, qa_mode, retry_count, depends_on, claimed_by, claimed_at) or MergeJob.error (merge_failure detail). One field — review_hold — has no CRDT register yet (sub-story 932) and is wrapped in `yaml_residue(parse_front_matter(...))` so the gap is visible at every code-search. - qa_tools.rs: • tool_approve_qa wraps the legacy `clear_front_matter_field("review_hold")` write in `yaml_residue(...)` pending sub-story 932. • tool_reject_qa now reads the agent name from the CRDT WorkItem instead of parsing front matter on disk. - story_tools/epic.rs: the entire epic feature (item_type, epic link) has no CRDT analog — sub-story 933. Every parse_front_matter call here is wrapped in `yaml_residue(...)`. Also: new identity wrapper `db::yaml_legacy::yaml_residue<T>(v: T) -> T` that marks a yaml_legacy callsite blocked on a CRDT-register gap. Pure identity at runtime; the distinctive name makes the residue grep-findable (`grep -rn yaml_residue`). Sub-stories 932 and 933 enumerate the gaps. Filed: - 932: Add CRDT register for review_hold - 933: Add CRDT registers for the epic mechanism All 2854 tests pass; fmt + clippy clean. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-12 18:54:32 +01:00
dave	148ce37beb	huskies: merge 891	2026-05-12 17:09:01 +00:00
dave	b76633b79b	huskies: merge 892	2026-05-12 16:51:23 +00:00
dave	3891de685c	huskies: merge 888	2026-05-12 15:48:38 +00:00
dave	734597902f	huskies: merge 915	2026-05-12 15:38:25 +00:00
dave	a34c9796b5	huskies: merge 913	2026-05-12 15:30:23 +00:00
dave	379ff16d3e	huskies: merge 905	2026-05-12 15:02:58 +00:00
Timmy	ddc4228b10	feat(904): MCP progress notifications + SSE for long-running tool calls Follow-up to bug 903. The attach fix made run_tests retries safe, but agents still observed the underlying MCP transport timeout as a tool-call error and had to handle it via retry. Implement the proper fix: MCP `notifications/progress` events keep the client's transport timer alive so the call never errors from the agent's perspective. What changed: server/src/http/mcp/progress.rs (new) - `ProgressEmitter` (progressToken + mpsc sender) installed in a `tokio::task_local!` scope by the SSE response path. - `emit_progress(progress, total, message)` builds a JSON-RPC `notifications/progress` message and sends it via the channel. No-op when no emitter is in scope (plain JSON path / tests / API runtimes), so tool handlers can call it unconditionally. server/src/http/mcp/mod.rs - mcp_post_handler now detects `Accept: text/event-stream` AND a `params._meta.progressToken` on tools/call. When both are present, routes through `sse_tools_call` instead of the plain JSON path. - sse_tools_call: spawns the dispatch task with the emitter installed, builds an SSE stream that interleaves incoming progress events with the final JSON-RPC response, with a 15s keep-alive interval as a backstop for tools that don't emit their own progress. - Plain JSON behaviour is unchanged for non-SSE clients and for everything other than tools/call. server/src/http/mcp/shell_tools/script.rs - tool_run_tests poll loop emits `notifications/progress` every 25s of elapsed time (well below the typical ~60s MCP transport timeout). Attached callers (the bug 903 fix path) also emit so their MCP socket stays alive while waiting for the in-flight job. - Output filtering: on a passing run the response now returns a one-line summary ("All N tests passed.") instead of the full `cargo test` stdout, which was pure noise that burned agent tokens. Failure output is unchanged (truncated tail with the `failures:` section and final test_result line). CRDT entry stores the same filtered value so attached callers see it too. Tests (3 new): - emit_progress_no_op_without_emitter — calling outside scope is safe - emit_progress_sends_notification_when_emitter_installed — full path - emit_progress_omits_optional_fields — total/message optional Not changed: coder system_prompts still tell agents to retry on transport-timeout errors. That advice is now belt-and-braces — if claude-code's HTTP MCP client honours progress notifications, no agent will ever observe the error; if not, retry is still safe post-903. We can drop the retry advice once we've observed the SSE path working in the field. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-12 15:05:04 +01:00
Timmy	d64f1e94ff	fix(903): run_tests attaches to in-flight job instead of kill+respawn Bug 903: every `run_tests` MCP call killed the prior `cargo test` child for the same worktree and spawned a fresh one. Combined with the ~60s MCP client-side timeout and the 896 agent prompt that told agents to "call run_tests again — it attaches to the in-flight test job", this produced a respawn loop: agent calls, MCP times out at 60s, agent retries, run_tests kills the running build and starts a new one. The test suite never reaches the finish line. Server log evidence: "Started test job for <worktree> (pid N)" with a new PID every ~60-90s for the same worktree. Fix: when `run_tests` is called and a job is already in flight for that worktree, ATTACH to it instead of killing+respawning. The original job's poll loop already writes the final status to the CRDT `test_jobs` collection; attached callers just poll that CRDT entry (the same pattern `get_test_result` uses) and return the result when the in-flight job transitions out of "running". The 896 prompt's claim is now actually true. Worktrees remain isolated from each other and may run `cargo test` concurrently — there is no cross-worktree serialisation. The single invariant is "at most one test job per worktree at a time". New test: `tool_run_tests_concurrent_calls_attach_to_single_job` spawns two concurrent calls on the same worktree against a 2s `sleep`-based script and asserts total elapsed stays close to 2s (attach) rather than 4s (respawn). Note: the cross-worktree linker-OOM symptom Timmy reported in the field was downstream of the respawn loop. Killed-but-not-fully-reaped cargo invocations stack memory pressure beyond the nominal N worktrees. With the attach fix, each worktree runs exactly one in-flight build at a time and old builds finish cleanly. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-12 14:22:35 +01:00
dave	9be438e6d3	huskies: merge 865	2026-05-08 14:29:06 +00:00
dave	61cf7684de	huskies: merge 864	2026-04-30 22:27:51 +00:00
dave	1251b869a6	style: cargo fmt on today's new code (883/884/886/opus-pin) The mergemaster gates run rustfmt and rejected 864's merge because several files I added/touched in master today had not been fmt'd. Six files affected, mostly trivial line-wrapping nits. Fixes the formatting gate for the next 864 merge attempt.	2026-04-30 22:15:37 +00:00
dave	7a0c186d94	fix(886): parse cargo diagnostics in run_check/run_build/run_lint Before: tool_run_check (and run_build/run_lint via run_script_tool) returned the entire cargo log verbatim in `output`. For runs with many errors the response routinely exceeded the MCP token cap, was dumped to a tool-results file, and the agent had to scrape it with python3 just to see the error list — burning many turns on file archaeology for what should be a one-look operation. Real example: 864's coder hit `result (143,708 characters) exceeds maximum allowed tokens` and spent ~8 turns extracting 3 errors. Now: - New `service::shell::parse_diagnostics` parses `error[CODE]:` / `warning[CODE]:` headers + their `--> file:line` markers into structured `Diagnostic { kind, code, message, file, line }`. - `tool_run_check` (and the run_build/run_lint shared body) returns `{ passed, exit_code, errors: [...], warnings: [...], summary }`. Raw `output` is dropped from the default response. - New `verbose: bool` argument (default false) restores the raw output for callers who actually need it. - Updated the existing tool_run_check test to assert the new contract (150 errors → 150 structured entries, response < 50KB). Skipped run_tests in this pass — its parser would need to recognise test-runner output (different format from cargo); will land separately. Closes 886.	2026-04-30 15:06:02 +00:00
dave	7ac3fc2e3e	feat(884): persistent perm_rx lock-holder for Matrix bot Before: handle_message.rs acquired services.perm_rx only while processing one chat message and dropped it on chat_fut completion. The moment the bot wasn't actively responding, prompt_permission auto-denied any spawned coder bash call as "no interactive session" — making unattended coder work impossible. Now: a permission_listener task is spawned at bot startup and holds perm_rx for the bot's lifetime. Permission requests are forwarded to the first configured Matrix room, replies resolved by the existing on_room_message handler via pending_perm_replies. Per-message acquire is gone from handle_message.rs (chat_fut just awaits cleanly). - New module: chat/transport/matrix/bot/permission_listener.rs. - Wired into run_bot before BotContext construction; bot_sent_event_ids is hoisted out so the listener and the rest of the bot share it. - handle_message.rs no longer touches perm_rx. - diagnostics/permission.rs comment updated to reflect the new reality. - Regression test asserts the listener forwards a PermissionForward to the target room and records the pending reply key — exactly the path that was broken when no chat_fut was in flight. Discord/Slack/WhatsApp transports still acquire perm_rx per message (commands.rs:368 / commands/llm.rs:83 / commands/llm.rs:82). They are not the active transport in this deployment so their per-message acquire remains dormant; the same listener pattern should be applied to them as follow-up work in 884 phase 2.	2026-04-30 13:53:46 +00:00
Timmy	3a9ff5e740	fix(mcp): restore HTTP /mcp endpoint after 855 regression 855 deleted the HTTP /mcp route and pointed agents at ws://...crdt-sync, but Claude Code's .mcp.json doesn't speak ws:// and the rendezvous WS never had MCP method handlers wired up — so every spawned Claude Code agent (gateway-routed and local) booted with zero huskies tools and died on --permission-prompt-tool=mcp__huskies__prompt_permission. Restore mcp_post_handler / mcp_get_handler / handle_initialize, re-add the /mcp route, and revert all three .mcp.json writers to emit http://localhost:{port}/mcp with explicit "type": "http". Reuses the already-extracted gateway::jsonrpc types and the surviving dispatch_tool_call / list_tools surfaces — net add ~140 lines. Federation work is unaffected: /crdt-sync continues to do CRDT sync, which is what it was actually doing. MCP-over-WebSocket for cross-LAN agents was never wired up by 855 and can be done as a proper follow-up with a regression test that boots a real claude and verifies tool registration. Verified end-to-end: /mcp initialize, tools/list (74 tools incl. prompt_permission), and tools/call all respond correctly from inside the rebuilt container. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-30 14:03:16 +01:00
dave	a796bd933f	huskies: merge 879	2026-04-30 00:26:35 +00:00
dave	8fc581ad6b	huskies: merge 878	2026-04-29 23:53:15 +00:00
dave	1d86202abb	huskies: merge 868	2026-04-29 23:34:24 +00:00
dave	9a3f60d5d3	huskies: merge 866	2026-04-29 22:47:53 +00:00
dave	a49f668b5a	huskies: merge 867	2026-04-29 22:17:08 +00:00
dave	e56bd2d834	huskies: merge 877	2026-04-29 22:10:47 +00:00
dave	7e2f122d36	huskies: merge 880	2026-04-29 21:46:12 +00:00
dave	4d24b5b661	huskies: merge 855	2026-04-29 21:41:03 +00:00
dave	c0801c3894	huskies: merge 875	2026-04-29 18:44:50 +00:00
dave	a956a98197	huskies: merge 847	2026-04-29 18:40:08 +00:00
dave	320be659c0	huskies: merge 816	2026-04-29 17:57:34 +00:00
dave	fc86774618	huskies: merge 857	2026-04-29 17:45:51 +00:00
dave	8a42839b37	huskies: merge 820	2026-04-29 17:20:32 +00:00
dave	9bd3c10a09	huskies: merge 872	2026-04-29 15:59:37 +00:00
dave	7f8467b068	huskies: merge 871	2026-04-29 15:45:54 +00:00
dave	db65271587	huskies: merge 842	2026-04-29 15:10:11 +00:00
dave	f3e4d5d072	huskies: merge 869	2026-04-29 14:58:11 +00:00
dave	59b626d3ba	huskies: merge 824	2026-04-29 13:42:58 +00:00
dave	b4854cf693	huskies: merge 862	2026-04-29 13:28:37 +00:00
dave	11d111360d	huskies: merge 858	2026-04-29 10:47:18 +00:00
dave	4ed1fb5110	huskies: merge 854	2026-04-29 09:29:32 +00:00
dave	a65cd86c8f	huskies: merge 798	2026-04-28 16:25:33 +00:00
dave	1e40215c3e	huskies: merge 797	2026-04-28 16:06:50 +00:00
dave	32a3465fc4	fix: tell the truth about run_tests being blocking `tool_run_tests` in `server/src/http/mcp/shell_tools/script.rs` is fully blocking server-side: it spawns the test child, polls every 1s server-side until exit (or `TEST_TIMEOUT_SECS = 1200s`), and returns the full {passed, exit_code, output} directly. There is NO async/started-status return path. But two places told agents the wrong story: 1. `tools_list/system_tools.rs` description claimed "Returns immediately with status: started. Poll get_test_result..." — agents read tool descriptions for protocol semantics, so they followed this and burned turns polling get_test_result. 2. `agents.toml` had been correctly saying it blocks, but my last commit (`776aad38`) "fixed" it the wrong way based on a misread of the code. Now both say: run_tests blocks server-side, returns the full result, do not poll get_test_result. get_test_result remains for external observers (UI checking on a job another caller started). Reverts the prompt change in `776aad38` with the correct text.	2026-04-28 15:59:06 +00:00
dave	f63464852b	huskies: merge 770	2026-04-28 15:38:34 +00:00
dave	1946709681	huskies: merge 788	2026-04-28 15:28:31 +00:00
dave	aed29b952c	huskies: merge 769	2026-04-28 13:42:47 +00:00
dave	b7db6d6aae	huskies: merge 775	2026-04-28 12:25:59 +00:00
dave	e9ed58502a	huskies: merge 771	2026-04-28 12:08:44 +00:00
dave	05d057a40a	huskies: merge 782	2026-04-28 11:02:02 +00:00

1 2 3 4 5 ...

397 Commits