huskies

Author	SHA1	Message	Date
dave	615e1c7f73	huskies: merge 738_refactor_delete_fs_shadow_code_from_lifecycle_rs_and_the_work_directory_watcher	2026-04-27 19:56:53 +00:00
dave	26f9f3f7fc	huskies: merge 729_story_store_story_name_as_a_crdt_field_separate_from_the_story_id	2026-04-27 19:09:56 +00:00
dave	646dc490b8	huskies: merge 720_refactor_add_mesh_status_mcp_tool_read_only_peer_mesh_diagnostics	2026-04-27 18:18:51 +00:00
dave	4a0f57478c	huskies: merge 671_refactor_migrate_pipeline_state_consumers_from_string_comparisons_to_typed_pipelinestage_enum	2026-04-27 16:39:39 +00:00
dave	39a9766d7d	huskies: merge 677_refactor_reject_promotion_to_current_coder_of_work_items_with_junk_only_acceptance_criteria	2026-04-27 16:30:35 +00:00
dave	5da29c3d91	huskies: merge 668_bug_pipeline_advances_coder_work_to_merge_when_gates_passed_false	2026-04-27 11:39:11 +00:00
dave	75533225e4	fix: commit minor fmt residue blocking mergemaster cherry-picks Master had 8 uncommitted single-line whitespace changes (blank-line trimming in test mod headers, etc.) left over from a previous mergemaster cargo-fmt run that didn't get committed. Each subsequent merge attempt hit: cherry-pick failed: 'Your local changes to the following files would be overwritten by merge. Please commit your changes or stash them.' So merges had been silently un-mergeable for the last several rounds — mergemaster correctly reported the issue but had no way to fix master's own state from inside the merge_workspace. Files affected (all whitespace-only): - chat/transport/matrix/bot/messages/{handle_message,on_room_message}.rs - chat/transport/slack/commands/{llm,mod}.rs - http/mcp/agent_tools/worktree.rs - http/workflow/story_ops/{create,criterion,update}.rs cargo clippy --all-targets -- -D warnings: clean cargo fmt --all --check: clean 2636 tests pass.	2026-04-27 11:17:31 +00:00
dave	34a399b838	refactor: split http/mcp/shell_tools.rs (1144) into mod + exec + script The 1144-line shell_tools.rs is split: - exec.rs: validate_working_dir + tool_run_command + handle_run_command_sse + their tests (~550 lines) - script.rs: tool_run_tests + tool_get_test_result + tool_run_build + tool_run_lint + helpers + their tests (~610 lines) - mod.rs: re-exports (~12 lines) Tests stay co-located. All 2636 tests pass; clippy clean.	2026-04-27 02:04:04 +00:00
dave	928d613190	refactor: split http/mcp/agent_tools.rs (1094) into mod + worktree The 1094-line agent_tools.rs is split: - worktree.rs: tool_create/list/remove_worktree, tool_get_editor_command, get_worktree_commits + their tests (~190 lines) - mod.rs: agent lifecycle tools (start/stop/list/output/config/wait/ remaining_turns_and_budget/read_coverage helper) + their tests Tests stay co-located. All 2636 tests pass; clippy clean.	2026-04-27 01:57:46 +00:00
dave	a8ead9cd10	refactor: split http/mcp/diagnostics.rs (861) into mod + permission + usage The 861-line diagnostics.rs is split: - permission.rs: tool_prompt_permission + helpers + their tests (584 lines) - usage.rs: tool_get_token_usage + tests (122 lines) - mod.rs: server_logs, rebuild, version, loc_file, dump_crdt, move_story + tests (185 lines) Tests stay co-located. The bigger sub-modules (permission at 584 with tests mostly under 800; usage at 122) are well within the 800-line guide. Also added #[allow(unused_imports)] to two now-pedantic re-exports in service/diagnostics/mod.rs that the split made flag. All 2636 tests pass; clippy clean.	2026-04-27 01:51:36 +00:00
dave	b340aa97b0	fix: clean up clippy warnings + cargo fmt across post-refactor surface The 13-file refactor pass (commits `db00a5d4` through `eca15b4e`) introduced ~89 clippy errors and 38 cargo fmt issues — every agent in every worktree hit them on script/test, burning their turn budget on cleanup before doing real story work. This is the silent kill behind 644, 652, 655, 664, 667 all hitting watchdog limits this round. Changes: - cargo fmt --all across 37 files (formatting normalisation only) - #![allow(unused_imports, dead_code)] on 24 split modules where the python-script splitter imported liberally to be safe; tighter cleanup per-import will happen as agents touch each module - Removed truly-dead re-exports (cleanup_merge_workspace, slog_warn from http/mcp/mod.rs, CliArgs/print_help from main.rs) - Prefixed _auth_msg in crdt_sync/server.rs (handshake helper return is bound but not consumed) - Converted dangling /// doc block in crdt_sync/mod.rs to //! so it attaches to the module - Removed empty lines after doc comments in 4 spots (clippy lint) All 2636 tests pass; clippy --all-targets -- -D warnings clean.	2026-04-27 01:32:08 +00:00
dave	851324740c	refactor: split http/mcp/story_tools.rs into 5 sub-modules by item type The 1864-line story_tools.rs is split into: - story.rs: story creation/lifecycle/management (903 lines incl. tests) - criteria.rs: acceptance-criteria tools (534 lines) - bug.rs: bug item tools (318 lines) - spike.rs: spike item tools (120 lines) - refactor.rs: refactor item tools (60 lines) - mod.rs: re-exports (25 lines) Tests stay co-located with the code they exercise; setup_git_repo_in and setup_story_for_update test helpers are duplicated into the modules that need them rather than centralised, since they are tiny and test-only. No behaviour change. All 60 story_tools tests pass; full suite green (2635 tests with --test-threads=1).	2026-04-26 21:11:09 +00:00
dave	0dff2d5c47	refactor: split http/mcp/mod.rs into 3 logical files The 1882-line mod.rs is split into: - tools_list.rs: handle_tools_list — the static schema for every MCP tool (1172 lines) - dispatch.rs: handle_tools_call — the tool-name → *_tools router (157 lines) - mod.rs: doc, sub-mod decls, JsonRpc structs, Poem handlers, handle_initialize (586 lines) Tests stay co-located with the code they exercise. No behaviour change. All 267 http::mcp tests pass; full suite green (2635 tests with --test-threads=1).	2026-04-26 21:05:07 +00:00
dave	795b172bba	Revert "refactor: split top-5 largest files into mod.rs + tests.rs" This reverts commit `65a3767a7a`.	2026-04-26 20:15:58 +00:00
dave	65a3767a7a	refactor: split top-5 largest files into mod.rs + tests.rs Five files in server/src/ exceeded 1500 lines, with 50–75% of the line count being inline `#[cfg(test)] mod tests { ... }` blocks. Agents working on these files have to navigate huge buffers via Read calls, costing turn budget that could go toward actual work. Pattern: convert `foo.rs` to `foo/mod.rs` + `foo/tests.rs`. Rust resolves `mod foo;` to either form, so no parent-module changes needed. Before / after (production-code lines, what an agent has to navigate when editing the module): crdt_sync.rs: 3672 → 1003 (mod.rs) + 2667 (tests.rs) crdt_state.rs: 2122 → 1263 (mod.rs) + 854 (tests.rs) io/fs/scaffold.rs: 2045 → 702 (mod.rs) + 1342 (tests.rs) http/mcp/mod.rs: 1882 → 1410 (mod.rs) + 472 (tests.rs) http/mcp/story_tools.rs: 1864 → 725 (mod.rs) + 1137 (tests.rs) Side change: scaffold/mod.rs's include_str! paths got an extra `../` because the file moved one directory deeper. Tests: full `cargo test` suite passes (2635 passed, 0 failed). Formatting: cargo fmt --check clean. Motivation: today's agent thrashing on 644 / 650 / 652 was partly due to cumulative-counting (now fixed by 650) but also genuinely due to file size — sonnet's 50-turn budget barely covers reading these files plus making the change. Smaller production-code files mean more turn budget left for the actual work. Committed straight to master because this is an enabling refactor for agent autonomy work; running it through the normal pipeline would require an agent that has to navigate the very files it's about to split, defeating the purpose.	2026-04-26 20:08:24 +00:00
dave	365b907ba4	huskies: merge 650_bug_watchdog_turns_used_and_budget_used_usd_accumulate_across_all_sessions_restart_counts_against_limits_from_prior_runs	2026-04-26 16:24:10 +00:00
dave	148c88bd40	huskies: merge 646_bug_watchdog_from_bug_624_is_not_actually_enforcing_max_turns_max_budget_usd_in_production	2026-04-26 13:11:48 +00:00
dave	120745d102	huskies: merge 640_bug_create_story_create_refactor_create_bug_silently_drop_the_depends_on_parameter	2026-04-25 19:37:55 +00:00
dave	4b089c1ed8	huskies: merge 626_refactor_introduce_services_bundle_and_migrate_appcontext_matrix_transport	2026-04-25 15:08:46 +00:00
dave	e20083a283	huskies: merge 624_bug_agent_turn_and_budget_limits_not_enforced_coder_1_ran_5_6x_over_max_turns	2026-04-25 13:11:30 +00:00
dave	c16d9e471d	huskies: merge 618_story_extract_mcp_only_domain_services	2026-04-24 21:16:19 +00:00
dave	62bfaf20f4	huskies: merge 611_story_extract_settings_service	2026-04-24 17:11:55 +00:00
dave	da6ae89667	huskies: merge 610_story_extract_wizard_service	2026-04-24 16:46:09 +00:00
dave	2f07365745	huskies: merge 604_story_service_module_conventions_and_first_extraction	2026-04-24 13:45:22 +00:00
dave	df2f20a5e5	huskies: merge 589_story_wizard_auto_detects_project_components_and_configures_scripts_accordingly	2026-04-16 00:22:53 +00:00
dave	4553d7215a	huskies: merge 586_bug_wizard_skips_context_and_stack_generation_when_files_already_exist_from_scaffold	2026-04-15 23:52:25 +00:00
dave	4a1c6b4cfa	huskies: merge 585_bug_bot_not_aware_of_actual_running_port_defaults_to_3001	2026-04-15 23:47:37 +00:00
dave	ce37281333	huskies: merge 571_story_expose_agent_remaining_turns_and_budget_via_mcp_tool	2026-04-15 18:30:32 +00:00
dave	7fa31c03a3	huskies: merge 573_story_remove_criterion_mcp_tool_to_delete_an_acceptance_criterion	2026-04-15 13:23:18 +00:00
dave	ec40b4771b	huskies: merge 572_story_edit_criterion_mcp_tool_to_update_acceptance_criteria_text	2026-04-15 13:03:55 +00:00
dave	8482df2f4e	huskies: merge 570_bug_merge_agent_work_should_check_if_story_is_already_done_before_attempting_merge	2026-04-14 16:15:29 +00:00
dave	df5ba8ebab	huskies: merge 560_story_make_merge_agent_work_return_results_like_run_tests_instead_of_polling	2026-04-14 10:26:44 +00:00
dave	ff1149750b	huskies: merge 561_bug_mcp_tools_matching_mcp_huskies_allowlist_still_trigger_permission_prompts	2026-04-14 10:19:51 +00:00
dave	28777b0c77	fix: simplify boolean in validate_working_dir to satisfy clippy nonminimal_bool Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-14 09:52:51 +00:00
dave	f412c7dee6	fix: cargo fmt the merge_workspace validation code Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-14 09:43:18 +00:00
dave	44fe52195e	fix: allow MCP tools to access merge_workspace so mergemaster can fix conflicts The permission lockdown restricted run_command/run_tests to .huskies/worktrees/ only. The mergemaster could diagnose merge conflict compile errors but couldn't edit files in .huskies/merge_workspace/ to fix them. Add merge_workspace as an allowed path. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-14 09:21:39 +00:00
dave	bd04c6acd7	fix: capture test output with background pipe draining instead of Stdio::inherit Stdio::inherit sent test output to server stdout, making it invisible to agents calling run_tests via MCP. Switch back to Stdio::piped with background drain threads (same pattern as gates.rs) to capture output without the pipe deadlock that caused the original switch to inherit. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-13 16:17:06 +00:00
dave	7977b7c5f8	huskies: merge 555_bug_agent_permission_prompts_flood_matrix_chat_instead_of_being_auto_denied	2026-04-13 15:02:47 +00:00
dave	845b85e7a7	fix: add --all to cargo fmt in script/test and autoformat codebase cargo fmt without --all fails with "Failed to find targets" in workspace repos. This was blocking every story's gates. Also ran cargo fmt --all to fix all existing formatting issues. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-13 14:07:08 +00:00
dave	ed2526ce41	feat: add get_version MCP tool returning version and build hash Agents can now call get_version to see what server version and commit they're running against. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-13 13:50:37 +00:00
dave	5806156af3	huskies: merge 553_story_accept_spike_state_machine_transition_skips_merge_and_goes_directly_to_done	2026-04-13 12:54:09 +00:00
dave	05c3b11e57	huskies: merge 551_bug_get_agent_output_mcp_tool_returns_fetch_failed_for_running_agents	2026-04-12 17:50:44 +00:00
dave	a344cfadee	huskies: merge 544_story_add_run_build_and_run_lint_mcp_tools_backed_by_script_build_and_script_lint	2026-04-12 13:21:41 +00:00
dave	cec62dad1c	huskies: merge 542_refactor_add_doc_comments_to_all_undocumented_source_files_and_generate_source_map_in_readme	2026-04-12 13:16:11 +00:00
dave	5f01631e6a	huskies: merge 543_story_resume_failed_coder_agents_with_resume_instead_of_starting_fresh_sessions	2026-04-12 12:58:42 +00:00
dave	f140238cc3	fix: update run_tests tests for Stdio::inherit and bump tool count to 60 run_tests now uses Stdio::inherit so stdout/stderr aren't captured — tests can only assert on pass/fail and exit code. Tool count bumped from 59 to 60 for the new get_test_result tool. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-12 12:30:10 +00:00
dave	ec6891b5ba	fix: remove stale tests that hang or assert dead behaviour - Remove tool_merge_agent_work_returns_started and tool_get_merge_status_returns_running: these tested the old non-blocking API but tool_merge_agent_work now blocks in a poll loop, causing the tests to hang forever. - Update coder_agents_have_root_cause_guidance: prompt no longer requires "git bisect" — check for bug workflow guidance instead. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-12 12:02:47 +00:00
dave	06defd9596	fix: collapse nested if-let blocks to satisfy clippy collapsible_if lint Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-12 11:43:36 +00:00
dave	0b58b0486c	fix: use Stdio::inherit for run_tests to prevent pipe deadlock spawn() with piped stdout/stderr deadlocks when the test binary produces more output than the OS pipe buffer (64KB). Switch to Stdio::inherit so test output flows to server logs and we can see what's happening. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-12 00:46:43 +00:00
dave	8ae6ca3eb8	fix: make run_tests block server-side instead of requiring agent polling run_tests now spawns the child and blocks in a 1-second poll loop until tests complete or the 20-minute timeout fires. Returns the full result in a single MCP call — agents use 1 turn instead of 50+. Child process is properly killed on timeout (no zombies). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-11 23:07:02 +00:00
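
Note on the run_tests commits above (8ae6ca3eb8, 0b58b0486c, bd04c6acd7): together they describe a child process that is polled server-side until it exits or a timeout fires, with stdout/stderr captured through pipes that are drained on background threads so the child cannot block on a full pipe buffer. The following is a minimal sketch of that pattern using only the Rust standard library. It is not the huskies implementation: `run_with_timeout`, `RunOutcome`, and `drain` are hypothetical names, and the timeout and poll interval are passed as parameters rather than fixed at the 20 minutes and 1 second the log mentions.

```rust
use std::io::Read;
use std::process::{Command, Stdio};
use std::thread;
use std::time::{Duration, Instant};

/// Hypothetical result type: exit code (None if killed or timed out) plus captured output.
pub struct RunOutcome {
    pub exit_code: Option<i32>,
    pub timed_out: bool,
    pub stdout: String,
    pub stderr: String,
}

/// Hypothetical helper: read a pipe to EOF on a background thread so the
/// child never stalls writing into a full OS pipe buffer.
fn drain<R: Read + Send + 'static>(mut pipe: R) -> thread::JoinHandle<String> {
    thread::spawn(move || {
        let mut buf = Vec::new();
        let _ = pipe.read_to_end(&mut buf);
        String::from_utf8_lossy(&buf).into_owned()
    })
}

/// Spawn `cmd`, poll it every `poll_interval`, and kill it once `timeout` elapses.
pub fn run_with_timeout(
    mut cmd: Command,
    timeout: Duration,
    poll_interval: Duration,
) -> std::io::Result<RunOutcome> {
    let mut child = cmd
        .stdin(Stdio::null())
        .stdout(Stdio::piped())
        .stderr(Stdio::piped())
        .spawn()?;

    // Start draining both pipes before waiting on the child.
    let out = drain(child.stdout.take().expect("stdout was piped"));
    let err = drain(child.stderr.take().expect("stderr was piped"));

    let deadline = Instant::now() + timeout;
    let (exit_code, timed_out) = loop {
        if let Some(status) = child.try_wait()? {
            break (status.code(), false);
        }
        if Instant::now() >= deadline {
            let _ = child.kill(); // ignore the error if the child exited in the meantime
            child.wait()?; // reap the child so no zombie is left behind
            break (None, true);
        }
        thread::sleep(poll_interval);
    };

    // The drain threads finish once every holder of the pipe's write end has
    // exited. Grandchildren that inherit the pipe could delay this; a real
    // implementation may need process groups to handle that case.
    Ok(RunOutcome {
        exit_code,
        timed_out,
        stdout: out.join().unwrap_or_default(),
        stderr: err.join().unwrap_or_default(),
    })
}
```

A caller would build a `Command` (for example one pointing at `script/test`), pass it with a timeout and poll interval, and return the `RunOutcome` in a single response. Draining on separate threads is what lets the poll loop use `try_wait` safely even when the child writes more than the pipe buffer holds.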

97 Commits