huskies

Author	SHA1	Message	Date
dave	77081926d1	huskies: merge 715_refactor_decompose_frontend_src_components_workitemdetailpanel_tsx_827_lines	2026-04-27 17:08:26 +00:00
dave	fce7e16811	huskies: merge 716_story_statuseventbuffer_bounded_per_instance_buffer_over_services_status_broadcaster	2026-04-27 17:03:12 +00:00
dave	2b28ccbf2c	Merge spike branch 'feature/story-679_spike_migrate_inter_component_http_to_signed_crdt_websocket_bus' into master	2026-04-27 17:01:48 +00:00
dave	4a0f57478c	huskies: merge 671_refactor_migrate_pipeline_state_consumers_from_string_comparisons_to_typed_pipelinestage_enum	2026-04-27 16:39:39 +00:00
dave	39a9766d7d	huskies: merge 677_refactor_reject_promotion_to_current_coder_of_work_items_with_junk_only_acceptance_criteria	2026-04-27 16:30:35 +00:00
dave	5884dac825	chore: gitignore .huskies/session_store.json (runtime artifact) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-27 14:59:33 +00:00
dave	0b7f7dfdf7	config: bump sonnet coder-1/2/3 max_turns 50→80 Stories like the broadcaster-consumer migrations legitimately need ~60 substantive turns (16 ProjectConfig initializer sites + main.rs subscriber + reading existing patterns to mirror). 50 was too tight. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-27 14:56:24 +00:00
dave	756c790b9f	spike 679: document HTTP-to-CRDT-bus migration plan Full inventory of all gateway and project server endpoints with caller, purpose, latency/freshness/durability requirements. Classifies each as write/read/external-webhook/frontend-asset. Maps write endpoints to target CRDT collections, proposes RPC frame shapes for read endpoints, drafts the unsigned read-RPC protocol (envelope, correlation IDs, TTL, error codes, peer-offline handling), lists in-memory state needing CRDT migration with proposed types, and defines a wave-ordered migration plan with explicit dependencies (story 665 Ed25519 auth as the blocker for write migrations).	2026-04-27 14:49:38 +00:00
dave	6a582d73b6	huskies: merge 675_bug_mergemaster_silently_exits_when_feature_branch_has_zero_commits_ahead_of_master	2026-04-27 14:43:54 +00:00
dave	ea872fa01c	huskies: merge 676_bug_apply_and_persist_silently_drops_ops_when_persist_channel_send_fails	2026-04-27 14:38:11 +00:00
dave	cbb0a50729	huskies: merge 649_story_migrate_whatsapp_transport_to_status_broadcaster	2026-04-27 14:19:19 +00:00
dave	6c8043d866	huskies: merge 648_story_migrate_discord_transport_to_status_broadcaster	2026-04-27 14:01:32 +00:00
dave	9040d18f50	huskies: merge 664_story_crdt_lamport_clock_inner_seq_must_resume_from_max_own_author_seq_1_instead_of_resetting_to_1_on_restart_phase_c	2026-04-27 12:30:44 +00:00
dave	25603bb8cb	huskies: merge 669_story_migrate_slack_transport_to_status_broadcaster	2026-04-27 11:57:06 +00:00
dave	5da29c3d91	huskies: merge 668_bug_pipeline_advances_coder_work_to_merge_when_gates_passed_false	2026-04-27 11:39:11 +00:00
dave	65d2fb210c	huskies: merge 655_bug_matrix_bot_spawns_its_own_timerstore_instead_of_using_shared_appcontext_timer_store	2026-04-27 11:32:51 +00:00
dave	ac85cfce5d	huskies: merge 652_story_pass_resume_session_id_on_agent_respawn_so_new_sessions_inherit_prior_reasoning	2026-04-27 11:27:50 +00:00
dave	144f07f412	huskies: merge 644_story_chat_transport_consumers_slack_discord_whatsapp_matrix_for_the_unified_status_broadcaster	2026-04-27 11:22:52 +00:00
dave	75533225e4	fix: commit minor fmt residue blocking mergemaster cherry-picks Master had 8 uncommitted single-line whitespace changes (blank-line trimming in test mod headers, etc.) left over from a previous mergemaster cargo-fmt run that didn't get committed. Each subsequent merge attempt hit: cherry-pick failed: 'Your local changes to the following files would be overwritten by merge. Please commit your changes or stash them.' So merges had been silently un-mergeable for the last several rounds — mergemaster correctly reported the issue but had no way to fix master's own state from inside the merge_workspace. Files affected (all whitespace-only): - chat/transport/matrix/bot/messages/{handle_message,on_room_message}.rs - chat/transport/slack/commands/{llm,mod}.rs - http/mcp/agent_tools/worktree.rs - http/workflow/story_ops/{create,criterion,update}.rs cargo clippy --all-targets -- -D warnings: clean cargo fmt --all --check: clean 2636 tests pass.	2026-04-27 11:17:31 +00:00
dave	56c979c950	config: tell mergemaster to use 5-min sleeps between merge_agent_work polls Real cause of mergemaster turn-burnout: not merge conflicts, just polling overhead. The server-side tool_merge_agent_work IS designed to block until the merge completes, but the MCP client times out after 60s. The agent then polls get_merge_status, with 30-60s sleeps between polls — each poll cycle costs 2 turns (sleep + tool call). The merge takes 5-10 min for a clean run, so the agent burns 10-20 turns just waiting. Updated workflow tells mergemaster: - 'operation timed out' is normal, do NOT immediately re-call (would queue a duplicate merge) - Use Bash sleep 300 (one 5-min wait = 1 turn) between polls - Cap at 3 polls = 15 minutes total, plenty for any clean merge - Reserve turns for actual fix-up work if gates fail Combined with the earlier 30→60 turn / $5→$15 budget bump, this should land any merge with no real conflicts in 3-5 turns total. Plenty of headroom remaining for genuine gate-fix work.	2026-04-27 10:50:44 +00:00
dave	7b305ba892	config: bump mergemaster max_turns 30→60, budget $5→$15 30 turns is too tight for non-trivial merge gate failures. Combined with the 3-retry cap, stories with any post-merge fix-up needed (cargo fmt nits, slightly out-of-date diffs after parallel merges, etc.) get permanently blocked. This is a stopgap until story 668 lands (which will keep gates_passed=false work in the coder stage entirely, so mergemaster only ever sees clean diffs and the original 30 turns / $5 is fine again).	2026-04-27 10:41:45 +00:00
dave	7408cc5b4b	fix(crdt_snapshot): per-thread SNAPSHOT_STATE in cfg(test) instead of shared static (bug 669) Replaces the test-time GLOBAL_STATE_LOCK approach (which was just disguised single-threading) with proper test isolation: each test thread gets its own SnapshotState via a thread_local!. Pattern matches crdt_state::CRDT_STATE_TL — production keeps the global OnceLock; tests get a per-thread OnceLock that's accessed through a snapshot_state() helper. The unsafe `&*ptr` cast to 'static is safe because the thread_local lives as long as the spawning test thread. The race: latest_snapshot_available_after_compaction captured at_seq from a freshly-generated snapshot, then asserted it equalled SNAPSHOT_STATE's latest.at_seq. With shared SNAPSHOT_STATE, another test thread's apply_compaction could overwrite latest_snapshot between capture and read. Per-thread state eliminates the race at its source. ALL_OPS / VECTOR_CLOCK stay shared — the tests don't assert on absolute counts, only on (this-thread's at_seq) == (this-thread's latest.at_seq). 5 consecutive default-parallel `cargo test --bin huskies` runs all green at 2636/2636.	2026-04-27 02:49:53 +00:00
dave	fc71c22305	Revert "fix(crdt_snapshot): serialise tests that share global SNAPSHOT_STATE / ALL_OPS / VECTOR_CLOCK (bug 669)" This reverts commit `8e608feec1`.	2026-04-27 02:45:01 +00:00
dave	8e608feec1	fix(crdt_snapshot): serialise tests that share global SNAPSHOT_STATE / ALL_OPS / VECTOR_CLOCK (bug 669) The crdt_snapshot tests share three global statics: - SNAPSHOT_STATE (latest_snapshot, pending_acks, pending_at_seq) — coordination state - crdt_state::ALL_OPS / VECTOR_CLOCK — op journal + vector clock Only the per-thread CRDT is thread-local (init_for_test); these other globals are shared across test threads. Under default cargo test parallelism, two tests running concurrently interleave their op writes and snapshot generation, so assertions like assert_eq!(at_seq, 4) fail with at_seq=5 (the other thread's ops snuck in). Add a module-level GLOBAL_STATE_LOCK that all 17 affected tests grab at the top. unwrap_or_else(\|e\| e.into_inner()) handles the case where a prior test panicked while holding the lock (poisoned). Fixes bug 669 — these two tests were the silent killer behind every agent's script/test failure (see also bug 668, which advanced agents to merge despite gates_passed=false; that compounded this by sending failing-tests worktrees to mergemaster). All 2636 tests now pass under default parallel execution (no --test-threads=1 needed). Closes #669.	2026-04-27 02:43:49 +00:00
dave	404fd396f5	refactor: split chat/transport/whatsapp/commands.rs (837) into mod + llm The 837-line commands.rs is split: - llm.rs: handle_llm_message (LLM turn for non-command messages, ~195 lines) - mod.rs: handle_incoming_message + tests (~660 lines) Tests stay co-located with handle_incoming_message in mod.rs. All 2636 tests pass; clippy clean.	2026-04-27 02:37:22 +00:00
dave	1f02de8cd0	refactor: split chat/transport/slack/commands.rs (875) into mod + llm The 875-line commands.rs is split: - llm.rs: handle_llm_message (LLM turn for non-command messages, ~190 lines) - mod.rs: SlackSlashCommandPayload + slash_command_to_bot_keyword + handle_incoming_message + tests (~700 lines) Tests stay co-located with handle_incoming_message in mod.rs. All 2636 tests pass; clippy clean.	2026-04-27 02:32:11 +00:00
dave	d07728f22b	refactor: split chat/transport/matrix/bot/messages.rs (912) into mod + on_room_message + handle_message The 912-line messages.rs is split: - on_room_message.rs: incoming Matrix event dispatch (~600 lines) - handle_message.rs: LLM turn + reply streaming (~265 lines) - mod.rs: format_user_prompt + tests (~70 lines) Tests stay co-located with format_user_prompt in mod.rs. All 2636 tests pass; clippy clean.	2026-04-27 02:21:54 +00:00
dave	adf936be07	refactor: split http/workflow/story_ops.rs (1256) into create + criterion + update The 1256-line story_ops.rs is split: - create.rs: create_story_file + tests (~232 lines) - criterion.rs: check/add/remove/edit_criterion_in_file + tests (~525 lines) - update.rs: update_story_in_file + yaml helpers + tests (~640 lines) - mod.rs: re-exports (~12 lines) Workflow helpers (read_story_content, write_story_content, slugify_name, etc.) bumped from pub(super) to pub(crate) since they're now consumed across nested sub-modules and from http/mcp/story_tools/. Tests stay co-located. All 2636 tests pass; clippy clean.	2026-04-27 02:13:31 +00:00
dave	34a399b838	refactor: split http/mcp/shell_tools.rs (1144) into mod + exec + script The 1144-line shell_tools.rs is split: - exec.rs: validate_working_dir + tool_run_command + handle_run_command_sse + their tests (~550 lines) - script.rs: tool_run_tests + tool_get_test_result + tool_run_build + tool_run_lint + helpers + their tests (~610 lines) - mod.rs: re-exports (~12 lines) Tests stay co-located. All 2636 tests pass; clippy clean.	2026-04-27 02:04:04 +00:00
dave	928d613190	refactor: split http/mcp/agent_tools.rs (1094) into mod + worktree The 1094-line agent_tools.rs is split: - worktree.rs: tool_create/list/remove_worktree, tool_get_editor_command, get_worktree_commits + their tests (~190 lines) - mod.rs: agent lifecycle tools (start/stop/list/output/config/wait/ remaining_turns_and_budget/read_coverage helper) + their tests Tests stay co-located. All 2636 tests pass; clippy clean.	2026-04-27 01:57:46 +00:00
dave	a8ead9cd10	refactor: split http/mcp/diagnostics.rs (861) into mod + permission + usage The 861-line diagnostics.rs is split: - permission.rs: tool_prompt_permission + helpers + their tests (584 lines) - usage.rs: tool_get_token_usage + tests (122 lines) - mod.rs: server_logs, rebuild, version, loc_file, dump_crdt, move_story + tests (185 lines) Tests stay co-located. The bigger sub-modules (permission at 584 with tests mostly under 800; usage at 122) are well within the 800-line guide. Also added #[allow(unused_imports)] to two now-pedantic re-exports in service/diagnostics/mod.rs that the split made flag. All 2636 tests pass; clippy clean.	2026-04-27 01:51:36 +00:00
dave	9fbbfcd585	huskies: merge 667_story_agent_prompt_target_maximum_file_size_of_800_lines_as_a_soft_guide_decompose_larger_files_by_concern	2026-04-27 01:37:52 +00:00
dave	a1afe069fa	chore: remove test_fail.txt accidentally committed	2026-04-27 01:32:49 +00:00
dave	c600b94f4e	chore: remove dangling orphan files accidentally added in `b340aa97` server/src/agents/pool/lifecycle.rs and server/src/chat/transport/matrix/notifications.rs were untracked leftovers from an abandoned WIP stash that 'git add -A' picked up. Neither is declared as a mod anywhere — they're dangling code that doesn't get compiled but pollutes the tree.	2026-04-27 01:32:38 +00:00
dave	b340aa97b0	fix: clean up clippy warnings + cargo fmt across post-refactor surface The 13-file refactor pass (commits `db00a5d4` through `eca15b4e`) introduced ~89 clippy errors and 38 cargo fmt issues — every agent in every worktree hit them on script/test, burning their turn budget on cleanup before doing real story work. This is the silent kill behind 644, 652, 655, 664, 667 all hitting watchdog limits this round. Changes: - cargo fmt --all across 37 files (formatting normalisation only) - #![allow(unused_imports, dead_code)] on 24 split modules where the python-script splitter imported liberally to be safe; tighter cleanup per-import will happen as agents touch each module - Removed truly-dead re-exports (cleanup_merge_workspace, slog_warn from http/mcp/mod.rs, CliArgs/print_help from main.rs) - Prefixed _auth_msg in crdt_sync/server.rs (handshake helper return is bound but not consumed) - Converted dangling /// doc block in crdt_sync/mod.rs to //! so it attaches to the module - Removed empty lines after doc comments in 4 spots (clippy lint) All 2636 tests pass; clippy --all-targets -- -D warnings clean.	2026-04-27 01:32:08 +00:00
dave	0e73a34791	Merge spike branch 'feature/story-613_spike_architecture_roadmap_transports_services_state_machine_crdt' into master	2026-04-27 00:25:47 +00:00
dave	06035f20ad	fix: restore #[tokio::main] on main(), #[cfg(unix)] on platform tests, #[allow] on run_pty_session/AuthListenerResult The biggest miss is #[tokio::main] — without it, async fn main() doesn't compile, and the binary in every worktree fails 'cargo check'. Agents in those worktrees burn their turn budgets trying to fix the build before they can do real work, then get killed by the watchdog. That's why all three in-flight stories failed. Other restored attributes: - #[cfg(unix)] on 4 tests in merge/squash and scaffold (skip on non-Unix) - #[allow(dead_code)] on AuthListenerResult test enum - #[allow(clippy::too_many_arguments)] on run_pty_session Same root cause as the earlier #[test] attribute losses: my line ranges started at the fn line, missing the leading attribute on the previous line.	2026-04-26 23:38:17 +00:00
dave	eca15b4ee7	refactor: split agents/pool/start.rs into mod.rs + validation.rs + spawn.rs The 1630-line start.rs is split into a sub-module directory: - validation.rs: validate_agent_stage + read_front_matter_agent helpers (69 lines) - spawn.rs: run_agent_spawn — the background async work that was inlined as a tokio::spawn closure body inside start_agent (359 lines) - mod.rs: AgentPool::start_agent orchestrator + tests (1062 lines) Stage validation and front-matter agent reading are pre-lock pure helpers that naturally extract. The spawn closure body becomes a free async fn that takes the previously-cloned values as parameters; rebound to the original _clone / _owned names at the top of the body so the actual work code is a verbatim copy. No behaviour change. All 23 start tests pass; full suite green.	2026-04-26 22:12:04 +00:00
dave	40f1794d41	fix: restore #[test] attributes on parse_no_args, peer_receives_op_encoded_via_wire_codec, keepalive_constants_are_correct Same root cause as `0d805313`: when extracting a test that's the FIRST inside its mod block, the slicer started at the fn line and missed the leading #[test] attribute on the previous line. Test count now matches pre-split count (2636).	2026-04-26 22:04:12 +00:00
dave	0d805313d6	fix: restore #[test] and #[should_panic] attributes on panics_on_duplicate_agent_names Lost in commit `db00a5d4` when extracting tests from main.rs into cli.rs; the line range used for the panics_on_duplicate_agent_names test in main.rs started at the fn signature instead of the attribute line.	2026-04-26 22:01:06 +00:00
dave	0e09a1ed4b	refactor: extract auth handshake from crdt_sync/server.rs into handshake.rs The 1680-line server.rs is split: - handshake.rs: perform_auth_handshake helper + close_with_auth_failed + auth tests + start_auth_listener / close_listener_auth_failed test helpers + AuthListenerResult enum - server.rs: crdt_sync_handler (now invokes perform_auth_handshake) + wait_for_sync_text + broadcast/e2e/keepalive tests Auth handshake (Steps 1-3 of the WebSocket handshake) is a self-contained sequence that takes &mut SplitSink + &mut SplitStream and returns Option<AuthMessage>. The caller observes None to mean the connection has already been closed with the appropriate close code. No behaviour change. All 63 crdt_sync tests pass; full suite green.	2026-04-26 21:49:46 +00:00
dave	db00a5d4b5	refactor: split main.rs by extracting CLI parsing into cli.rs The 1258-line main.rs is split into: - main.rs: mod declarations, async fn main + panics_on_duplicate_agent_names test (894 lines) - cli.rs: CliArgs struct, parse_cli_args, print_help, resolve_path_arg + their tests (372 lines) main.rs cannot itself become a directory (binary crate must have main.rs at the crate root); cli.rs is a sibling module. No behaviour change. All cli tests pass; full suite green.	2026-04-26 21:41:39 +00:00
dave	a86448f6a6	refactor: split chat/transport/matrix/config.rs into mod.rs + loading.rs The 1260-line config.rs is split into: - mod.rs: BotConfig struct + small impl + default helpers + tests (1047 lines) - loading.rs: BotConfig::load + save_ambient_rooms (223 lines) Tests stay co-located. No behaviour change. All 41 matrix::config tests pass; full suite green.	2026-04-26 21:37:39 +00:00
dave	ca72f36c78	refactor: split agents/pool/pipeline/advance.rs into mod.rs + helpers.rs The 1353-line advance.rs is split into: - mod.rs: impl AgentPool with run_pipeline_advance + start_mergemaster_or_block + tests (1244 lines) - helpers.rs: spawn_pipeline_advance, resolve_qa_mode_from_store, write_review_hold_to_store, should_block_story (128 lines) Tests stay co-located with run_pipeline_advance which they exercise. No behaviour change. All 10 advance tests pass; full suite green.	2026-04-26 21:35:04 +00:00
dave	5aedf94512	refactor: split pipeline_state.rs into 4 sub-modules with co-located tests The 1411-line pipeline_state.rs is split into: - mod.rs: types, transition(), execution_transition(), labels + transition tests (885 lines) - events.rs: TransitionFired, EventBus, TransitionSubscriber + event-bus tests (114 lines) - projection.rs: ProjectionError, TryFrom<&PipelineItemView>, read_typed + projection tests (379 lines) - subscribers.rs: 5 concrete TransitionSubscriber stubs (95 lines) Tests stay co-located. No behaviour change. All 42 pipeline_state tests pass; full suite green.	2026-04-26 21:30:55 +00:00
dave	f1e42710b5	refactor: split llm/providers/claude_code.rs into mod.rs + parse.rs + events.rs The 1427-line claude_code.rs is split into: - parse.rs: parse_assistant_message + parse_tool_results + tests (332 lines) - events.rs: process_json_event + handle_stream_event + tests (749 lines) - mod.rs: doc, types (ClaudeCodeResult, ClaudeCodeProvider), chat_stream, run_pty_session (395 lines) Tests stay co-located. No behaviour change. All 44 claude_code tests pass; full suite green.	2026-04-26 21:22:08 +00:00
dave	ce94dd0af4	refactor: split agents/merge.rs into mod.rs + squash.rs + conflicts.rs The 1772-line merge.rs is split into: - conflicts.rs: try_resolve_conflicts + resolve_simple_conflicts + tests (351 lines) - squash.rs: run_squash_merge orchestrator + cleanup + run_merge_quality_gates + tests (1306 lines) - mod.rs: doc, types (MergeJobStatus, MergeJob, MergeReport, SquashMergeResult), re-exports (52 lines) Tests stay co-located. No behaviour change. All 20 merge tests pass; full suite green (2635 tests with --test-threads=1).	2026-04-26 21:15:06 +00:00
dave	851324740c	refactor: split http/mcp/story_tools.rs into 5 sub-modules by item type The 1864-line story_tools.rs is split into: - story.rs: story creation/lifecycle/management (903 lines incl. tests) - criteria.rs: acceptance-criteria tools (534 lines) - bug.rs: bug item tools (318 lines) - spike.rs: spike item tools (120 lines) - refactor.rs: refactor item tools (60 lines) - mod.rs: re-exports (25 lines) Tests stay co-located with the code they exercise; setup_git_repo_in and setup_story_for_update test helpers are duplicated into the modules that need them rather than centralised, since they are tiny and test-only. No behaviour change. All 60 story_tools tests pass; full suite green (2635 tests with --test-threads=1).	2026-04-26 21:11:09 +00:00
dave	0dff2d5c47	refactor: split http/mcp/mod.rs into 3 logical files The 1882-line mod.rs is split into: - tools_list.rs: handle_tools_list — the static schema for every MCP tool (1172 lines) - dispatch.rs: handle_tools_call — the tool-name → *_tools router (157 lines) - mod.rs: doc, sub-mod decls, JsonRpc structs, Poem handlers, handle_initialize (586 lines) Tests stay co-located with the code they exercise. No behaviour change. All 267 http::mcp tests pass; full suite green (2635 tests with --test-threads=1).	2026-04-26 21:05:07 +00:00
dave	8f91f55cd1	refactor: split io/fs/scaffold.rs into 4 sub-modules with co-located tests The 2045-line scaffold.rs is split into a sub-module directory: - templates.rs: STORY_KIT_* and DEFAULT_* template constants (161 lines) - detect.rs: detect_components_toml + detect_script_{build,lint,test} + tests (989 lines) - helpers.rs: write_*_if_missing, generate_project_toml, gitignore helpers (166 lines) - mod.rs: scaffold_story_kit orchestrator + scaffold tests (756 lines) include_str! paths in templates.rs are adjusted (one extra ../) for the deeper nesting. Tests stay co-located with the code they exercise per Rust convention. No behaviour change. All 77 scaffold tests pass; full suite green (2635 tests with --test-threads=1).	2026-04-26 21:00:31 +00:00

1 2 3 4 5 ...

3411 Commits