huskies

Author	SHA1	Message	Date
dave	89058ebd49	huskies: merge 1124 story Persist TransitionFired into a per-sled CRDT event log	2026-05-17 19:37:50 +00:00
dave	c1b7e12b0b	huskies: merge 1122 story Chat-bot switch command reads stale `gateway_projects` Vec instead of live `gateway_projects_store`	2026-05-17 18:49:58 +00:00
dave	10d992a7e4	huskies: merge 1106 story Chat bootstrap Phase 1: `new project` chat command spawns a bare project container and registers it with the gateway	2026-05-16 22:39:20 +00:00
dave	fc5481dbe4	huskies: merge 1093 bug Chat dispatcher spawns one Timmy per inbound message — needs coalesce window + per-session serial lock	2026-05-15 12:03:09 +00:00
Timmy	fe9804b32c	feat: add process_kill module + use it to fix watchdog double-spawn Adds `crate::process_kill` — reliable SIGKILL-with-verify primitives used across the server in place of the various ad-hoc kill paths that ignored their kill-effective return values. The module exposes three pieces: - `sigkill_pids_and_verify(pids)`: SIGKILL each pid and block (up to 2s) until every pid is verified gone. Returns survivors if not. - `pids_matching(pattern)`: pgrep -f wrapper. - `descendant_pids(root)`: recursive pgrep -P walker for process trees. Wires the watchdog's limit-termination path through it, and reorders the protocol to fix the duplicate-coder bug observed on story 1086 (2026-05-15): Before: check_agent_limits set status=Failed before the kill ran. The kill itself was `portable_pty::ChildKiller::kill()`, which sends SIGHUP on Unix — claude-code ignores SIGHUP, so the process kept running while the agent record was already marked terminated. The idempotency check in `start_agent` whitelists Running/Pending, so the next auto-assign pass spawned a fresh agent alongside the still-alive prior one. Two claude PIDs sharing one session_id, racing on the same worktree. After: status update is moved OUT of check_agent_limits and into the caller AFTER the kill is verified. The kill itself is now SIGKILL-the- process-tree-in-the-worktree, with explicit verification that every pid is gone. The idempotency window is closed. The existing watchdog test suite (14 tests) still passes; 7 new tests cover the process_kill primitives directly. `agents/pool/process.rs`'s `kill_all_children` and `kill_child_for_key` still use the old portable_pty SIGHUP path — they have the same bug but in lower-impact code paths (shutdown, operator stop). They will be migrated under a separate story to keep this commit focused. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-05-15 10:36:33 +01:00
dave	1506141155	huskies: merge 1072	2026-05-15 01:27:25 +00:00
dave	da83fcb78d	huskies: merge 1074	2026-05-15 00:01:58 +00:00
dave	5678f2a556	huskies: merge 1061	2026-05-14 20:12:51 +00:00
dave	1f9f34ab58	huskies: merge 1038	2026-05-14 17:06:50 +00:00
dave	311883f45d	huskies: merge 1039	2026-05-14 16:33:47 +00:00
dave	72d79deec9	huskies: merge 1026	2026-05-14 13:00:51 +00:00
dave	4a8ed4348b	huskies: merge 950	2026-05-13 08:46:22 +00:00
dave	f2943c7e69	huskies: merge 948	2026-05-13 04:48:56 +00:00
dave	cd214d7246	huskies: merge 899	2026-05-12 23:16:25 +00:00
dave	937792f208	huskies: merge 898	2026-05-12 21:33:41 +00:00
dave	03a99b3cf1	huskies: merge 927	2026-05-12 17:55:12 +00:00
dave	11d111360d	huskies: merge 858	2026-04-29 10:47:18 +00:00
dave	46b1e84629	huskies: merge 791	2026-04-28 19:18:12 +00:00
dave	01169332b3	huskies: merge 774	2026-04-28 10:51:59 +00:00
dave	0d14fffe1c	huskies: merge 762	2026-04-28 01:31:16 +00:00
dave	de5b585157	huskies: merge 761	2026-04-28 01:11:07 +00:00
dave	88f9e5dd54	huskies: merge 731_refactor_migrate_existing_stories_from_slug_based_ids_to_numeric_only	2026-04-27 20:42:21 +00:00
dave	26f9f3f7fc	huskies: merge 729_story_store_story_name_as_a_crdt_field_separate_from_the_story_id	2026-04-27 19:09:56 +00:00
dave	80661fa622	huskies: merge 727_story_ed25519_node_identity_keypair_generation_persistence_and_identity_endpoint	2026-04-27 18:37:58 +00:00
dave	272a592a4d	huskies: merge 735_story_attach_statuseventbuffer_to_each_agent_session_scoped_per_project_reset_on_restart	2026-04-27 18:06:11 +00:00
dave	4a0f57478c	huskies: merge 671_refactor_migrate_pipeline_state_consumers_from_string_comparisons_to_typed_pipelinestage_enum	2026-04-27 16:39:39 +00:00
dave	cbb0a50729	huskies: merge 649_story_migrate_whatsapp_transport_to_status_broadcaster	2026-04-27 14:19:19 +00:00
dave	6c8043d866	huskies: merge 648_story_migrate_discord_transport_to_status_broadcaster	2026-04-27 14:01:32 +00:00
dave	25603bb8cb	huskies: merge 669_story_migrate_slack_transport_to_status_broadcaster	2026-04-27 11:57:06 +00:00
dave	65d2fb210c	huskies: merge 655_bug_matrix_bot_spawns_its_own_timerstore_instead_of_using_shared_appcontext_timer_store	2026-04-27 11:32:51 +00:00
dave	b340aa97b0	fix: clean up clippy warnings + cargo fmt across post-refactor surface The 13-file refactor pass (commits `db00a5d4` through `eca15b4e`) introduced ~89 clippy errors and 38 cargo fmt issues — every agent in every worktree hit them on script/test, burning their turn budget on cleanup before doing real story work. This is the silent kill behind 644, 652, 655, 664, 667 all hitting watchdog limits this round. Changes: - cargo fmt --all across 37 files (formatting normalisation only) - #![allow(unused_imports, dead_code)] on 24 split modules where the python-script splitter imported liberally to be safe; tighter cleanup per-import will happen as agents touch each module - Removed truly-dead re-exports (cleanup_merge_workspace, slog_warn from http/mcp/mod.rs, CliArgs/print_help from main.rs) - Prefixed _auth_msg in crdt_sync/server.rs (handshake helper return is bound but not consumed) - Converted dangling /// doc block in crdt_sync/mod.rs to //! so it attaches to the module - Removed empty lines after doc comments in 4 spots (clippy lint) All 2636 tests pass; clippy --all-targets -- -D warnings clean.	2026-04-27 01:32:08 +00:00
dave	06035f20ad	fix: restore #[tokio::main] on main(), #[cfg(unix)] on platform tests, #[allow] on run_pty_session/AuthListenerResult The biggest miss is #[tokio::main] — without it, async fn main() doesn't compile, and the binary in every worktree fails 'cargo check'. Agents in those worktrees burn their turn budgets trying to fix the build before they can do real work, then get killed by the watchdog. That's why all three in-flight stories failed. Other restored attributes: - #[cfg(unix)] on 4 tests in merge/squash and scaffold (skip on non-Unix) - #[allow(dead_code)] on AuthListenerResult test enum - #[allow(clippy::too_many_arguments)] on run_pty_session Same root cause as the earlier #[test] attribute losses: my line ranges started at the fn line, missing the leading attribute on the previous line.	2026-04-26 23:38:17 +00:00
dave	0d805313d6	fix: restore #[test] and #[should_panic] attributes on panics_on_duplicate_agent_names Lost in commit `db00a5d4` when extracting tests from main.rs into cli.rs; the line range used for the panics_on_duplicate_agent_names test in main.rs started at the fn signature instead of the attribute line.	2026-04-26 22:01:06 +00:00
dave	db00a5d4b5	refactor: split main.rs by extracting CLI parsing into cli.rs The 1258-line main.rs is split into: - main.rs: mod declarations, async fn main + panics_on_duplicate_agent_names test (894 lines) - cli.rs: CliArgs struct, parse_cli_args, print_help, resolve_path_arg + their tests (372 lines) main.rs cannot itself become a directory (binary crate must have main.rs at the crate root); cli.rs is a sibling module. No behaviour change. All cli tests pass; full suite green.	2026-04-26 21:41:39 +00:00
dave	d8f9be5b23	huskies: merge 641_story_unified_status_update_delivery_across_chat_web_ui_and_top_level_agent_context	2026-04-26 02:27:34 +00:00
dave	dc7ae3a23c	huskies: merge 637_story_peer_mesh_discovery_via_crdt_node_presence_list	2026-04-26 01:57:31 +00:00
dave	b84ce1f6bb	huskies: merge 636_story_full_crdt_snapshot_compaction_with_cross_node_coordination	2026-04-26 01:19:05 +00:00
dave	7548486a53	huskies: merge 633_story_crdt_sync_bearer_token_connection_auth	2026-04-25 22:13:42 +00:00
dave	2a3f88fdcf	huskies: merge 639_refactor_migrate_whatsapp_transport_to_services_bundle	2026-04-25 19:51:59 +00:00
dave	e4dd4bbe2c	huskies: merge 638_refactor_migrate_discord_transport_to_services_bundle	2026-04-25 19:33:01 +00:00
dave	33cb2bed3e	huskies: merge 627_refactor_migrate_slack_discord_and_whatsapp_transports_to_services_bundle	2026-04-25 19:01:45 +00:00
dave	4b089c1ed8	huskies: merge 626_refactor_introduce_services_bundle_and_migrate_appcontext_matrix_transport	2026-04-25 15:08:46 +00:00
dave	aeff0b55be	huskies: merge 628_story_websocket_connect_time_mutual_auth_using_node_identity_primitives	2026-04-25 14:33:47 +00:00
dave	9e3d2f6a69	huskies: merge 602_spike_node_identity_keypair_foundation_for_distributed_huskies	2026-04-25 14:03:59 +00:00
dave	e20083a283	huskies: merge 624_bug_agent_turn_and_budget_limits_not_enforced_coder_1_ran_5_6x_over_max_turns	2026-04-25 13:11:30 +00:00
dave	271f8ea6a8	huskies: merge 616_story_extract_notifications_service	2026-04-24 18:05:42 +00:00
dave	eca0ef792c	huskies: merge 615_story_extract_timer_service	2026-04-24 17:43:53 +00:00
dave	2f07365745	huskies: merge 604_story_service_module_conventions_and_first_extraction	2026-04-24 13:45:22 +00:00
dave	3521649cbf	huskies: merge 599_story_cross_project_status_notifications_in_chat	2026-04-23 12:09:35 +00:00
Timmy	45f1096b96	Gateway bot: proxy commands to active project instead of reading local state In gateway mode the bot has no local CRDT or project filesystem, so all bot commands (status, backlog, start, assign, etc.) returned empty or broken results. Now the gateway bot proxies non-local commands via HTTP to the active project's /api/bot/command endpoint, which already exists on every project server. Only a small set of gateway-local commands (help, ambient, reset, switch) are still handled directly by the gateway. Everything else is forwarded automatically, so new commands added in the future will work through the proxy without additional gateway changes. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-21 11:47:06 +01:00

1 2 3

149 Commits