huskies

Author	SHA1	Message	Date
dave	8ae6ca3eb8	fix: make run_tests block server-side instead of requiring agent polling run_tests now spawns the child and blocks in a 1-second poll loop until tests complete or the 20-minute timeout fires. Returns the full result in a single MCP call — agents use 1 turn instead of 50+. Child process is properly killed on timeout (no zombies). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-11 23:07:02 +00:00
dave	bac07d28a7	fix: increase run_tests MCP timeout to 20 minutes to match acceptance gates Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-11 22:43:31 +00:00
dave	fc89be2f55	fix: server-side 20s blocking in get_test_result to prevent agent poll spam Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-11 22:29:38 +00:00
dave	1f66183c8e	fix: update scaffold settings template to match locked-down agent permissions	2026-04-11 22:03:53 +00:00
dave	f958f57e56	fix: async run_tests to prevent zombie cargo processes blocking gates run_tests MCP tool now spawns tests in the background and returns immediately. Agents poll get_test_result to check completion. This prevents zombie cargo processes from holding the build lock when the CLI times out the MCP call before tests finish. Also fixes agent permission mode: acceptEdits replaces invalid allowFullAutoEdit that was causing agents to crash-loop on spawn. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-11 22:00:05 +00:00
dave	8393a67c89	fix: log git hash on build success and startup to verify which commit is running Writes HEAD short hash to .huskies/build_hash after successful cargo build. Logs it on startup as [startup] Running build: <hash>. No more guessing whether the rebuild actually deployed. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-11 20:50:15 +00:00
dave	e32300d1f8	fix: switch agent permission mode from bypassPermissions to allowFullAutoEdit bypassPermissions ignored the worktree's .claude/settings.json entirely, letting agents run any Bash command including cargo test (which they'd spawn 4+ times concurrently, deadlocking on the build directory lock). allowFullAutoEdit respects the settings.json allowlist, so agents can only use the Bash commands we explicitly permit (cargo check, cargo build, git) and must use MCP tools for everything else (run_tests, run_lint, run_build). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-11 20:23:22 +00:00
dave	32e36bbc4b	fix: remove cargo test/clippy/npm from agent Bash permissions Agents were running cargo test directly via Bash instead of using the run_tests MCP tool, causing 4 concurrent cargo builds that deadlocked on the build directory lock. Removed cargo test, cargo clippy, cargo nextest, script/test, npm test, and pnpm test from the allowed Bash commands. Agents must use the run_tests MCP tool which returns truncated output and prevents concurrent builds. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-11 19:50:52 +00:00
dave	d06241c20c	fix: merge_agent_work blocks until complete instead of requiring polling The mergemaster agent was burning all 30 turns polling get_merge_status every 2 seconds while the merge pipeline takes ~2 minutes. It would exhaust turns, exit, restart, and repeat — never seeing the result. merge_agent_work now blocks with a 10-second internal poll loop and returns the final result directly. The agent calls it once and gets the answer. No more polling turns wasted. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-11 17:43:50 +00:00
dave	599fbdc71d	huskies: merge 539_bug_crdt_event_bridge_still_writes_filesystem_shadow_files_after_530_eliminated_filesystem_state	2026-04-11 17:04:36 +00:00
dave	6998275331	huskies: merge 540_bug_get_agent_output_mcp_tool_returns_no_agent_for_exited_agents_instead_of_reading_session_logs_from_disk	2026-04-11 16:33:58 +00:00
dave	48ea612739	fix: remove startup CRDT stage sync — it fights the done→archived sweep The sync_crdt_stages_from_db migration reads pipeline_items (which has stale 5_done stages) and overwrites the CRDT back to 5_done for stories that were already swept to 6_archived. On every restart, done stories reappear and get re-swept. The migration served its purpose — CRDT stages are now correct. Remove it. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-11 13:50:07 +00:00
dave	17d635b66b	fix: restore CRDT-based triage command (535 fix was reverted by merge conflict) Story 535's triage fix was overwritten by a subsequent merge that resolved a conflict by taking the old filesystem-based version. Re-applies the CRDT-based triage that reads from pipeline state and content store, works for any pipeline stage. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-11 13:43:26 +00:00
dave	4ab723f40b	huskies: merge 538_bug_done_archived_sweep_never_fires_because_stage_done_projection_uses_utc_now_instead_of_real_merged_at_timestamp	2026-04-11 13:29:38 +00:00
dave	5d193bb568	huskies: merge 537_bug_delete_item_sets_stage_to_deleted_string_instead_of_writing_a_crdt_tombstone	2026-04-11 13:25:45 +00:00
dave	dcf6cf8f82	fix: collapse consecutive str::replace calls to satisfy clippy	2026-04-11 13:21:47 +00:00
dave	eea54ca616	fix: thread-local CRDT and content store for test isolation Tests shared a global CRDT singleton and content store HashMap, causing flaky failures when parallel tests wrote items that polluted each other's assertions. 3-5 random test failures per run. Both CRDT_STATE and CONTENT_STORE now use thread_local! in test mode so each test thread gets its own isolated instance. Production code is unchanged — it still uses the global OnceLock singletons. Also fixed 3 tests (create_story_writes_correct_content, next_item_number_increments_from_existing_bugs, next_item_number_scans_archived_too) that relied on leaked state from other tests — they now write to the content store explicitly. Result: 1902 passed, 0 failed across 5 consecutive runs. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-11 13:02:09 +00:00
dave	5696d77922	debug: add PTY spawn diagnostics for Session: None investigation When an agent CLI exits without creating a session, we now log: - Number of prior sessions and total session log bytes - Child process exit status (exit code or signal) - Explicit SESSION NONE warning with context This will help diagnose whether the fatal runtime error (output.write assertion) correlates with accumulated sessions, budget exhaustion, or something else. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-11 11:21:06 +00:00
dave	44ef477a01	fix: skip rate limit timer for short blocks (≤10 min) — CLI handles internally The rate limit auto-scheduler was creating timers for every hard block, including short 5-minute throttles. This caused a death loop: agent hits rate limit, timer set, agent exits, pipeline restarts before timer fires, new agent dies instantly (Session: None) because API is still throttled. Short rate limits are handled naturally by the CLI's internal wait. Only schedule timers for long session-level blocks (>10 min) where the CLI will exit and needs external restart. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-11 10:52:14 +00:00
dave	fc24da82ae	debug: add logging to sync_crdt_stages_from_db to diagnose stale backlog	2026-04-10 20:33:04 +00:00
dave	bae3619723	fix: startup migration syncs stale CRDT stages from pipeline_items DB 510 stories had stale 1_backlog stages in the CRDT because they were imported during the filesystem→CRDT migration and then moved forward via filesystem-only moves that never wrote CRDT ops. This made done stories appear as ghost entries in the backlog. On startup, reads the authoritative stage from pipeline_items and corrects any CRDT entries that disagree. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-10 19:58:17 +00:00
dave	ea36160667	fix: read_all_items must use deduplicated index, not raw CRDT entries read_all_items was iterating all CRDT entries including stale duplicates from earlier stage writes. A story written multiple times (backlog → current → done) would appear in the output multiple times with different stages, causing ghost entries in the pipeline status and backlog views. Now iterates only the index (story_id → visible_index map) which represents the latest-wins deduplicated view of each story. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-10 19:32:55 +00:00
dave	2e0ed98d42	huskies: merge 480_story_cryptographic_node_auth_for_distributed_mesh	2026-04-10 19:14:21 +00:00
dave	40893a8cb1	huskies: merge 535_bug_chat_status_number_and_mcp_tool_status_still_read_from_filesystem_broken_after_530	2026-04-10 19:01:31 +00:00
dave	bc2b1e244c	huskies: merge 498_bug_stale_merge_job_lock_prevents_new_merges_after_agent_dies	2026-04-10 18:55:05 +00:00
dave	6f7a0c7708	huskies: merge 479_story_build_agent_mode_with_crdt_based_work_claiming	2026-04-10 18:50:30 +00:00
dave	91be0ac47f	huskies: merge 534_refactor_unify_timer_tick_watchdog_and_watcher_sweep_into_a_single_1_second_tick_loop	2026-04-10 17:38:42 +00:00
dave	808935b446	huskies: merge 528_story_crdt_based_peer_discovery_via_node_presence_entries	2026-04-10 17:03:05 +00:00
dave	4c8fe910a7	huskies: merge 533_story_crdt_based_done_archived_sweep_to_replace_filesystem_based_watcher_sweep	2026-04-10 16:58:50 +00:00
dave	8f34c521fb	huskies: merge 508_story_configurable_rendezvous_peer_in_project_toml_with_outbound_crdt_sync_connect	2026-04-10 16:44:50 +00:00
dave	a59f4fc1a5	huskies: merge 532_story_remove_startup_reconcile_pass_and_drift_notification_no_filesystem_to_reconcile_against	2026-04-10 16:40:56 +00:00
dave	b88857c2e4	huskies: merge 507_story_apply_inbound_signedops_with_causal_order_queue_for_partition_recovery	2026-04-10 16:13:07 +00:00
dave	1ca9bc1bfd	huskies: merge 506_story_websocket_sync_endpoint_that_broadcasts_local_signedops_to_connected_peers	2026-04-10 15:52:49 +00:00
dave	73890c98fa	huskies: merge 505_story_signedop_wire_codec_for_crdt_sync_between_nodes	2026-04-10 15:35:10 +00:00
dave	bfede09fe6	huskies: merge 529_bug_stale_mergemaster_advance_moves_done_stories_back_to_merge_zombie_merge_loop	2026-04-10 15:20:34 +00:00
dave	11d19d8902	huskies: merge 530_story_eliminate_filesystem_markdown_shadows_entirely_crdt_db_is_the_only_story_store	2026-04-10 14:59:58 +00:00
dave	1dd675796b	huskies: merge 531_story_mcp_tool_to_read_agent_session_logs_from_disk_not_just_live_stream	2026-04-10 13:08:51 +00:00
dave	31388da609	huskies: merge 517_story_remove_filesystem_shadow_fallback_paths_from_lifecycle_rs_finish_the_migration_to_crdt_only	2026-04-10 13:00:25 +00:00
dave	fe405e81c6	huskies: merge 527_story_remove_rate_limit_hard_block_bot_notifications_from_matrix_chat	2026-04-10 11:27:36 +00:00
dave	2a24a4cc85	huskies: merge 522_story_migrate_status_command_pipeline_view_from_filesystem_to_pipeline_state_read_all_typed	2026-04-10 10:37:17 +00:00
dave	6310c8bf49	huskies: merge 518_story_apply_and_persist_should_log_when_persist_tx_send_fails_instead_of_silently_dropping_the_op	2026-04-10 10:33:01 +00:00
dave	61ae30873f	huskies: merge 516_story_update_story_description_should_create_the_description_section_if_it_doesn_t_exist_instead_of_erroring	2026-04-10 10:28:53 +00:00
dave	f015fe5a1d	huskies: merge 515_story_add_a_debug_mcp_tool_to_dump_the_in_memory_crdt_state_for_inspection	2026-04-10 10:24:30 +00:00
dave	c6b6be872b	huskies: merge 509_bug_create_story_silently_drops_description_and_any_other_unknown_parameters_with_no_error	2026-04-10 10:20:13 +00:00
dave	5377eeae5b	huskies: merge 513_story_startup_reconcile_pass_that_detects_drift_between_crdt_pipeline_items_and_filesystem_shadows	2026-04-10 10:16:45 +00:00
Timmy	92b212e7fd	huskies: merge 504_story_update_story_front_matter_mcp_schema_should_accept_non_string_values_lists_bools_numbers Squash merge of story 504: add MCP regression tests for non-string front_matter values (arrays, bools, integers). The schema change itself was already on master. Fixed the array assertion to match YAML's space-after-comma inline sequence format. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-10 11:08:21 +01:00
Timmy	9633ab35a6	fix: validate_story_dirs reads filesystem shadows instead of global CRDT singleton (bug 525) The post-520 migration changed validate_story_dirs to read from pipeline_state::read_all_typed() (the process-global CRDT singleton), ignoring its root: &Path argument. This broke test isolation — tests creating a tempdir saw dozens of results from ambient CRDT state, causing non-deterministic failures that blocked every mergemaster gate. Remove the CRDT singleton block and rely on the filesystem shadow scan that already uses the root argument correctly. 1845/1845 tests pass. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>	2026-04-10 10:52:42 +01:00
dave	d1b845fd2e	fix: move_item must not overwrite advanced CRDT stage when missing_ok=true (bug 524) When a story is found in the CRDT but not in the expected source stages, and missing_ok is true, return Ok(None) instead of proceeding with the move. This prevents promote_ready_backlog_stories from demoting a story that has already advanced to merge/done via a stale filesystem shadow in 1_backlog. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>	2026-04-10 00:21:39 +00:00
Timmy	962e3d4e7d	fmt	2026-04-10 01:04:09 +01:00
dave	0de9200d48	huskies: merge 512_story_migrate_chat_commands_from_filesystem_lookup_to_crdt_db	2026-04-09 23:03:58 +00:00
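
Note on 8ae6ca3eb8: that commit describes run_tests blocking server-side in a 1-second poll loop and killing the child on timeout. The sketch below is one possible shape of such a loop using only the Rust standard library. It is not taken from the huskies source; `wait_with_timeout` and `WaitOutcome` are hypothetical names chosen for illustration, and the timeout and poll interval are left as parameters.

```rust
use std::io;
use std::process::{Child, ExitStatus};
use std::thread;
use std::time::{Duration, Instant};

/// Outcome of waiting on a child process with a deadline (hypothetical type).
#[derive(Debug)]
pub enum WaitOutcome {
    /// The child exited on its own before the deadline.
    Exited(ExitStatus),
    /// The deadline passed; the child was killed and reaped.
    TimedOut,
}

/// Block until `child` exits or `timeout` elapses, checking every `poll_interval`.
/// Hypothetical helper: on timeout the child is killed and then waited on,
/// so no zombie process is left behind.
pub fn wait_with_timeout(
    child: &mut Child,
    timeout: Duration,
    poll_interval: Duration,
) -> io::Result<WaitOutcome> {
    let deadline = Instant::now() + timeout;
    loop {
        // try_wait returns Ok(None) while the child is still running.
        if let Some(status) = child.try_wait()? {
            return Ok(WaitOutcome::Exited(status));
        }
        if Instant::now() >= deadline {
            // Ignore a kill error: the child may have exited in the meantime.
            let _ = child.kill();
            // Reap the process so it does not linger as a zombie.
            child.wait()?;
            return Ok(WaitOutcome::TimedOut);
        }
        thread::sleep(poll_interval);
    }
}
```

With the figures from the commit message, a caller would pass a 20-minute timeout and a 1-second poll interval, then return the collected result in the same call instead of handing the agent a job to poll.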

737 Commits