story-kit: queue 257_story_rename_storkit_to_story_kit_in_header for merge

story-kit: create 256_story_bot_must_verify_other_users_cross_signing_identity_before_checking_device_verification
2026-03-17 12:14:53 +00:00 · 2026-03-17 11:51:29 +00:00 · 2026-03-17 11:46:28 +00:00 · 2026-03-17 11:45:29 +00:00 · 2026-03-17 11:43:23 +00:00 · 2026-03-17 11:39:28 +00:00
28 changed files with 581 additions and 51 deletions
--- a/.claude/settings.json
+++ b/.claude/settings.json
@@ -59,7 +59,8 @@
      "mcp__story-kit__*",
      "Edit",
      "Write",
-      "Bash(find *)"
+      "Bash(find *)",
      "Bash(sqlite3 *)"
    ]
  }
 }
--- a/.gitignore
+++ b/.gitignore
@@ -13,6 +13,7 @@ store.json
 # Matrix SDK state store
 .story_kit/matrix_store/
 .story_kit/matrix_device_id
 # Agent worktrees and merge workspace (managed by the server, not tracked in git)
 .story_kit/worktrees/
--- a/.story_kit/work/1_upcoming/250_bug_merge_pipeline_cherry_pick_fails_with_bad_revision_on_merge_queue_branch.md
+++ b/.story_kit/work/1_upcoming/250_bug_merge_pipeline_cherry_pick_fails_with_bad_revision_on_merge_queue_branch.md
--- a/.story_kit/work/1_upcoming/256_story_bot_must_verify_other_users_cross_signing_identity_before_checking_device_verification.md
+++ b/.story_kit/work/1_upcoming/256_story_bot_must_verify_other_users_cross_signing_identity_before_checking_device_verification.md
@@ -0,0 +1,20 @@
 ---
 name: "Bot must verify other users' cross-signing identity before checking device verification"
 ---
 # Story 256: Bot must verify other users' cross-signing identity before checking device verification
 ## User Story
 As a Matrix user messaging the bot, I want the bot to correctly recognize my cross-signing-verified devices, so that my messages are not rejected when I have a valid verified identity.
 ## Acceptance Criteria
 - [ ] The bot's `check_sender_verified` function (or equivalent) verifies the sender's identity trust status, not just individual device verification
 - [ ] When @yossarian:crashlabs.io (who has valid cross-signing keys) sends a message in an encrypted room, the bot accepts it instead of rejecting with 'no cross-signing-verified device found'
 - [ ] The bot still rejects messages from users who genuinely have no cross-signing setup
 - [ ] Existing tests (if any) continue to pass after the change
 ## Out of Scope
 - TBD
--- a/.story_kit/work/4_merge/257_story_rename_storkit_to_story_kit_in_header.md
+++ b/.story_kit/work/4_merge/257_story_rename_storkit_to_story_kit_in_header.md
@@ -0,0 +1,15 @@
 ---
 name: "Rename StorkIt to Story Kit in the header"
 ---
 # Story 257: Rename "StorkIt" to "Story Kit" in the header
 ## Description
 The ChatHeader component displays "StorkIt" as the app title. It should say "Story Kit" instead.
 ## Acceptance Criteria
 - [ ] The header in `ChatHeader.tsx` displays "Story Kit" instead of "StorkIt"
 - [ ] The test in `ChatHeader.test.tsx` is updated to match
 - [ ] All existing tests pass
--- a/.story_kit/work/5_done/245_bug_chat_history_persistence_lost_on_page_refresh_story_145_regression.md
+++ b/.story_kit/work/5_done/245_bug_chat_history_persistence_lost_on_page_refresh_story_145_regression.md
--- a/.story_kit/work/6_archived/240_story_btw_side_question_slash_command.md
+++ b/.story_kit/work/6_archived/240_story_btw_side_question_slash_command.md
--- a/.story_kit/work/6_archived/242_story_status_slash_command.md
+++ b/.story_kit/work/6_archived/242_story_status_slash_command.md
--- a/.story_kit/work/6_archived/246_story_enforce_cryptographic_identity_verification_for_matrix_commands.md
+++ b/.story_kit/work/6_archived/246_story_enforce_cryptographic_identity_verification_for_matrix_commands.md
--- a/.story_kit/work/6_archived/249_story_agent_assignment_via_story_front_matter.md
+++ b/.story_kit/work/6_archived/249_story_agent_assignment_via_story_front_matter.md
--- a/.story_kit/work/6_archived/251_bug_archive_sweep_not_moving_stories_from_done_to_archived.md
+++ b/.story_kit/work/6_archived/251_bug_archive_sweep_not_moving_stories_from_done_to_archived.md
--- a/.story_kit/work/6_archived/252_story_coder_agents_must_find_root_causes_for_bugs.md
+++ b/.story_kit/work/6_archived/252_story_coder_agents_must_find_root_causes_for_bugs.md
--- a/.story_kit/work/6_archived/253_bug_watcher_and_auto_assign_do_not_reinitialize_when_project_root_changes.md
+++ b/.story_kit/work/6_archived/253_bug_watcher_and_auto_assign_do_not_reinitialize_when_project_root_changes.md
--- a/.story_kit/work/6_archived/254_story_add_refactor_work_item_type.md
+++ b/.story_kit/work/6_archived/254_story_add_refactor_work_item_type.md
@@ -1,5 +1,6 @@
 ---
 name: "Add refactor work item type"
 merge_failure: "merge_agent_work tool returned empty output on two attempts. The merge-queue branch (merge-queue/254_story_add_refactor_work_item_type) was created with squash merge commit 27d24b2, and the merge workspace worktree exists at .story_kit/merge_workspace, but the pipeline never completed (no success/failure logged after MERGE-DEBUG calls). The stale merge workspace worktree may be blocking completion. Possibly related to bug 250 (merge pipeline cherry-pick fails with bad revision on merge-queue branch). Human intervention needed to: 1) clean up the merge-queue worktree and branch, 2) investigate why the merge pipeline hangs after creating the squash merge commit, 3) retry the merge."
 ---
 # Story 254: Add refactor work item type
--- a/.story_kit/work/6_archived/255_story_show_agent_logs_in_expanded_story_popup.md
+++ b/.story_kit/work/6_archived/255_story_show_agent_logs_in_expanded_story_popup.md
@@ -0,0 +1,16 @@
 ---
 name: "Show agent logs in expanded story popup"
 merge_failure: "merge_agent_work tool returned empty output. The merge pipeline created the merge-queue branch (merge-queue/255_story_show_agent_logs_in_expanded_story_popup) and merge workspace worktree at .story_kit/merge_workspace, but hung without completing. This is the same issue that affected story 254 — likely related to bug 250 (merge pipeline cherry-pick fails with bad revision on merge-queue branch). The stale merge workspace worktree on the merge-queue branch may be blocking completion. Human intervention needed to: 1) clean up the merge workspace worktree and merge-queue branch, 2) investigate the root cause in the merge pipeline (possibly the cherry-pick/fast-forward step after squash merge), 3) retry the merge."
 ---
 # Story 255: Show agent logs in expanded story popup
 ## Description
 The expanded story popup has an "Agent Logs" tab that currently shows "No output". Implement the frontend and any necessary API wiring to display agent output in this tab. This is new functionality — agent logs have never been shown here before.
 ## Acceptance Criteria
 - [ ] Agent Logs tab shows real-time output from running agents
 - [ ] Agent Logs tab shows historical output from completed/failed agents
 - [ ] Logs are associated with the correct story
--- a/Cargo.lock
+++ b/Cargo.lock
@@ -4028,6 +4028,7 @@ dependencies = [
 "tokio-tungstenite 0.28.0",
 "toml 1.0.6+spec-1.1.0",
 "uuid",
 "wait-timeout",
 "walkdir",
 ]
@@ -4771,6 +4772,15 @@ dependencies = [
 "memchr",
 ]
 [[package]]
 name = "wait-timeout"
 version = "0.2.1"
 source = "registry+https://github.com/rust-lang/crates.io-index"
 checksum = "09ac3b126d3914f9849036f826e054cbabdc8519970b8998ddaf3b5bd3c65f11"
 dependencies = [
 "libc",
 ]
 [[package]]
 name = "walkdir"
 version = "2.5.0"
--- a/README.md
+++ b/README.md
@@ -13,7 +13,7 @@ cd frontend
 npm install
 npm run dev
-# Run the server (serves embedded frontend/dist/)
+# In another terminal - run the server (serves embedded frontend/dist/)
 cargo run
 ```
--- a/frontend/src/api/agents.ts
+++ b/frontend/src/api/agents.ts
@@ -108,6 +108,14 @@ export const agentsApi = {
 			baseUrl,
 		);
 	},
 	getAgentOutput(storyId: string, agentName: string, baseUrl?: string) {
 		return requestJson<{ output: string }>(
 			`/agents/${encodeURIComponent(storyId)}/${encodeURIComponent(agentName)}/output`,
 			{},
 			baseUrl,
 		);
 	},
 };
 /**
--- a/frontend/src/components/Chat.test.tsx
+++ b/frontend/src/components/Chat.test.tsx
@@ -529,6 +529,57 @@ describe("Chat localStorage persistence (Story 145)", () => {
 		confirmSpy.mockRestore();
 	});
 	it("Bug 245: messages survive unmount/remount cycle (page refresh)", async () => {
 		// Step 1: Render Chat and populate messages via WebSocket onUpdate
 		const { unmount } = render(
 			<Chat projectPath={PROJECT_PATH} onCloseProject={vi.fn()} />,
 		);
 		await waitFor(() => expect(capturedWsHandlers).not.toBeNull());
 		const history: Message[] = [
 			{ role: "user", content: "Persist me across refresh" },
 			{ role: "assistant", content: "I should survive a reload" },
 		];
 		act(() => {
 			capturedWsHandlers?.onUpdate(history);
 		});
 		// Verify messages are persisted to localStorage
 		expect(localStorage.getItem(STORAGE_KEY)).not.toBeNull();
 		const storedBefore = JSON.parse(localStorage.getItem(STORAGE_KEY) ?? "[]");
 		expect(storedBefore).toEqual(history);
 		// Step 2: Unmount the Chat component (simulates page unload)
 		unmount();
 		// Verify localStorage was NOT cleared by unmount
 		expect(localStorage.getItem(STORAGE_KEY)).not.toBeNull();
 		const storedAfterUnmount = JSON.parse(
 			localStorage.getItem(STORAGE_KEY) ?? "[]",
 		);
 		expect(storedAfterUnmount).toEqual(history);
 		// Step 3: Remount the Chat component (simulates page reload)
 		capturedWsHandlers = null;
 		render(<Chat projectPath={PROJECT_PATH} onCloseProject={vi.fn()} />);
 		// Verify messages are restored from localStorage
 		expect(
 			await screen.findByText("Persist me across refresh"),
 		).toBeInTheDocument();
 		expect(
 			await screen.findByText("I should survive a reload"),
 		).toBeInTheDocument();
 		// Verify localStorage still has the messages
 		const storedAfterRemount = JSON.parse(
 			localStorage.getItem(STORAGE_KEY) ?? "[]",
 		);
 		expect(storedAfterRemount).toEqual(history);
 	});
 	it("AC5: uses project-scoped storage key", async () => {
 		const otherKey = "storykit-chat-history:/other/project";
 		localStorage.setItem(
--- a/frontend/src/components/StagePanel.tsx
+++ b/frontend/src/components/StagePanel.tsx
@@ -4,12 +4,13 @@ import { useLozengeFly } from "./LozengeFlyContext";
 const { useLayoutEffect, useRef } = React;
-type WorkItemType = "story" | "bug" | "spike" | "unknown";
+type WorkItemType = "story" | "bug" | "spike" | "refactor" | "unknown";
 const TYPE_COLORS: Record<WorkItemType, string> = {
 	story: "#3fb950",
 	bug: "#f85149",
 	spike: "#58a6ff",
 	refactor: "#a371f7",
 	unknown: "#444",
 };
@@ -17,6 +18,7 @@ const TYPE_LABELS: Record<WorkItemType, string | null> = {
 	story: "STORY",
 	bug: "BUG",
 	spike: "SPIKE",
 	refactor: "REFACTOR",
 	unknown: null,
 };
@@ -24,7 +26,12 @@ function getWorkItemType(storyId: string): WorkItemType {
 	const match = storyId.match(/^\d+_([a-z]+)_/);
 	if (!match) return "unknown";
 	const segment = match[1];
-	if (segment === "story" || segment === "bug" || segment === "spike") {
+	if (
 		segment === "story" ||
 		segment === "bug" ||
 		segment === "spike" ||
 		segment === "refactor"
 	) {
 		return segment;
 	}
 	return "unknown";
--- a/script/test
+++ b/script/test
@@ -11,5 +11,7 @@ echo "=== Running frontend unit tests ==="
 cd "$PROJECT_ROOT/frontend"
 npm test
-echo "=== Running e2e tests ==="
+# Disabled: e2e tests may be causing merge pipeline hangs (no running server
-npm run test:e2e
+# in merge workspace → Playwright blocks indefinitely). Re-enable once confirmed.
 # echo "=== Running e2e tests ==="
 # npm run test:e2e
--- a/server/Cargo.toml
+++ b/server/Cargo.toml
@@ -33,6 +33,7 @@ pulldown-cmark = { workspace = true }
 # Force bundled SQLite so static musl builds don't need a system libsqlite3
 libsqlite3-sys = { version = "0.35.0", features = ["bundled"] }
 wait-timeout = "0.2.1"
 [dev-dependencies]
 tempfile = { workspace = true }
--- a/server/src/agents/gates.rs
+++ b/server/src/agents/gates.rs
@@ -1,5 +1,10 @@
 use std::path::Path;
 use std::process::Command;
 use std::time::Duration;
 use wait_timeout::ChildExt;
 /// Maximum time any single test command is allowed to run before being killed.
 const TEST_TIMEOUT: Duration = Duration::from_secs(600); // 10 minutes
 /// Detect whether the base branch in a worktree is `master` or `main`.
 /// Falls back to `"master"` if neither is found.
@@ -65,48 +70,20 @@ pub(crate) fn run_project_tests(path: &Path) -> Result<(bool, String), String> {
    let script_test = path.join("script").join("test");
    if script_test.exists() {
        let mut output = String::from("=== script/test ===\n");
-        let result = Command::new(&script_test)
+        let (success, out) = run_command_with_timeout(&script_test, &[], path)?;
            .current_dir(path)
            .output()
            .map_err(|e| format!("Failed to run script/test: {e}"))?;
        let out = format!(
            "{}{}",
            String::from_utf8_lossy(&result.stdout),
            String::from_utf8_lossy(&result.stderr)
        );
        output.push_str(&out);
        output.push('\n');
-        return Ok((result.status.success(), output));
+        return Ok((success, output));
    }
    // Fallback: cargo nextest run / cargo test
    let mut output = String::from("=== tests ===\n");
-    let (success, test_out) = match Command::new("cargo")
+    let (success, test_out) = match run_command_with_timeout("cargo", &["nextest", "run"], path) {
-        .args(["nextest", "run"])
+        Ok(result) => result,
        .current_dir(path)
        .output()
    {
        Ok(o) => {
            let combined = format!(
                "{}{}",
                String::from_utf8_lossy(&o.stdout),
                String::from_utf8_lossy(&o.stderr)
            );
            (o.status.success(), combined)
        }
        Err(_) => {
            // nextest not available — fall back to cargo test
-            let o = Command::new("cargo")
+            run_command_with_timeout("cargo", &["test"], path)
-                .args(["test"])
+                .map_err(|e| format!("Failed to run cargo test: {e}"))?
                .current_dir(path)
                .output()
                .map_err(|e| format!("Failed to run cargo test: {e}"))?;
            let combined = format!(
                "{}{}",
                String::from_utf8_lossy(&o.stdout),
                String::from_utf8_lossy(&o.stderr)
            );
            (o.status.success(), combined)
        }
    };
    output.push_str(&test_out);
@@ -114,6 +91,49 @@ pub(crate) fn run_project_tests(path: &Path) -> Result<(bool, String), String> {
    Ok((success, output))
 }
 /// Run a command with a timeout. Returns `(success, combined_output)`.
 /// Kills the child process if it exceeds `TEST_TIMEOUT`.
 fn run_command_with_timeout(
    program: impl AsRef<std::ffi::OsStr>,
    args: &[&str],
    dir: &Path,
 ) -> Result<(bool, String), String> {
    let mut child = Command::new(program)
        .args(args)
        .current_dir(dir)
        .stdout(std::process::Stdio::piped())
        .stderr(std::process::Stdio::piped())
        .spawn()
        .map_err(|e| format!("Failed to spawn command: {e}"))?;
    match child.wait_timeout(TEST_TIMEOUT) {
        Ok(Some(status)) => {
            // Process exited within the timeout — collect output.
            let stdout = child.stdout.take().map(|mut r| {
                let mut s = String::new();
                std::io::Read::read_to_string(&mut r, &mut s).ok();
                s
            }).unwrap_or_default();
            let stderr = child.stderr.take().map(|mut r| {
                let mut s = String::new();
                std::io::Read::read_to_string(&mut r, &mut s).ok();
                s
            }).unwrap_or_default();
            Ok((status.success(), format!("{stdout}{stderr}")))
        }
        Ok(None) => {
            // Timed out — kill the child.
            let _ = child.kill();
            let _ = child.wait();
            Err(format!(
                "Command timed out after {} seconds",
                TEST_TIMEOUT.as_secs()
            ))
        }
        Err(e) => Err(format!("Failed to wait for command: {e}")),
    }
 }
 /// Run `cargo clippy` and the project test suite (via `script/test` if present,
 /// otherwise `cargo nextest run` / `cargo test`) in the given directory.
 /// Returns `(gates_passed, combined_output)`.
--- a/server/src/agents/pool.rs
+++ b/server/src/agents/pool.rs
@@ -1377,7 +1377,13 @@ impl AgentPool {
            for story_id in &items {
                // Re-acquire the lock on each iteration to see state changes
                // from previous start_agent calls in the same pass.
-                let (already_assigned, free_agent) = {
+                let preferred_agent =
                    read_story_front_matter_agent(project_root, stage_dir, story_id);
                // Outcome: (already_assigned, chosen_agent, preferred_busy)
                // preferred_busy=true means the story has a specific agent requested but it is
                // currently occupied — the story should wait rather than fall back.
                let (already_assigned, free_agent, preferred_busy) = {
                    let agents = match self.agents.lock() {
                        Ok(a) => a,
                        Err(e) => {
@@ -1386,13 +1392,20 @@ impl AgentPool {
                        }
                    };
                    let assigned = is_story_assigned_for_stage(&config, &agents, story_id, stage);
-                    let free = if assigned {
+                    if assigned {
-                        None
+                        (true, None, false)
                    } else if let Some(ref pref) = preferred_agent {
                        // Story has a front-matter agent preference.
                        if is_agent_free(&agents, pref) {
                            (false, Some(pref.clone()), false)
                        } else {
-                        find_free_agent_for_stage(&config, &agents, stage)
+                            (false, None, true)
-                            .map(|s| s.to_string())
+                        }
-                    };
+                    } else {
-                    (assigned, free)
+                        let free = find_free_agent_for_stage(&config, &agents, stage)
                            .map(|s| s.to_string());
                        (false, free, false)
                    }
                };
                if already_assigned {
@@ -1400,6 +1413,16 @@ impl AgentPool {
                    continue;
                }
                if preferred_busy {
                    // The story requests a specific agent that is currently busy.
                    // Do not fall back to a different agent; let this story wait.
                    slog!(
                        "[auto-assign] Preferred agent '{}' busy for '{story_id}'; story will wait.",
                        preferred_agent.as_deref().unwrap_or("?")
                    );
                    continue;
                }
                match free_agent {
                    Some(agent_name) => {
                        slog!(
@@ -1815,6 +1838,29 @@ fn find_active_story_stage(project_root: &Path, story_id: &str) -> Option<&'stat
 /// Scan a work pipeline stage directory and return story IDs, sorted alphabetically.
 /// Returns an empty `Vec` if the directory does not exist.
 /// Read the optional `agent:` field from the front matter of a story file.
 ///
 /// Returns `Some(agent_name)` if the front matter specifies an agent, or `None`
 /// if the field is absent or the file cannot be read / parsed.
 fn read_story_front_matter_agent(project_root: &Path, stage_dir: &str, story_id: &str) -> Option<String> {
    use crate::io::story_metadata::parse_front_matter;
    let path = project_root
        .join(".story_kit")
        .join("work")
        .join(stage_dir)
        .join(format!("{story_id}.md"));
    let contents = std::fs::read_to_string(path).ok()?;
    parse_front_matter(&contents).ok()?.agent
 }
 /// Return `true` if `agent_name` has no active (pending/running) entry in the pool.
 fn is_agent_free(agents: &HashMap<String, StoryAgent>, agent_name: &str) -> bool {
    !agents.values().any(|a| {
        a.agent_name == agent_name
            && matches!(a.status, AgentStatus::Running | AgentStatus::Pending)
    })
 }
 fn scan_stage_items(project_root: &Path, stage_dir: &str) -> Vec<String> {
    let dir = project_root
        .join(".story_kit")
--- a/server/src/http/agents.rs
+++ b/server/src/http/agents.rs
@@ -105,6 +105,12 @@ impl TestResultsResponse {
    }
 }
 /// Response for the agent output endpoint.
 #[derive(Object, Serialize)]
 struct AgentOutputResponse {
    output: String,
 }
 /// Returns true if the story file exists in `work/5_done/` or `work/6_archived/`.
 ///
 /// Used to exclude agents for already-archived stories from the `list_agents`
@@ -400,6 +406,45 @@ impl AgentsApi {
        ))
    }
    /// Get the historical output text for an agent session.
    ///
    /// Reads the most recent persistent log file for the given story+agent and
    /// returns all `output` events concatenated as a single string. Returns an
    /// empty string if no log file exists yet.
    #[oai(path = "/agents/:story_id/:agent_name/output", method = "get")]
    async fn get_agent_output(
        &self,
        story_id: Path<String>,
        agent_name: Path<String>,
    ) -> OpenApiResult<Json<AgentOutputResponse>> {
        let project_root = self
            .ctx
            .agents
            .get_project_root(&self.ctx.state)
            .map_err(bad_request)?;
        let log_path =
            crate::agent_log::find_latest_log(&project_root, &story_id.0, &agent_name.0);
        let Some(path) = log_path else {
            return Ok(Json(AgentOutputResponse {
                output: String::new(),
            }));
        };
        let entries = crate::agent_log::read_log(&path).map_err(bad_request)?;
        let output: String = entries
            .iter()
            .filter(|e| {
                e.event.get("type").and_then(|t| t.as_str()) == Some("output")
            })
            .filter_map(|e| e.event.get("text").and_then(|t| t.as_str()).map(str::to_owned))
            .collect();
        Ok(Json(AgentOutputResponse { output }))
    }
    /// Remove a git worktree and its feature branch for a story.
    #[oai(path = "/agents/worktrees/:story_id", method = "delete")]
    async fn remove_worktree(&self, story_id: Path<String>) -> OpenApiResult<Json<bool>> {
@@ -835,6 +880,100 @@ allowed_tools = ["Read", "Bash"]
        assert!(result.is_err());
    }
    // --- get_agent_output tests ---
    #[tokio::test]
    async fn get_agent_output_returns_empty_when_no_log_exists() {
        let tmp = TempDir::new().unwrap();
        let ctx = AppContext::new_test(tmp.path().to_path_buf());
        let api = AgentsApi {
            ctx: Arc::new(ctx),
        };
        let result = api
            .get_agent_output(
                Path("42_story_foo".to_string()),
                Path("coder-1".to_string()),
            )
            .await
            .unwrap()
            .0;
        assert_eq!(result.output, "");
    }
    #[tokio::test]
    async fn get_agent_output_returns_concatenated_output_events() {
        use crate::agent_log::AgentLogWriter;
        use crate::agents::AgentEvent;
        let tmp = TempDir::new().unwrap();
        let root = tmp.path();
        let mut writer =
            AgentLogWriter::new(root, "42_story_foo", "coder-1", "sess-test").unwrap();
        writer
            .write_event(&AgentEvent::Status {
                story_id: "42_story_foo".to_string(),
                agent_name: "coder-1".to_string(),
                status: "running".to_string(),
            })
            .unwrap();
        writer
            .write_event(&AgentEvent::Output {
                story_id: "42_story_foo".to_string(),
                agent_name: "coder-1".to_string(),
                text: "Hello ".to_string(),
            })
            .unwrap();
        writer
            .write_event(&AgentEvent::Output {
                story_id: "42_story_foo".to_string(),
                agent_name: "coder-1".to_string(),
                text: "world\n".to_string(),
            })
            .unwrap();
        writer
            .write_event(&AgentEvent::Done {
                story_id: "42_story_foo".to_string(),
                agent_name: "coder-1".to_string(),
                session_id: None,
            })
            .unwrap();
        let ctx = AppContext::new_test(root.to_path_buf());
        let api = AgentsApi {
            ctx: Arc::new(ctx),
        };
        let result = api
            .get_agent_output(
                Path("42_story_foo".to_string()),
                Path("coder-1".to_string()),
            )
            .await
            .unwrap()
            .0;
        // Only output event texts should be concatenated; status and done are excluded.
        assert_eq!(result.output, "Hello world\n");
    }
    #[tokio::test]
    async fn get_agent_output_returns_error_when_no_project_root() {
        let tmp = TempDir::new().unwrap();
        let ctx = AppContext::new_test(tmp.path().to_path_buf());
        *ctx.state.project_root.lock().unwrap() = None;
        let api = AgentsApi {
            ctx: Arc::new(ctx),
        };
        let result = api
            .get_agent_output(
                Path("42_story_foo".to_string()),
                Path("coder-1".to_string()),
            )
            .await;
        assert!(result.is_err());
    }
    // --- create_worktree error path ---
    #[tokio::test]
--- a/server/src/http/mcp.rs
+++ b/server/src/http/mcp.rs
@@ -6,9 +6,9 @@ use crate::slog_warn;
 use crate::http::context::AppContext;
 use crate::http::settings::get_editor_command_from_store;
 use crate::http::workflow::{
-    add_criterion_to_file, check_criterion_in_file, create_bug_file, create_spike_file,
+    add_criterion_to_file, check_criterion_in_file, create_bug_file, create_refactor_file,
-    create_story_file, list_bug_files, load_upcoming_stories, update_story_in_file,
+    create_spike_file, create_story_file, list_bug_files, list_refactor_files,
-    validate_story_dirs,
+    load_upcoming_stories, update_story_in_file, validate_story_dirs,
 };
 use crate::worktree;
 use crate::io::story_metadata::{parse_front_matter, parse_unchecked_todos, write_merge_failure};
@@ -719,6 +719,37 @@ fn handle_tools_list(id: Option<Value>) -> JsonRpcResponse {
                        "properties": {}
                    }
                },
                {
                    "name": "create_refactor",
                    "description": "Create a refactor work item in work/1_upcoming/ with a deterministic filename and YAML front matter. Returns the refactor_id.",
                    "inputSchema": {
                        "type": "object",
                        "properties": {
                            "name": {
                                "type": "string",
                                "description": "Short human-readable refactor name"
                            },
                            "description": {
                                "type": "string",
                                "description": "Optional description of the desired state after refactoring"
                            },
                            "acceptance_criteria": {
                                "type": "array",
                                "items": { "type": "string" },
                                "description": "Optional list of acceptance criteria"
                            }
                        },
                        "required": ["name"]
                    }
                },
                {
                    "name": "list_refactors",
                    "description": "List all open refactors in work/1_upcoming/ matching the _refactor_ naming convention.",
                    "inputSchema": {
                        "type": "object",
                        "properties": {}
                    }
                },
                {
                    "name": "close_bug",
                    "description": "Archive a bug from work/2_current/ or work/1_upcoming/ to work/5_done/ and auto-commit to master.",
@@ -896,6 +927,9 @@ async fn handle_tools_call(
        "create_bug" => tool_create_bug(&args, ctx),
        "list_bugs" => tool_list_bugs(ctx),
        "close_bug" => tool_close_bug(&args, ctx),
        // Refactor lifecycle tools
        "create_refactor" => tool_create_refactor(&args, ctx),
        "list_refactors" => tool_list_refactors(ctx),
        // Mergemaster tools
        "merge_agent_work" => tool_merge_agent_work(&args, ctx).await,
        "move_story_to_merge" => tool_move_story_to_merge(&args, ctx).await,
@@ -1582,6 +1616,39 @@ fn tool_close_bug(args: &Value, ctx: &AppContext) -> Result<String, String> {
    ))
 }
 // ── Refactor lifecycle tool implementations ───────────────────────
 fn tool_create_refactor(args: &Value, ctx: &AppContext) -> Result<String, String> {
    let name = args
        .get("name")
        .and_then(|v| v.as_str())
        .ok_or("Missing required argument: name")?;
    let description = args.get("description").and_then(|v| v.as_str());
    let acceptance_criteria: Option<Vec<String>> = args
        .get("acceptance_criteria")
        .and_then(|v| serde_json::from_value(v.clone()).ok());
    let root = ctx.state.get_project_root()?;
    let refactor_id = create_refactor_file(
        &root,
        name,
        description,
        acceptance_criteria.as_deref(),
    )?;
    Ok(format!("Created refactor: {refactor_id}"))
 }
 fn tool_list_refactors(ctx: &AppContext) -> Result<String, String> {
    let root = ctx.state.get_project_root()?;
    let refactors = list_refactor_files(&root)?;
    serde_json::to_string_pretty(&json!(refactors
        .iter()
        .map(|(id, name)| json!({ "refactor_id": id, "name": name }))
        .collect::<Vec<_>>()))
    .map_err(|e| format!("Serialization error: {e}"))
 }
 // ── Mergemaster tool implementations ─────────────────────────────
 async fn tool_merge_agent_work(args: &Value, ctx: &AppContext) -> Result<String, String> {
@@ -2077,13 +2144,15 @@ mod tests {
        assert!(names.contains(&"create_bug"));
        assert!(names.contains(&"list_bugs"));
        assert!(names.contains(&"close_bug"));
        assert!(names.contains(&"create_refactor"));
        assert!(names.contains(&"list_refactors"));
        assert!(names.contains(&"merge_agent_work"));
        assert!(names.contains(&"move_story_to_merge"));
        assert!(names.contains(&"report_merge_failure"));
        assert!(names.contains(&"request_qa"));
        assert!(names.contains(&"get_server_logs"));
        assert!(names.contains(&"prompt_permission"));
-        assert_eq!(tools.len(), 31);
+        assert_eq!(tools.len(), 33);
    }
    #[test]
--- a/server/src/http/workflow.rs
+++ b/server/src/http/workflow.rs
@@ -338,6 +338,73 @@ pub fn create_spike_file(
    Ok(spike_id)
 }
 /// Create a refactor work item file in `work/1_upcoming/`.
 ///
 /// Returns the refactor_id (e.g. `"5_refactor_split_agents_rs"`).
 pub fn create_refactor_file(
    root: &Path,
    name: &str,
    description: Option<&str>,
    acceptance_criteria: Option<&[String]>,
 ) -> Result<String, String> {
    let refactor_number = next_item_number(root)?;
    let slug = slugify_name(name);
    if slug.is_empty() {
        return Err("Name must contain at least one alphanumeric character.".to_string());
    }
    let filename = format!("{refactor_number}_refactor_{slug}.md");
    let upcoming_dir = root.join(".story_kit").join("work").join("1_upcoming");
    fs::create_dir_all(&upcoming_dir)
        .map_err(|e| format!("Failed to create upcoming directory: {e}"))?;
    let filepath = upcoming_dir.join(&filename);
    if filepath.exists() {
        return Err(format!("Refactor file already exists: {filename}"));
    }
    let refactor_id = filepath
        .file_stem()
        .and_then(|s| s.to_str())
        .unwrap_or_default()
        .to_string();
    let mut content = String::new();
    content.push_str("---\n");
    content.push_str(&format!("name: \"{}\"\n", name.replace('"', "\\\"")));
    content.push_str("---\n\n");
    content.push_str(&format!("# Refactor {refactor_number}: {name}\n\n"));
    content.push_str("## Current State\n\n");
    content.push_str("- TBD\n\n");
    content.push_str("## Desired State\n\n");
    if let Some(desc) = description {
        content.push_str(desc);
        content.push('\n');
    } else {
        content.push_str("- TBD\n");
    }
    content.push('\n');
    content.push_str("## Acceptance Criteria\n\n");
    if let Some(criteria) = acceptance_criteria {
        for criterion in criteria {
            content.push_str(&format!("- [ ] {criterion}\n"));
        }
    } else {
        content.push_str("- [ ] Refactoring complete and all tests pass\n");
    }
    content.push('\n');
    content.push_str("## Out of Scope\n\n");
    content.push_str("- TBD\n");
    fs::write(&filepath, &content)
        .map_err(|e| format!("Failed to write refactor file: {e}"))?;
    // Watcher handles the git commit asynchronously.
    Ok(refactor_id)
 }
 /// Returns true if the item stem (filename without extension) is a bug item.
 /// Bug items follow the pattern: {N}_bug_{slug}
 fn is_bug_item(stem: &str) -> bool {
@@ -403,6 +470,59 @@ pub fn list_bug_files(root: &Path) -> Result<Vec<(String, String)>, String> {
    Ok(bugs)
 }
 /// Returns true if the item stem (filename without extension) is a refactor item.
 /// Refactor items follow the pattern: {N}_refactor_{slug}
 fn is_refactor_item(stem: &str) -> bool {
    let after_num = stem.trim_start_matches(|c: char| c.is_ascii_digit());
    after_num.starts_with("_refactor_")
 }
 /// List all open refactors — files in `work/1_upcoming/` matching the `_refactor_` naming pattern.
 ///
 /// Returns a sorted list of `(refactor_id, name)` pairs.
 pub fn list_refactor_files(root: &Path) -> Result<Vec<(String, String)>, String> {
    let upcoming_dir = root.join(".story_kit").join("work").join("1_upcoming");
    if !upcoming_dir.exists() {
        return Ok(Vec::new());
    }
    let mut refactors = Vec::new();
    for entry in fs::read_dir(&upcoming_dir)
        .map_err(|e| format!("Failed to read upcoming directory: {e}"))?
    {
        let entry = entry.map_err(|e| format!("Failed to read entry: {e}"))?;
        let path = entry.path();
        if path.is_dir() {
            continue;
        }
        if path.extension().and_then(|ext| ext.to_str()) != Some("md") {
            continue;
        }
        let stem = path
            .file_stem()
            .and_then(|s| s.to_str())
            .ok_or_else(|| "Invalid file name.".to_string())?;
        if !is_refactor_item(stem) {
            continue;
        }
        let refactor_id = stem.to_string();
        let name = fs::read_to_string(&path)
            .ok()
            .and_then(|contents| parse_front_matter(&contents).ok())
            .and_then(|m| m.name)
            .unwrap_or_else(|| refactor_id.clone());
        refactors.push((refactor_id, name));
    }
    refactors.sort_by(|a, b| a.0.cmp(&b.0));
    Ok(refactors)
 }
 /// Locate a work item file by searching all active pipeline stages.
 ///
 /// Searches in priority order: 2_current, 1_upcoming, 3_qa, 4_merge, 5_done, 6_archived.
--- a/server/src/io/story_metadata.rs
+++ b/server/src/io/story_metadata.rs
@@ -7,6 +7,7 @@ pub struct StoryMetadata {
    pub name: Option<String>,
    pub coverage_baseline: Option<String>,
    pub merge_failure: Option<String>,
    pub agent: Option<String>,
 }
 #[derive(Debug, Clone, PartialEq, Eq)]
@@ -29,6 +30,7 @@ struct FrontMatter {
    name: Option<String>,
    coverage_baseline: Option<String>,
    merge_failure: Option<String>,
    agent: Option<String>,
 }
 pub fn parse_front_matter(contents: &str) -> Result<StoryMetadata, StoryMetaError> {
@@ -61,6 +63,7 @@ fn build_metadata(front: FrontMatter) -> StoryMetadata {
        name: front.name,
        coverage_baseline: front.coverage_baseline,
        merge_failure: front.merge_failure,
        agent: front.agent,
    }
 }
Author	SHA1	Message	Date
Dave	a85d1a1170	story-kit: queue 257_story_rename_storkit_to_story_kit_in_header for merge	2026-03-17 12:14:53 +00:00
Dave	afc1ab5e0e	story-kit: queue 257_story_rename_storkit_to_story_kit_in_header for merge	2026-03-17 11:51:29 +00:00
Dave	32b6439f2f	story-kit: create 256_story_bot_must_verify_other_users_cross_signing_identity_before_checking_device_verification	2026-03-17 11:46:28 +00:00
Dave	85e56e0ea8	story-kit: queue 257_story_rename_storkit_to_story_kit_in_header for merge	2026-03-17 11:45:29 +00:00
Dave	b63fa6be4f	story-kit: queue 256_story_bot_must_verify_other_users_cross_signing_identity_before_checking_device_verification for merge	2026-03-17 11:43:23 +00:00
Dave	f012311303	story-kit: queue 257_story_rename_storkit_to_story_kit_in_header for merge	2026-03-17 11:39:28 +00:00
Dave	af0aa007ca	story-kit: queue 257_story_rename_storkit_to_story_kit_in_header for merge	2026-03-17 11:36:05 +00:00
Dave	b2aec94d4c	story-kit: queue 256_story_bot_must_verify_other_users_cross_signing_identity_before_checking_device_verification for QA	2026-03-17 11:35:03 +00:00
Dave	2ac550008a	story-kit: start 256_story_bot_must_verify_other_users_cross_signing_identity_before_checking_device_verification	2026-03-17 11:32:49 +00:00
Dave	ebbbfed1d9	Add 10-minute timeout to test commands and disable e2e in merge pipeline Test commands in run_project_tests now use wait-timeout to enforce a 600-second ceiling, preventing hung processes (e.g. Playwright with no server) from blocking the merge pipeline indefinitely. Also disables e2e tests in script/test until the merge workspace can run them safely. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-17 11:32:44 +00:00
Dave	fd6ef83f76	story-kit: queue 257_story_rename_storkit_to_story_kit_in_header for QA	2026-03-17 11:29:57 +00:00
Dave	473461b65d	story-kit: start 257_story_rename_storkit_to_story_kit_in_header	2026-03-17 11:24:55 +00:00
Dave	dc8d639d02	story-kit: create 256_story_bot_must_verify_other_users_cross_signing_identity_before_checking_device_verification	2026-03-17 11:24:20 +00:00
Dave	594fc500cf	story-kit: create 250_bug_merge_pipeline_cherry_pick_fails_with_bad_revision_on_merge_queue_branch	2026-03-17 11:24:15 +00:00
Dave	5448a99759	story-kit: done 245_bug_chat_history_persistence_lost_on_page_refresh_story_145_regression	2026-03-17 11:24:09 +00:00
Dave	f5524b3ae1	story-kit: accept 255_story_show_agent_logs_in_expanded_story_popup	2026-03-17 11:23:36 +00:00
Dave	4585537dd8	story-kit: create 250_bug_merge_pipeline_cherry_pick_fails_with_bad_revision_on_merge_queue_branch	2026-03-17 11:23:19 +00:00
Dave	57911fd9e7	story-kit: start 256_story_bot_must_verify_other_users_cross_signing_identity_before_checking_device_verification	2026-03-17 11:23:11 +00:00
Dave	b6f5169b56	story-kit: start 250_bug_merge_pipeline_cherry_pick_fails_with_bad_revision_on_merge_queue_branch	2026-03-17 11:23:07 +00:00
Dave	a4b99c68da	story-kit: done 255_story_show_agent_logs_in_expanded_story_popup	2026-03-17 11:23:04 +00:00
Dave	85062c338f	story-kit: done 254_story_add_refactor_work_item_type	2026-03-17 11:23:01 +00:00
Dave	a7f3d283ec	story-kit: done 245_bug_chat_history_persistence_lost_on_page_refresh_story_145_regression	2026-03-17 11:22:57 +00:00
Dave	6cc9d1bde9	story-kit: queue 245_bug_chat_history_persistence_lost_on_page_refresh_story_145_regression for merge	2026-03-17 11:14:16 +00:00
Dave	a82fa37730	story-kit: accept 253_bug_watcher_and_auto_assign_do_not_reinitialize_when_project_root_changes	2026-03-17 03:26:24 +00:00
Dave	06ceab3e22	story-kit: merge 245_bug_chat_history_persistence_lost_on_page_refresh_story_145_regression	2026-03-17 01:02:34 +00:00
Dave	58438f3ab6	story-kit: queue 245_bug_chat_history_persistence_lost_on_page_refresh_story_145_regression for merge	2026-03-17 01:02:04 +00:00
Dave	59bb7dbc3a	story-kit: queue 245_bug_chat_history_persistence_lost_on_page_refresh_story_145_regression for merge	2026-03-17 00:59:38 +00:00
Dave	9c2471fbcc	story-kit: queue 250_bug_merge_pipeline_cherry_pick_fails_with_bad_revision_on_merge_queue_branch for merge	2026-03-17 00:52:34 +00:00
Dave	f383d0cb4f	story-kit: queue 245_bug_chat_history_persistence_lost_on_page_refresh_story_145_regression for QA	2026-03-17 00:52:02 +00:00
Dave	be61803af0	story-kit: merge 255_story_show_agent_logs_in_expanded_story_popup	2026-03-17 00:49:43 +00:00
Dave	c132d4f5c0	story-kit: queue 255_story_show_agent_logs_in_expanded_story_popup for merge	2026-03-17 00:49:20 +00:00
Dave	263ba440dc	story-kit: queue 255_story_show_agent_logs_in_expanded_story_popup for merge	2026-03-17 00:46:33 +00:00
Dave	2fae9066e2	story-kit: merge 254_story_add_refactor_work_item_type	2026-03-17 00:40:37 +00:00
Dave	3553f59078	story-kit: queue 254_story_add_refactor_work_item_type for merge	2026-03-17 00:39:55 +00:00
Dave	78ea96d0a9	story-kit: queue 255_story_show_agent_logs_in_expanded_story_popup for QA	2026-03-17 00:36:40 +00:00
Dave	79d3eccc46	story-kit: queue 254_story_add_refactor_work_item_type for merge	2026-03-17 00:34:56 +00:00
Dave	c21a087399	story-kit: queue 250_bug_merge_pipeline_cherry_pick_fails_with_bad_revision_on_merge_queue_branch for merge	2026-03-17 00:30:47 +00:00
Dave	67942d466c	story-kit: done 253_bug_watcher_and_auto_assign_do_not_reinitialize_when_project_root_changes	2026-03-17 00:30:26 +00:00
Dave	1d6a4fa8c6	story-kit: queue 254_story_add_refactor_work_item_type for QA	2026-03-17 00:29:53 +00:00
Dave	250f3ff819	story-kit: accept 242_story_status_slash_command	2026-03-17 00:26:51 +00:00
Dave	a02ea3c292	story-kit: queue 253_bug_watcher_and_auto_assign_do_not_reinitialize_when_project_root_changes for merge	2026-03-17 00:25:50 +00:00
Dave	bbc5d9c90c	story-kit: queue 256_story_bot_must_verify_other_users_cross_signing_identity_before_checking_device_verification for merge	2026-03-17 00:21:03 +00:00
Dave	24f6a5c7cc	story-kit: queue 256_story_bot_must_verify_other_users_cross_signing_identity_before_checking_device_verification for merge	2026-03-17 00:13:44 +00:00
Dave	ab3420fa90	story-kit: queue 250_bug_merge_pipeline_cherry_pick_fails_with_bad_revision_on_merge_queue_branch for merge	2026-03-17 00:13:12 +00:00
Dave	4c6228abee	story-kit: queue 253_bug_watcher_and_auto_assign_do_not_reinitialize_when_project_root_changes for QA	2026-03-17 00:12:25 +00:00
Dave	6df28d5393	story-kit: queue 250_bug_merge_pipeline_cherry_pick_fails_with_bad_revision_on_merge_queue_branch for merge	2026-03-17 00:04:39 +00:00
Dave	2ad59ba155	story-kit: accept 249_story_agent_assignment_via_story_front_matter	2026-03-17 00:04:13 +00:00
Dave	319fc3823a	story-kit: queue 250_bug_merge_pipeline_cherry_pick_fails_with_bad_revision_on_merge_queue_branch for merge	2026-03-17 00:04:12 +00:00
Dave	b9f3449021	story-kit: support agent assignment via story front matter (story 249) Adds an optional `agent:` field to story file front matter so that a specific agent can be requested for a story. The auto-assign loop now: 1. Reads the front-matter `agent` field for each story before picking a free agent. 2. If a preferred agent is named, uses it when free; skips the story (without falling back) when that agent is busy. 3. Falls back to the existing `find_free_agent_for_stage` behaviour when no preference is specified. Ported from feature branch that predated the agents.rs module refactoring. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-17 00:03:49 +00:00
Dave	cd7444ac5c	story-kit: done 249_story_agent_assignment_via_story_front_matter	2026-03-17 00:03:21 +00:00
Dave	f5d9c98e74	story-kit: queue 256_story_bot_must_verify_other_users_cross_signing_identity_before_checking_device_verification for QA	2026-03-16 23:59:22 +00:00
Dave	7cd19e248c	story-kit: start 245_bug_chat_history_persistence_lost_on_page_refresh_story_145_regression	2026-03-16 23:57:48 +00:00
Dave	ec5024a089	story-kit: start 255_story_show_agent_logs_in_expanded_story_popup	2026-03-16 23:57:05 +00:00
Dave	9041cd1d16	story-kit: start 256_story_bot_must_verify_other_users_cross_signing_identity_before_checking_device_verification	2026-03-16 23:53:53 +00:00
Dave	0a0624795c	story-kit: create 256_story_bot_must_verify_other_users_cross_signing_identity_before_checking_device_verification	2026-03-16 23:53:26 +00:00
Dave	d8d0d7936c	story-kit: queue 253_bug_watcher_and_auto_assign_do_not_reinitialize_when_project_root_changes for QA	2026-03-16 23:46:47 +00:00
Dave	55ea8e6aaf	story-kit: accept 251_bug_archive_sweep_not_moving_stories_from_done_to_archived	2026-03-16 23:44:13 +00:00
Dave	1598d2a453	story-kit: done 251_bug_archive_sweep_not_moving_stories_from_done_to_archived	2026-03-16 23:44:09 +00:00
Dave	0120de5f00	story-kit: queue 249_story_agent_assignment_via_story_front_matter for QA	2026-03-16 23:44:00 +00:00
Dave	21835bc37d	story-kit: queue 254_story_add_refactor_work_item_type for QA	2026-03-16 23:42:04 +00:00
Dave	f01fa6c527	story-kit: start 253_bug_watcher_and_auto_assign_do_not_reinitialize_when_project_root_changes	2026-03-16 23:41:05 +00:00
Dave	a51488a0ce	story-kit: create 253_bug_watcher_and_auto_assign_do_not_reinitialize_when_project_root_changes	2026-03-16 23:40:02 +00:00
Dave	9054ac013e	story-kit: accept 252_story_coder_agents_must_find_root_causes_for_bugs	2026-03-16 23:38:24 +00:00
Dave	95eea3a624	story-kit: start 254_story_add_refactor_work_item_type	2026-03-16 23:35:18 +00:00
Dave	6b9390b243	story-kit: start 255_story_show_agent_logs_in_expanded_story_popup	2026-03-16 23:34:24 +00:00
Dave	3ed9b7a185	story-kit: create 255_story_show_agent_logs_in_expanded_story_popup	2026-03-16 23:29:20 +00:00
Dave	bd7426131f	story-kit: create 255_story_show_agent_logs_in_expanded_story_popup	2026-03-16 23:29:05 +00:00
Dave	e0132a7807	story-kit: create 255_bug_agent_logs_tab_shows_no_output_in_expanded_story_popup	2026-03-16 23:28:24 +00:00
Dave	b829783a84	story-kit: queue 253_bug_watcher_and_auto_assign_do_not_reinitialize_when_project_root_changes for QA	2026-03-16 23:27:28 +00:00