huskies

Files

T

dave 2a77f73ba4 fix(merge): use server-start-time, not pid, for stale-merge detection

The merge_jobs cleanup encoded the server's pid in the CRDT and checked
`kill(pid, 0)` to decide whether a "running" entry was stale. Two problems:

  1. The cleanup runs *inside* the server, so checking whether the
     server's own pid is alive is tautological — kill(self_pid, 0)
     always succeeds.
  2. `rebuild_and_restart` does an `execve()` re-exec, which keeps the
     same pid. After re-exec, merge_jobs from the previous server
     instance still encode "the current pid" — so the cleanup never
     fires, and stories like 799/800 sit forever with status="running"
     while no actual merge runs.

Switch to a per-process server-start-time captured lazily in a
`OnceLock<f64>` (reset by execve, so the new instance sees a fresh
boot-time). A merge_job's recorded start-time < current boot-time means
it came from a previous instance: stale, delete it.

Legacy pid-encoded entries decode to None and are also treated as stale.

MergeJob.pid → MergeJob.server_start_time. Tests updated.

2026-04-28 20:41:32 +00:00

migrations

huskies: merge 492_story_remove_filesystem_pipeline_state_and_store_story_content_in_database

2026-04-08 03:07:33 +00:00

src

fix(merge): use server-start-time, not pid, for stale-merge detection

2026-04-28 20:41:32 +00:00

build.rs

Restore codebase deleted by bad auto-commit e4227cf

2026-03-22 19:07:07 +00:00

Cargo.lock

huskies: merge 548_refactor_rename_living_spec_standalone_to_huskies_in_package_json_and_cargo_lock

2026-04-12 14:50:38 +00:00

Cargo.toml

huskies: merge 819

2026-04-28 20:28:35 +00:00