- Replace every model-boss/coordinator reference with the @prospector/ai-harness
story (direct vLLM client + classify/draft/judge/orchestrate task registry +
on-demand GPU lifecycle + CoT-workflow runner + cost meter) across ai-first-v4,
draft-engine, model-eval-pipeline, and PROSPECTOR.md; GPU_INFERENCE_URL is the
canonical inference contract. Note ai-harness is promotable to a shared @ct
package (onlyfans carries a parallel src/engine/classifier.ts).
- Fix migration collision: ai-first-v4 actor-attribution renumbered 0007 -> 0016
(0007_tasks.sql exists; tree at 0013).
- Add the three missing pieces from the plan: a formal DRAFT runner mode distinct
from PAUSE + DRAFT->GO graduation (new control-modes.md); a runtime per-draft
alignment gate (deterministic facts/policy + GPU judge; spec_conflict/
policy_conflict holds) in draft-engine's pipeline; and the facts/mission config
schema (src/specs/, 0014_specs.sql) in ai-system-plan §5 + draft-engine.
- Index control-modes.md and the ai-harness rename in features/README.md.
Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
black homelan is gone; prod target is the DO backend droplet (lilith-store-backend,
209.38.51.98 / wg 10.9.0.5) where mac-sync-server already runs. Fix black:2546x
DB-host refs in comments/migrations. GPU is on-demand + queue-driven: hold warm
while backlog is deep, release on idle grace (not strictly per-tick).
Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
Document the draft engine direction: OSS uncensored models on DO GPU droplets
(reuse LPv2 provisioning, no model-boss), engine id 'do-gpu-<model>_<build>', and
pastebin → CoT workflow builder (versioned reasoning chains the model runs;
pastebin canon as injected context; corrections as per-build tuning data). Rename
the MVP static-render engine value 'pastebin' -> 'template' (pastebin is now
context, not the engine).
Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>