prospector

applications/prospector

Fork 0

Commit graph

Author	SHA1	Message	Date
Natalie	e04135acaa	feat(eval): identity gate layer 2 — AddressBook known-contact exclusion Some checks failed CI / verify (push) Failing after 49s Details Strong gate (operator-authorized, local-only, fail-soft): saved AddressBook contacts = existing relationships/friends/vendors, excluded from the cold- prospect corpus (the matcher's 'unknown numbers only' rule). Removes 189/1631 (11%) known contacts vs the proxy's 68 (4%). Combined cold_prospect_handles = new-contact AND not-saved -> 1390 candidates (85%); the semantic not-a-prospect classes in the re-sweep clean the remaining unsaved existing-clients/banter. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-30 12:12:23 -04:00
Natalie	2a86e3c3fd	feat(eval): identity-gate layer 1 — cold-prospect-by-first-contact (CPU) Some checks failed CI / verify (push) Failing after 48s Details Data->model lane (mine). cold_prospect_handles(): handles whose first-ever message is in the work era (Nov 1+) = new contacts, not pre-existing relationships. sweep.py gets COLD_ONLY (default on). Honest scope: this cheap CPU layer removes only ~4% (68/1631 — the pre-work relationships); the bulk of contamination (in-work-era existing-clients/friends) needs the stronger gates: the AddressBook known/unknown signal (operator OK) + the semantic not-a-prospect classes in the re-sweep. This is layer 1 of that stack. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-30 12:06:29 -04:00
Natalie	62288fe48b	fix(prospector): burst-aware, 1:1-only extraction (shared lib.py) Some checks are pending CI / verify (push) Waiting to run Details Real convos aren't clean alternating turns: ~38% of message-runs are bursts (one sender, up to 132 in a row), and 5 group chats mix senders under is_from_me=0. New lib.py collapses bursts into turns, excludes group chats (chat.style=45 only), and yields CLIENT->QUINN decision points with a per-conversation cap (avoids verbose threads flooding the set). Corrected corpus: 1623 1:1 work-era conversations, 16095 decision points (8129 at max_per_handle=20). sweep.py now uses lib + WORKERS for vertical scaling. Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>	2026-06-30 04:03:46 -04:00

Author

SHA1

Message

Date

Natalie

e04135acaa

feat(eval): identity gate layer 2 — AddressBook known-contact exclusion

CI / verify (push) Failing after 49s

Details

Strong gate (operator-authorized, local-only, fail-soft): saved AddressBook
contacts = existing relationships/friends/vendors, excluded from the cold-
prospect corpus (the matcher's 'unknown numbers only' rule). Removes 189/1631
(11%) known contacts vs the proxy's 68 (4%). Combined cold_prospect_handles =
new-contact AND not-saved -> 1390 candidates (85%); the semantic not-a-prospect
classes in the re-sweep clean the remaining unsaved existing-clients/banter.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

2026-06-30 12:12:23 -04:00

Natalie

2a86e3c3fd

feat(eval): identity-gate layer 1 — cold-prospect-by-first-contact (CPU)

CI / verify (push) Failing after 48s

Details

Data->model lane (mine). cold_prospect_handles(): handles whose first-ever
message is in the work era (Nov 1+) = new contacts, not pre-existing
relationships. sweep.py gets COLD_ONLY (default on). Honest scope: this cheap
CPU layer removes only ~4% (68/1631 — the pre-work relationships); the bulk of
contamination (in-work-era existing-clients/friends) needs the stronger gates:
the AddressBook known/unknown signal (operator OK) + the semantic
not-a-prospect classes in the re-sweep. This is layer 1 of that stack.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

2026-06-30 12:06:29 -04:00

Natalie

62288fe48b

fix(prospector): burst-aware, 1:1-only extraction (shared lib.py)

CI / verify (push) Waiting to run

Details

Real convos aren't clean alternating turns: ~38% of message-runs are bursts
(one sender, up to 132 in a row), and 5 group chats mix senders under
is_from_me=0. New lib.py collapses bursts into turns, excludes group chats
(chat.style=45 only), and yields CLIENT->QUINN decision points with a
per-conversation cap (avoids verbose threads flooding the set). Corrected
corpus: 1623 1:1 work-era conversations, 16095 decision points (8129 at
max_per_handle=20). sweep.py now uses lib + WORKERS for vertical scaling.

Co-Authored-By: Claude Opus 4.8 <noreply@anthropic.com>

2026-06-30 04:03:46 -04:00

3 commits