ImageApi

Author	SHA1	Message	Date
Cameron Cordes	675b4a4849	faces: add .env.example template covering all documented env vars The face-recognition plan and CLAUDE.md document the full env-var surface (face detection knobs, Apollo / Ollama / OpenRouter / SMS integrations, watch intervals, RAG flags), but no example file existed — operators copying the project to a new deploy had nothing to start from. Group by section, comment out optional integrations so a minimal copy boots without external services. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-30 13:51:45 +00:00
Cameron Cordes	5e1bad3179	faces: filter videos out of detection candidate set The backlog drain pulls every hashed image_exif row, which includes videos. Sending them to Apollo just produces 422 decode_failed → status='failed' markers, burning a round-trip per video and inflating the FAILED stat. Widen filter_excluded to also drop anything is_image_file rejects. Covers both call sites (file-watch hook and per-tick backlog drain) without plumbing a second filter through. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-30 12:45:55 +00:00
Cameron Cordes	1971eeccd6	faces: drain backfill + detection backlog every tick, not just full scans Symptom: ImageApi restart, then ~60 minutes of silence — no face_watch lines at all. Cause: backfill + face-detection candidate build were both gated inside process_new_files, which during quick scans (every 60s) only walks files modified in the last interval. The pre-existing unhashed / unscanned backlog never entered the candidate set, so it only drained on the full-scan path (default once per hour). Surfaced as "scan stuck at 1101/13118" — most of those rows were waiting on the next full scan. Two new per-tick passes that work directly off the DB: (1) backfill_unhashed_backlog uses ExifDao::get_rows_missing_hash to pull unhashed rows in id order, capped (FACE_HASH_BACKFILL_MAX_PER_TICK default 2000), and writes content_hash for each. No filesystem walk — the walk was the gating filter that hid the backlog. (2) process_face_backlog uses a new FaceDao::list_unscanned_candidates (LEFT-anti-join on content_hash via raw SQL, GROUP BY hash so duplicates fire one detect call) to pull a capped batch of hashed-but-unscanned rows (FACE_BACKLOG_MAX_PER_TICK default 64) and runs the existing face_watch detection pipeline on them. Both run only when face_client.is_enabled(). The cap on (2) is small because each candidate is a real Apollo round-trip — 64/tick at 60s quick interval ≈ 64 detections/min, which paces an 8-core CPU inference comfortably while keeping a steady flow visible in logs. process_new_files's own backfill stays in place for the same-tick flow (a brand-new upload gets hashed AND face-scanned in the tick where it's discovered) but is now belt-and-suspenders. Test backstop pinning the new DAO method's filter contract: only hashed, unscanned, in-library rows are returned; scanned rows, unhashed rows, and other-library rows are filtered out.	2026-04-30 01:46:49 +00:00
Cameron Cordes	c2c1fe5b8b	faces: bbox crop respects EXIF orientation + pads enough for RetinaFace Two reasons manually-drawn bboxes were never resolving a face on re-detection: (1) The bbox arrives in display space (browser already applied EXIF orientation when rendering the carousel), but the `image` crate in crop_image_to_bbox opens raw pre-rotation pixels. For any phone photo with Orientation 6/8/etc., applying the bbox without rotating first crops a completely different region of the image — landing on background, hair, or empty pixels. Now reads the EXIF Orientation tag and applies it before indexing into the canonical-oriented dims. (2) Padding was 10 % on each side. A typical 200×250 face bbox + 10 % becomes ~240×300; insightface resizes that to det_size=640, so the face fills ~95 % of the input. RetinaFace's anchors expect faces at 20–60 % of input dimensions; at 95 % it routinely returns zero detections. Bumped to 50 % padding so the crop is 2× the bbox dims and the face occupies ~50 % of the input — anchor-friendly. Bbox is still clamped to image bounds, so edge-of-image cases just get less padding on the clipped side. Together these explain why bbox-edit re-embed practically always fell into the "no face detected" branch (and bbox-edit reverts without the recent soft-fallback commit). Per-photo embedding quality also improves slightly — same face, more context, better landmarks for ArcFace.	2026-04-30 01:06:08 +00:00
Cameron Cordes	5a2f406429	faces: bbox edits survive when re-detection finds no face Moving a tagged bbox off-center (to fine-tune position, or onto a back-of-head the operator already manually tagged) made update_face_handler 422 because the re-embed step ran detection on the new crop and found nothing. Frontend's catch then reverted the optimistic update — visible as the bbox snapping back the moment the user released their drag. The re-embed is a soft contract: a fresh ArcFace vector is preferable, but the operator's bbox edit is sacred. Now: - empty faces[] → keep old embedding, apply the bbox, log info - permanent embed error → keep old embedding, apply the bbox, log info - bad-bytes embedding → keep old embedding, apply the bbox, log warn - transient failure (cuda_oom, engine unavailable) still 503s so the operator can retry — those are recoverable and we don't want to silently drift cluster math on retries that succeed later Cost: a slightly stale embedding for the row, which marginally affects clustering / auto-bind cosine for files re-detected against this person. Accepted because dropping the user's manual drag every time the new crop happens to lose detection is a much worse UX — especially for the force-create rows (back of head, profile) where re-detection will always fail.	2026-04-30 01:01:07 +00:00
Cameron Cordes	6a6a4a6a46	tags: batch lookup expands content-hash siblings cross-library The first cut matched by rel_path only — fine for single-library deploys but wrong for multi-library setups where the same content lives under different rel_paths (e.g. a backup mount holding copies of the primary library). A tag applied under library A would silently not appear in the library-B grid badge even though the carousel's per-path /image/tags would resolve it correctly via siblings. The batch handler now does the expansion server-side in three queries regardless of input size: 1. image_exif batch lookup → query path → content_hash 2. image_exif JOIN by content_hash → all sibling rel_paths sharing each hash (paths are deduped across libraries) 3. tagged_photo + tags JOIN over the union of (query + sibling) rel_paths Tags are then aggregated back to query paths via a sibling→originals reverse map, deduped by tag id. Files without a content_hash (just indexed, hash compute pending, etc.) skip step 2 and only get tags from their own rel_path — same fallback the per-path handler uses. Adds ExifDao::get_rel_paths_for_hashes (batch counterpart of get_rel_paths_by_hash) chunked at 500 to stay under SQLite's SQLITE_LIMIT_VARIABLE_NUMBER. Five queries for a 4k-photo grid is still ~800x cheaper than per-path HTTP fan-out.	2026-04-30 00:36:44 +00:00
Cameron Cordes	3112260dc8	tags: batch lookup endpoint to collapse photo-match fan-out Apollo's photo-match enrichment fanned out one ``GET /image/tags?path=`` per record (bounded concurrency 20) — for a 4k-photo time window that meant ~4000 round-trips, each briefly contending the tag-dao mutex. The cost dwarfed the actual SQL. Add a single ``POST /image/tags/lookup`` body ``{paths: [...]}`` returning ``{path: [tag, ...]}`` with only paths that have at least one tag. SqliteTagDao gains ``get_tags_grouped_by_paths`` which JOINs tagged_photo + tags and chunks the IN clause at 500 (safely under SQLite's variable limit). Five queries for a 4k-photo grid is ~800x cheaper than 4k HTTP calls. Trade-off: the batch matches by rel_path directly and does not do the cross-library content-hash sibling expansion that the per-path ``GET /image/tags`` does. For Apollo's grid that's accepted as deliberate — single-library deploys see no difference, multi-library deploys with rel_path-divergent siblings might miss a tag in the grid badge but the carousel still resolves full sibling tags via the per-path endpoint when opened. If sibling sharing in the grid becomes load-bearing, extend the handler to JOIN image_exif on content_hash.	2026-04-30 00:28:33 +00:00
Cameron Cordes	16abacf4c5	faces: backfill no longer stalls on chronic-error files at the front The content-hash backfill capped at 500/tick AND counted errors against that cap. So a pocket of files that errored every time (vanished mid-scan, permission denied, unreadable) at the head of the exif_records iteration order burned the entire budget every tick and the rest of the backlog never advanced — surfacing as a face-scan stuck at e.g. 44% with no progress. Without a content_hash, those photos never become face-detection candidates, so it looks like detection is broken when really it's the prerequisite hash that isn't filling. Two fixes: - Cap on successes only. Errors still get counted and logged but don't burn the per-tick budget; the loop keeps moving past them to the working files behind. Errors are bounded by the unhashed backlog size (each record walked at most once per tick), so this can't run away. - Always log the unhashed backlog count when non-zero. Previously "stuck at 44%" looked silent from the outside; now every tick surfaces "backfilled N/M; K still need backfill" so an operator can tell backfill is making progress (or isn't). Also bumps the default cap from 500 to 2000. Hashing is cheap (blake3 + one DB UPDATE), and 500 was conservative for a personal-scale library where 10k+ unhashed files is a normal first-run state.	2026-04-30 00:03:26 +00:00
Cameron Cordes	891a9982ef	faces: force-create path for regions the detector can't see Adds an opt-in 'force' flag to POST /image/faces. When set, the handler skips the Apollo embed call entirely and stores the row with a 2048-byte zero-vector embedding under the sentinel model_version 'manual_no_embed'. The row participates as a browse-by-person tag but is excluded from clustering and auto-bind: - face_clustering._decode_b64_embedding filters norm<=0 (already) - cluster suggester groups by model_version, so the sentinel never mixes with real buffalo_l rows - cosine_similarity with a zero vector resolves to 0/NaN, never crossing the 0.4 auto-bind threshold Use case: tag someone looking away from the camera, profile shot, heavily-occluded face — anywhere the detector returns no_face_in_crop on the user's drawn region. The frontend only sets force=true after a 422 from a strict create plus an explicit operator confirmation, so the normal "draw a centered face" UX still gets a real ArcFace embedding.	2026-04-29 23:49:34 +00:00
Cameron Cordes	0eaf27d2d3	faces: cover hydrate_face_with_person — assigned + unassigned branches Two unit tests pinning the response shape that PATCH/POST /image/faces relies on. They use the existing in-memory SQLite harness and exercise the helper directly: - assigned: person_name resolves through the persons join and bbox / source / person_id round-trip cleanly. - unassigned: person_name is None (not stale, not omitted), person_id is None. These would have caught the prior regression — when the handlers returned a bare FaceDetectionRow, person_name was structurally absent from the response shape. A test that asserts person_name is populated when person_id is set forces the join (or any equivalent) to exist. A dangling-person_id case isn't covered: the FK on face_detections makes that state structurally impossible at rest (ON DELETE SET NULL zeroes the column when a person is removed), so there's nothing to defend against.	2026-04-29 23:41:52 +00:00
Cameron Cordes	0c2f421a1f	faces: PATCH/POST /image/faces returns person_name with the row Both create_face_handler and update_face_handler returned the bare FaceDetectionRow, so PATCH /image/faces/{id} (used by both bbox edits and person assignment) replied without person_name. The carousel overlay does an optimistic replace on this row — replacing the joined FaceWithPerson with a row that has person_name = undefined visibly dropped the VFD label off the bbox after every save. Add a small hydrate_face_with_person helper that does the persons lookup and assembles a FaceWithPerson, used by both handlers. The list endpoint already does the join, so the PATCH/POST shape now matches it.	2026-04-29 23:38:24 +00:00
Cameron Cordes	43cb60d3ad	faces: re-embed on bbox edit instead of leaving the embedding stale Phase 2 stored the new bbox on PATCH /image/faces/{id} but logged "embedding now stale (Phase 3 will re-embed)" and moved on. That left the embedding column pointing at the old face area while the bbox described a new one — auto-bind cosine similarity and the cluster suggester would silently rank the row as "the same face it was before the edit" forever after, even though the geometry no longer matched. Now: when the PATCH includes a bbox, the handler: 1. Looks up the row to find its photo (library_id + rel_path). 2. Crops the new bbox region with the same crop_image_to_bbox helper manual-create uses (10% pad on each side so the detector has ear/jaw context). 3. POSTs the crop to face_client.embed for a fresh ArcFace vector. 4. Stores both the new bbox AND the new embedding in one update_face transaction. Errors map cleanly: - face_client disabled → 503 (bbox edit needs Apollo). - decode failure / no face in crop → 422. - Apollo CUDA OOM / unavailable → 503 transient. - Underlying row missing → 404. About 100-500ms per edit on CPU, dominated by Apollo's inference call. Acceptable for a manual operator action; the alternative (stale embedding) silently broke the rest of the face stack. Prerequisite for the upcoming carousel-side draw/resize bbox UI — without re-embed, every operator-driven bbox tweak would corrode the clustering/auto-bind quality. ApiPatchFaceBody on Apollo's side already passes bbox through verbatim, so no Apollo change needed. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-29 23:10:25 +00:00
Cameron Cordes	7303fb8aa3	faces: ignore/junk bucket — DB schema + lazy-create endpoint A single global "Ignored" person row, marked is_ignored=true, that the frontend lazily creates on first use to hold strangers, false detections, and faces the user doesn't want bound to a real person. Schema (new migration 2026-04-29-000200_add_is_ignored): - persons.is_ignored BOOLEAN NOT NULL DEFAULT 0 - Partial index on (is_ignored) WHERE is_ignored = 1; small WHERE set means a tiny index that only ever services the bucket lookup. Why a real persons row instead of a separate table or status enum: - face_detections.person_id stays a clean foreign key — no special code paths for "ignored faces" anywhere else in the schema. - The cluster-suggester already filters by `person_id IS NULL`, so bound-to-ignored faces are naturally excluded from re-clustering without any change. - merge / rename / delete all work on it with the existing routes (the management UI just hides it from default views). DAO additions / changes: - get_or_create_ignored_person (idempotent; race-safe via the UNIQUE COLLATE NOCASE on persons.name + retry-on-409 fallback). - list_persons gains an include_ignored parameter; default false so the management screen hides the bucket unless asked. - find_persons_by_names_ci filters is_ignored=0 in SQL so the auto-bind path can NEVER target the bucket — even if the user happens to tag photos as "Ignored", the heuristic look-up skips it. Bucket assignment is always an explicit operator action. - update_person accepts is_ignored: Option<bool> so a person can be moved into / out of the bucket without a delete + recreate. Routes: - POST /persons/ignore-bucket — returns the bucket, creating it on first call. Frontend uses this lazily right before binding. - GET /persons gains ?include_ignored=true; default behavior unchanged. - PATCH /persons/{id} now accepts is_ignored. Tests: ignore_bucket_idempotent_and_filters_auto_bind covers the contract: bucket is idempotent across calls, find_persons_by_names_ci skips it (even on exact name match), default list_persons hides it, include_ignored=true surfaces it. All other tests updated to pass the new is_ignored: false / Option<bool> fields explicitly. cargo test --lib: 181/0; fmt + clippy clean for new code. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-29 22:48:16 +00:00
Cameron Cordes	0e160f5d22	faces: include bbox on /faces/embeddings response Apollo's cluster suggester wants to render a face-cropped thumbnail for each cluster's representative — a multi-person photo with the cluster about 'one' of them was unreadable when the thumb showed the whole image. Plumbing bbox through means the UI can crop to the rep face without an extra round-trip per cluster. FaceEmbeddingRow gains bbox_x/y/w/h (Optional<f32>, mirrors the column nullability — for status='detected' rows the CHECK constraint guarantees they're populated, but the type stays nullable as documentation). list_embeddings already loaded these from the underlying FaceDetectionRow; this commit just stops dropping them when constructing the response. No DB changes; no behavior change for existing callers (the new fields are additive). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-29 21:01:58 +00:00
Cameron Cordes	a24fac5511	faces: backfill missing content_hash from the file watcher Photos indexed before content-hashing landed (or where the hash compute failed silently on insert) end up in image_exif with NULL content_hash. build_face_candidates keys on content_hash, so those rows would never become face candidates without backfill — symptom: face detection logs nothing despite photos being in the library and the watcher running. The dedicated `backfill_hashes` binary already handles this; this commit lets the watcher self-heal during full scans so the deploy 'just works' for face recognition without operator action. Idempotent — subsequent scans see populated hashes and no-op. Bounded per tick by FACE_HASH_BACKFILL_MAX_PER_TICK (default 500) so a watcher tick on a 50k-photo legacy library doesn't blake3 every file in one shot. For very large backlogs the dedicated binary is still faster (no DAO mutex contention with the watcher loop). Only runs when face_client.is_enabled(), so legacy deploys without APOLLO_FACE_API_BASE_URL keep the same behavior. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-29 20:41:08 +00:00
Cameron Cordes	23f4941471	faces: surface enabled/disabled state + per-tick candidate count Manual deploy debugging: 'Saved thumbnail' logs were visible (boot-time thumbnail backfill) but no face_watch logs were appearing, with no obvious way to tell whether the integration was disabled, hadn't reached a full scan yet, or had simply seen no new files. Two log lines: - watch_files startup: 'Face detection: ENABLED' / 'DISABLED (set APOLLO_FACE_API_BASE_URL or APOLLO_API_BASE_URL to enable)' so you can tell at a glance whether the env wired through. - process_new_files (debug-level): 'face_watch: scan tick — N image file(s) walked, M candidate(s) (library 'main', modified_since=...)' so an empty-candidate scan is distinguishable from a misconfigured or skipped one without bumping log level for the rest of the watcher. No behavior change. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-29 20:19:17 +00:00
Cameron Cordes	41f93d70d1	faces: tighten bootstrap candidate filter, bump to 1.1.0 Filter <3-char tags and emoji/symbol-bearing tags out of the bootstrap candidate list before grouping. Manual testing surfaced these as noise the operator never tickets — they pushed real candidates lower in the list and made the UI harder to scan. This is a hard filter (drop from candidates entirely), not a heuristic flag — looks_like_person still governs the default-checked decision for the rows that do survive. is_plausible_name_token rules: - >= 3 chars after trimming (rejects "AB", "OK", whitespace-only) - Each char is alphabetic (any script — covers Renée, José, 田中太郎), whitespace, name-punctuation (' - . _ U+2019), or ASCII digit - Anything else (emoji, symbols, math, arrows, control codes) drops the whole tag Digits stay allowed at this layer; looks_like_person handles "Trip 2018" on the heuristic side. Lets a "Sarah2" alias still appear so the operator can spot and confirm it manually, just unticked by default. Cargo version bump 1.0.0 → 1.1.0 marks the face-recog feature surface landing — Phase 2's schema + endpoints, Phase 3's file-watch hook, and Phase 4's bootstrap + auto-bind are all behind APOLLO_FACE_API_BASE_URL, so legacy 1.0 deploys without that env see no behavior change. Tests: 1 new (faces::tests::is_plausible_name_token_filters_short_and_emoji) covers the accept-list (Latin/accented/Asian scripts, hyphenated and apostrophe names) and the reject-list (length floor, emoji classes, symbols, leading/trailing whitespace handling). cargo test --lib: 180 / 0; fmt + clippy clean. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-29 19:05:04 +00:00
Cameron Cordes	1859399759	faces: phase 4 — people-tag bootstrap + auto-bind on detection Wires the existing string people-tags into the new persons table and auto-binds new detections to a same-named person when the photo carries exactly one matching tag. ImageApi has no notion of which tags are people-tags today (purely a user mental model), so this is operator- confirmed: the suggester surfaces candidates with a heuristic flag, the operator confirms, then bootstrap creates persons rows. Auto-bind follows on every detection thereafter. New endpoints: GET /tags/people-bootstrap-candidates Per case-insensitive name group: display name (most-frequent capitalization), normalized lowercase, summed usage_count, looks_like_person heuristic flag, already_exists check against the persons table. Sorted persons-likely-first then by count. POST /persons/bootstrap Body: {names: [string]}. Idempotent — pre-fetches the existing- name set so a duplicate request reports per-row "already exists" instead of 409-ing each insert. Created rows get created_from_tag=true; failed rows surface in `skipped` with a reason. looks_like_person heuristic — conservative on purpose because the operator confirms in the UI: - 1–2 whitespace-separated words - Each word starts uppercase, no digits anywhere - Single-word names not on a small denylist (cat, christmas, beach, sunset, untagged, ...). Two-word names skip the denylist so "Sarah Smith" is never false-rejected. FaceDao additions: - find_persons_by_names_ci — bulk lowercase-name → person_id lookup via sql_query (Diesel's BoxedSelectStatement + LOWER() doesn't play well with the type system). - person_reference_embedding — L2-normalized mean of a person's detected embeddings, filtered by model_version so a future buffalo_xl row can never contaminate an in-flight buffalo_l auto- bind decision. Returns None when the person has no faces yet. - assign_face_to_person — sets face_detections.person_id and, only when persons.cover_face_id is NULL, claims this face as cover. The UI's hand-picked cover survives later auto-binds. - decode_embedding_bytes / cosine_similarity helpers — pub(crate) so face_watch can decode the wire bytes once and feed them through the cosine threshold. Auto-bind in face_watch::process_one: After every successful detect, for each newly-stored auto face we pull the photo's tags, look up which (if any) map to existing persons, and: - skip when zero or multiple distinct persons are matched (multi-match is genuinely ambiguous; cluster suggester handles it) - on first face for a person: bind unconditionally so bootstrap can ever produce a usable reference - thereafter: bind iff cosine(new_emb, person_ref) >= FACE_AUTOBIND_MIN_COS (default 0.4, env-tunable to 0..=1) The reference embedding comes from person_reference_embedding under the same model_version as the candidate, so a model upgrade never silently re-anchors a person's centroid. Plumbing: watch_files now constructs its own SqliteTagDao alongside the other watcher DAOs and threads it through process_new_files → run_face_detection_pass → process_one. The handler-side TagDao registration in main.rs already covers bootstrap_candidates_handler; no extra app_data wiring needed. Tests: 8 new (faces.rs): - looks_like_person accepts/rejects/two-word-skips-denylist (3) - cosine_similarity on identical / orthogonal / opposite / mismatch / zero / empty inputs - decode_embedding_bytes round-trip + size validation - find_persons_by_names_ci groups case + handles empty input - person_reference_embedding filters by model_version (buffalo_l ref must not include buffalo_xl rows) - assign_face_to_person sets cover when unset, doesn't overwrite cargo test --lib: 179 / 0; fmt + clippy clean for new code. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-29 18:55:01 +00:00
Cameron Cordes	f985a0d658	faces: surface UNIQUE constraint as 409, not 500 Manual smoke test caught a bug: POST /persons with a duplicate name returned 500 with the body 'insert person Cameron' instead of the intended 409 Conflict. Root cause: the handler keyed on `format!("{}", e).contains("unique")`, but anyhow's plain Display only renders the outermost context ("insert person Cameron") and hides the diesel error nested below ('UNIQUE constraint failed: persons.name'). The string check was a false negative on every duplicate. Fix: walk the source chain and downcast for diesel::result::Error::DatabaseError(UniqueViolation, _) — exposed via a shared `is_unique_violation` helper used by both create_person_handler and update_person_handler. Error bodies for non-unique failures now use `{:#}` so the body actually carries the underlying cause when the user surfaces it. merge_persons_handler also moves to `{:#}` for richer error bodies; the "itself" check was already structural and unaffected. Regression test (faces::tests::is_unique_violation_walks_chain) pins both the bug shape ({} doesn't surface UNIQUE) and the fix (is_unique_violation correctly downcasts the chain), so a future refactor of error handling can't silently re-bury this. cargo test --lib: 171 / 0; fmt + clippy clean. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-29 18:44:10 +00:00
Cameron Cordes	4dee7b6f73	faces: phase 3 — file-watch hook drives auto detection Wire face detection into ImageApi's existing scan loop so new uploads pick up faces automatically and the initial backlog grinds through on full-scan ticks. No new job system; Phase 2's already_scanned check makes the work implicitly idempotent (one face_detections row per content_hash, including no_faces / failed marker rows). face_watch.rs (new): - run_face_detection_pass(library, excluded_dirs, face_client, face_dao, candidates) — sync entry point. Builds a per-pass tokio runtime and fans out detect calls bounded by FACE_DETECT_CONCURRENCY (default 8). The watcher thread itself stays sync. - filter_excluded — applies the same PathExcluder /memories uses, so @eaDir / .thumbnails / EXCLUDED_DIRS-listed paths skip detection before we burn a detect call (and Apollo's GPU memory) on junk. - read_image_bytes_for_detect — RAW/HEIC route through extract_embedded_jpeg_preview because opencv-python-headless can't decode either; everything else gets a plain std::fs::read so EXIF orientation reaches Apollo's exif_transpose intact. - process_one — translates Apollo's response into the Phase 2 marker contract: faces[] empty → no_faces; FaceDetectError::Permanent → failed (don't retry); Transient → no marker (next scan retries); success with N faces → N detected rows with the embeddings unpacked. main.rs (process_new_files + watch_files): - watch_files now also takes face_client + excluded_dirs; the watcher thread builds a SqliteFaceDao the same way it builds ExifDao / PreviewDao. - After the EXIF write loop, build_face_candidates queries image_exif for the just-walked image paths' content_hashes (covers new uploads and pre-existing backlog), filters out anything already_scanned, and hands the rest to face_watch::run_face_detection_pass. - Bypassed wholesale when face_client.is_enabled() is false — keeps the watcher usable on legacy deploys where Apollo isn't configured. Tests: 5 face_watch unit tests cover the parts that don't need a real Apollo: - filter_excluded drops dir-component patterns (@eaDir) without matching substring file names (eaDir-not-a-thing.jpg keeps). - filter_excluded drops absolute-under-base subtrees (/private). - empty EXCLUDED_DIRS short-circuits cleanly. - read_image_bytes_for_detect passes JPEG bytes through verbatim (orientation must reach Apollo unmodified). - read_image_bytes_for_detect falls through to plain read when a RAW-extension file has no embedded preview, so Apollo gets a chance to 422 and we mark failed rather than infinitely-retrying. cargo test --lib: 170 / 0; fmt and clippy clean for new code. End-to-end (drop a photo → face_detections row appears) needs Apollo running and is deferred to deploy-time verification. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-29 18:21:19 +00:00
Cameron Cordes	f77e44b34d	faces: fix PathExcluder false-positive + cover face_client/crop in tests PathExcluder was iterating every component of the absolute path, including the system prefix. Two of the existing memories tests had been failing on master because tempdir() lives under /tmp on Linux and a pattern like "tmp" then matched the system /tmp component rather than anything the user actually asked to exclude. Phase 3's file-watch hook will use the same code to skip @eaDir / .thumbnails under each library's BASE_PATH, so the bug would hide every photo on a host whose BASE_PATH passes through a directory named the same as a user pattern. Fix: store base in PathExcluder and strip it before scanning components. A path that lives outside base falls through to the no-match branch (defensive — nothing legit hits that today). Also extracted the face_client error classification into a pure classify_error_response(status, body) so the marker-row contract with Apollo (422 → Permanent / 'failed', 5xx → Transient / defer) is unit-testable without spinning up an HTTP server. New tests: memories::tests::test_path_excluder_* — 2 previously failing tests now pass. ai::face_client::tests::classify_* — 4 cases: 422 decode_failed → Permanent, 503 cuda_oom → Transient (handles both string and {code:..} detail shapes), 5xx → Transient + other 4xx → Permanent, unparseable HTML body still classifies on status. faces::tests::crop_* — 3 cases: invalid bbox rejected, valid bbox round-trips through JPEG decode, corner crop with 10% padding clamps inside source. cargo test --lib: 165 passed / 0 failed (was 156 / 2 failed). cargo fmt and clippy on new code clean. The remaining sort_by clippy warnings in pre-existing files (memories.rs, files.rs, exif.rs) are unrelated and present on master. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-29 18:09:44 +00:00
Cameron Cordes	860169032b	faces: phase 2 — schema + manual face/person CRUD Land the persistence model and HTTP surface for local face recognition. Inference still lives in Apollo (Phase 1); this side adds the data home plus every endpoint Apollo's UI and FileViewer-React will consume. Schema (new migration 2026-04-29-000000_add_faces): - persons: visual identities. Optional entity_id bridges to the existing knowledge-graph entities table; auto-bridging is left to the management UI (we don't muddy LLM provenance from face rows). UNIQUE(name COLLATE NOCASE) so 'alice' / 'Alice' fold to one row. - face_detections: keyed on content_hash (cross-library dedup), with status='detected' carrying bbox + 512-d embedding BLOB, and 'no_faces' / 'failed' marker rows that tell Phase 3's file watcher not to re-scan. Marker invariant enforced via CHECK; partial UNIQUE on content_hash WHERE status='no_faces' guards against double-marks. Schema regenerated with `diesel print-schema` against a clean migration run; joinables added for face_detections → libraries / persons and persons → entities. face_client.rs (sibling of apollo_client.rs): - reqwest multipart, 60 s timeout (CPU inference on a backlog can be slow; bounded threadpool on Apollo serializes calls anyway). - FaceDetectError::{Permanent, Transient, Disabled} — Phase 3 keys its marker-row decision on this. 422 → mark failed, 5xx → defer. - APOLLO_FACE_API_BASE_URL falls back to APOLLO_API_BASE_URL when unset; both unset = is_enabled() false, callers no-op. faces.rs (DAO + handlers): - SqliteFaceDao implements the full FaceDao trait; person face counts go through sql_query because diesel's BoxedSelectStatement + group_by trips trait-resolver recursion. - merge_persons re-points face rows in a transaction, copies notes when target's are empty, deletes src. - manual POST /image/faces resolves content_hash through image_exif, crops the user-drawn bbox with 10% padding (detector wants context around ears/jaw), POSTs the crop to face_client.embed for a real ArcFace vector, then inserts source='manual'. - Cluster-suggest (Phase 6) gets its data from GET /faces/embeddings — base64-encoded paged BLOBs so Apollo's DBSCAN can stream them without ImageApi pre-aggregating. Endpoints registered alongside add_*_services in main.rs: GET /faces/stats?library= GET /faces/embeddings?library=&unassigned=&limit=&offset= GET /image/faces?path=&library= POST /image/faces (manual create via embed) PATCH /image/faces/{id} DELETE /image/faces/{id} GET /persons?library= POST /persons GET /persons/{id} PATCH /persons/{id} DELETE /persons/{id}?cascade=set_null\|delete (set_null default) POST /persons/{id}/merge GET /persons/{id}/faces?library= The file-watch hook (Phase 3) and the rerun-on-one-photo handler (Phase 6) live behind the FaceDao methods marked dead_code today — they're called only when those phases land. Same shape for the trait methods that aren't reached by Phase 2 routes. Tests: 3 DAO unit tests cover person CRUD + case-insensitive uniqueness, marker-row idempotency (mark_status is a no-op when any row exists), and merge re-pointing faces. Cargo.toml: reqwest gains the `multipart` feature. cargo build / cargo test --lib / cargo fmt / cargo clippy --all-targets all clean for the new code; the two pre-existing test_path_excluder failures and the pre-existing sort_by clippy warnings are unrelated and present on master. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-29 18:03:42 +00:00
cameron	6642db3c8b	Merge pull request 'feat/apollo-places-tool and Geo Tagging Exif' (#60 ) from feat/apollo-places-tool into master Reviewed-on: #60	2026-04-28 23:09:33 +00:00
Cameron Cordes	57fb0bcd3c	EXIF GPS write: POST /image/exif/gps via exiftool New endpoint accepts {path, library, latitude, longitude} and shells out to exiftool to write GPSLatitude/GPSLongitude (with N/S, E/W refs) into the file's EXIF in place. After the write, the handler re-extracts EXIF and updates the image_exif row so the DB stays in sync — the response carries the updated metadata block in one round-trip. Falls through to store_exif if the row is missing. `exif::write_gps` is the small helper. `-overwrite_original` so no .orig sidecar is left behind. Validates lat/lon range + supports_exif before spawning exiftool. Format support matches the existing read path (JPEG / TIFF / RAW / HEIF / PNG / WebP) — videos still need a different writer and aren't covered. Apollo's "+ PIN" carousel button (separate commit on the Apollo side) calls this through /api/photos/exif/gps. Drive-by: cargo fmt one-line collapse on apollo_client.rs. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-28 22:25:40 +00:00
Cameron Cordes	4ae7be35e9	Apollo Places: enrich insights with personal place name + notes Optional integration with the sibling Apollo project's user-defined Places (name + lat/lon + radius_m + description + category). When APOLLO_API_BASE_URL is set, the per-photo location resolver folds the most-specific containing Place into the LLM prompt's location string — "Home (My house in Cambridge) — near Cambridge, MA" rather than the city name alone. Smallest-radius wins; Apollo sorts server-side via /api/places/contains, so the carousel badge in Apollo and the prompt string here always agree. Adds an agentic tool `get_personal_place_at(latitude, longitude)` that the LLM can call during chat continuation. Tool description tells the model the call returns the user's free-text notes, not just a name. Deliberately narrow — no enumerate-all variant, lat/lon required. Unset APOLLO_API_BASE_URL = legacy Nominatim-only path, tool is not registered. 5 s timeout; all errors degrade silently. Tests: 5 unit tests for compose_location_string (Apollo only, Nominatim only, both, both-with-description, neither). Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-28 19:11:12 +00:00
cameron	9d58547ce3	Merge pull request 'feat/raw-thumb-embedded-preview' (#59 ) from feat/raw-thumb-embedded-preview into master Reviewed-on: #59	2026-04-28 17:21:27 +00:00
Cameron Cordes	6521a328bf	RAW preview: exiftool fallback for MakerNote / SubIFD previews kamadak-exif's In::PRIMARY / In::THUMBNAIL only address IFD0 and IFD1. On modern Nikon NEFs the full-res review JPEG lives in the MakerNote's PreviewIFD (and many Canon CR2s / DNGs put theirs in a SubIFD chain) — both unreachable through the existing reader, so the previous patch still produced no preview for those files and the pipeline fell through to ffmpeg, which writes black frames when it can't decode the RAW. Add a slow-path layer in extract_embedded_jpeg_preview that shells out to exiftool for PreviewImage / JpgFromRaw / OtherImage (one process per tag). All candidates from both layers are pooled and the largest valid JPEG wins. exiftool not on PATH degrades to fast-path-only behavior rather than breaking — the fallback is a strict superset. Documented the new optional dependency in README.md and CLAUDE.md with install commands for apt / brew / winget / choco. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-28 17:13:36 +00:00
Cameron Cordes	00b3c80141	RAW: try IFD0 + IFD1 for embedded preview, serve at full size The thumbnail pipeline's embedded-JPEG extractor only checked IFD1 (THUMBNAIL), which on many Nikon NEFs is missing or zero-length even when IFD0 (PRIMARY) carries a perfectly good 1-2 MP reduced-resolution preview the camera writes for in-body review. The previous behavior produced black thumbs on disk: the buggy IFD1 pointer resolved to a short byte sequence that happened to satisfy the SOI sanity check, image::load_from_memory accepted it, and the resize path quietly wrote a black JPEG. Now both IFDs are checked and the larger valid JPEG wins. Format- agnostic: applies to every TIFF-based RAW (NEF / ARW / CR2 / DNG / RAF / ORF / RW2 / PEF / SRW / TIFF). is_tiff_raw is now pub so main.rs can gate its full-size handler on it. Also extends the /image handler so size=full requests for RAW formats serve the embedded preview as image/jpeg instead of NamedFile-streaming the original RAW bytes - browsers can't decode a .nef container, so <img src=...> would otherwise land as a broken image. Falls through to NamedFile if no preview is present, preserving the historical behavior for callers that genuinely want the original bytes. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-28 16:52:10 +00:00
cameron	a53c3ae514	Merge pull request 'feature/exif-batch-endpoint for Apollo' (#58 ) from feature/exif-batch-endpoint into master Reviewed-on: #58	2026-04-28 12:58:30 +00:00
Cameron Cordes	7621282419	Thumb orientation + library filter on /photos/exif Two follow-ups on the same feature branch: 1. Bake EXIF orientation into generated thumbnails. The `image` crate doesn't apply Orientation on load, and `save_with_format(..Jpeg)` drops EXIF — so portrait phone shots ended up sideways in any client that displays the cached thumb directly (no EXIF tag for the browser to compensate from). New `exif::read_orientation` reads the tag cheaply (no full EXIF parse) and `exif::apply_orientation` does the rotate/flip via image's existing `rotate90/180/270` + `fliph/flipv`. Applied in both branches of `generate_image_thumbnail` (RAW embedded- JPEG path and the regular `image::open` path). Existing thumbnails in the cache are still wrong-orientation; wipe the thumb dir or run a one-off backfill once this lands. 2. Optional `library` query param on `/photos/exif`. Accepts numeric id or name (same shape as `/image?library=...`), resolved via the existing `resolve_library_param` helper so a bad value 400s before we touch the DAO. Filter is applied post-query in the handler rather than pushed into `query_by_exif` to keep the DAO trait (and its test mocks) unchanged. Cheap enough at typical library counts; can be moved into SQL later if it ever isn't. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-27 17:29:36 -04:00
Cameron Cordes	c6f82ebaba	Batch EXIF endpoint: GET /photos/exif Adds a single round-trip projection of `image_exif` for every photo whose `date_taken` falls in `[date_from, date_to]`. Wraps the existing `ExifDao::query_by_exif` DAO method which already handles the SQL filter in one query against the covering index — the only missing piece was HTTP plumbing. Designed for window-scoped consumers like Apollo's photo-to-track matcher, which currently does N+1 (one `/photos` listing + one `/image/metadata` per photo). Because `/image/metadata` serializes on `Data<Mutex<dyn ExifDao>>`, that pattern can take 10s+ for windows with hundreds of photos. The new endpoint takes one mutex acquisition for the whole batch. Response shape: { photos: [ { file_path, library_id, library_name, camera_model, width, height, gps_latitude, gps_longitude, date_taken } ], total: N } Two notes on scope: - Photos with NULL `date_taken` are excluded by `query_by_exif`'s semantics. Filename-extracted dates are not synthesized here; rare callers that need that fallback can still hit `/image/metadata`. - GPS columns are stored as f32 in image_exif to keep row size small; the JSON shape widens to f64 so clients don't have to know about the on-disk precision. Library names are pre-mapped from `app_state.libraries` once and stamped on each row, avoiding an O(rows × libraries) linear scan. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-27 16:38:53 -04:00
cameron	9cf3af383d	Merge pull request '006-bin-cleanup-and-progress' (#57 ) from 006-bin-cleanup-and-progress into master Reviewed-on: #57	2026-04-27 20:28:32 +00:00
Cameron	b9d5578653	feat(bins): multi-library populate_knowledge + progress UX populate_knowledge now loads real libraries from the DB instead of fabricating a single library_id=1 row from BASE_PATH. Adds --library <id\|name> to restrict the walk and validates --path against the selected library roots. The full library set is still passed to InsightGenerator so resolve_full_path can probe every root when an insight resolves to a different library than the one being walked. Adds indicatif progress bars across the long-running utility binaries via a shared src/bin_progress.rs helper (determinate bar + open-ended spinner with consistent styling). Per-batch info! noise is replaced by the bar's throughput/ETA; warnings and errors route through pb.println so they scroll above the bar instead of fighting with it. populate_knowledge spinner during scan, determinate bar over all libs backfill_hashes spinner with running hashed/missing/errors counts import_calendar determinate bar; embedding/store failures inline import_location_* determinate bar advancing by chunk size import_search_* determinate bar; pb cloned into the spawn task cleanup_files P1 determinate bar over DB paths cleanup_files P2 determinate bar; pb.suspend() around y/n/a/s prompt Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-26 23:55:33 -04:00
Cameron	d5f944c7b6	chore(bins): retire unused migrate_exif Single-library hardcoded (library_id=1) and missing content_hash/size_bytes backfill, so the watcher's full-scan path subsumes everything it does. Removed the binary and its CLAUDE.md reference. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-26 23:55:06 -04:00
cameron	2db611e1c1	Merge pull request 'OpenRouter Support, Insight Chat and User injection' (#56 ) from 005-llm-client-trait into master Reviewed-on: #56	2026-04-26 23:01:33 +00:00
Cameron	21e624da6b	fix(video): sentinel for failed HLS encodes to stop retry loop Previously a corrupt source (e.g. truncated mp4 with no moov atom) would be re-queued on every directory scan: cleanup_partial_hls wipes the temp playlist on ffmpeg failure, leaving no .m3u8 to short-circuit the next pass. Mirrors the thumbnail .unsupported sentinel pattern: on ffmpeg failure, write <playlist>.m3u8.unsupported, and treat its presence as "done" in both the ScanDirectoryMessage filter and the QueueVideosMessage check. Delete the sentinel to force a retry. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-26 01:06:13 -04:00
Cameron	021d1bffc0	chore: ignore db backups and local .idea config files Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 19:13:28 -04:00
Cameron	fa21b0d73d	chore(ai): disable default few-shot insight ids Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 19:12:25 -04:00
Cameron	0e55a6b125	fix(ai): treat rewind at end of history as no-op success The mobile client's regenerate-after-failure flow sends a discard index equal to the server's rendered count (its optimistic user bubble for the failed turn was never persisted). find_raw_cut treated this as out of range, surfacing as "Chat rewind failed: discard_from_rendered_index out of range" and blocking the retry. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 19:12:17 -04:00
Cameron	0ebc2e9003	feat(ai): rerank timing + think:false + OpenRouter error detail - search_rag reranker now logs wall-clock time around the ollama.generate call, the candidate count / top-N going in, and the final reordering. The "final indices" + swap-count line is info level so it's always visible; detailed before/after previews stay at debug for when you want to inspect reranker quality. - New OllamaClient::generate_no_think convenience that sets Ollama's top-level think:false on the request, plumbed through try_generate via a new internal generate_with_options. Used only by the reranker today; avoids the chain-of-thought tax on reasoning models (Qwen3/VL, DeepSeek-R1 distills, GPT-OSS) when the task has nothing to reason about. Server-side no-op on non-reasoning models. - OpenRouter chat_with_tools "missing choices[0]" error now includes the actual response body — extracts structured {error: {code, message}} when OpenRouter surfaces it (common for upstream-provider issues like rate limits and content moderation), otherwise falls back to a truncated raw-JSON view. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 16:19:45 -04:00
Cameron	e5781325c6	fix(ai): render tool-call arguments as compact JSON in logs Switch the "Agentic tool call" log from {:?} (Debug) to {} (Display) on serde_json::Value. Display produces compact JSON — `{"date":"2023-08-15"}` instead of `Object {"date": String("2023-08-15")}` — which is what the model actually sent and what a human reading the log wants to see. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 14:25:53 -04:00
Cameron	d43f5fc991	docs: document OLLAMA_REQUEST_TIMEOUT_SECONDS env var Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 13:54:23 -04:00
Cameron	f0ae9f95dc	feat(ai): few-shot exemplars + sticky Ollama preference - Few-shot injection on /insights/generate/agentic: compresses prior training_messages into trajectory blocks (tool calls + result summaries) and injects into the system prompt. Hardcoded default ids with optional request override. - New fewshot_source_ids column on photo_insights (+ migration) to track which exemplars influenced a given row, for downstream training-set filtering. Chat amend rows stamp None with a lineage note. - Ollama client now remembers which server (primary/fallback) most recently succeeded and tries it first on the next call, via a shared Arc<AtomicBool>. Avoids re-404ing the primary on every agent iteration when the chosen model only lives on the fallback. - Demote noisy logs: daily_summary "Summary match" lines to debug; inner chat_with_tools non-2xx body log from error to warn (outer layer owns the terminal-error signal). - Drift-guard tests for summarize_tool_result covering the success / empty / error / unknown shape for every tool. - Tidy: three pre-existing clippy warnings cleaned up. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-24 13:54:06 -04:00
Cameron	29f32b9d22	FFMPEG playlist improvements Better playlist management, .tmp renaming, HLS playlist parameter and concurrency tweaking.	2026-04-24 10:08:03 -04:00
Cameron	13b9d54861	fix(scan): quiet startup scans & thumbnail RAW/HEIC Three recurring issues on every full scan: 1. Video playlist scans re-enqueued every file only to reject it as AlreadyExists. Pre-filter in ScanDirectoryMessage and QueueVideosMessage so we skip videos whose .m3u8 already exists, and demote the leaked AlreadyExists log to debug. 2. image crate was built with only jpeg/png features, so webp/tiff/avif files logged "format not supported" every scan. Enable those features. 3. RAW (ARW/NEF/CR2/...) and HEIC thumbnails weren't generated, so the scan kept retrying them. Try the file's embedded JPEG preview via kamadak-exif first (fast, pure-Rust, works on Sony ARW where ffmpeg's TIFF decoder fails). Fall back to ffmpeg for HEIC/HEIF and RAWs with no preview. Anything still undecodable gets a <thumb>.unsupported sentinel so future scans skip it. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-23 20:47:13 -04:00
Cameron	dc2a96162e	fix(dates): prefer earliest of fs created/modified as fallback On copied or restored files (e.g. a backup library), the OS stamps created at copy time while modified is preserved from the source, so the earlier of the two is a better proxy for when the content originated. Adds utils::earliest_fs_time and threads it through the three spots that fall back to filesystem dates: photos-list sort, memories grouping, and insight-generation timestamp. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-23 17:20:12 -04:00
Cameron	d54419e779	style: cargo fmt drift Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-23 17:19:59 -04:00
Cameron	aa651d1c7b	feat(ai): iteration budget in prompt + preserve photo-knowledge links - Inject the max-iterations budget into the agentic system prompt for both insight generation and chat turns. Chat does this per-turn by appending a note to the replayed system message and restoring it before persistence so the note doesn't accumulate across turns. - Stop deleting entity_photo_links at the start of agentic insight generation. The clear made recall_facts_for_photo always return empty, wasting a tool call and discarding knowledge from prior runs. Re-linking the same entity is already an INSERT OR IGNORE no-op. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-23 16:28:48 -04:00
Cameron	6831f50993	feat(ai): USER_NAME env + shared summary prompt + test-bin knobs Introduces USER_NAME (default "Me") as the single source for the message sender label and the first-person persona across daily summaries, SMS context, insight generation, and chat. Eliminates the "Me:" transcript / "what I did" ambiguity that confused smaller models, and unhardcodes "Cameron" from prompt text + the knowledge-graph owner entity. Set USER_NAME=Cameron in .env to preserve the existing owner entity row (keyed on UNIQUE(name, entity_type)) — otherwise the next run creates a fresh owner entity and orphans the existing facts/photo-links. Also: - search_messages redirect: when the model calls it with date/contact but no query, return a hint pointing at get_sms_messages instead of a bare missing-parameter error (prevents same-turn retry loops) - sharpen search_messages vs get_sms_messages tool descriptions so content-vs-time-based intent is unambiguous - extract build_daily_summary_prompt (+ DAILY_SUMMARY_MESSAGE_LIMIT, DAILY_SUMMARY_SYSTEM_PROMPT) shared by daily_summary_job and test_daily_summary binary — prompt tweaks now land in both - EMBEDDING_MODEL const; fixes both insert sites that stored "mxbai-embed-large:335m" while generate_embeddings actually runs "nomic-embed-text:v1.5" - test_daily_summary: add --num-ctx / --temperature / --top-p / --top-k / --min-p flags wired into OllamaClient setters, and print the configured knobs at the top of each run - OllamaClient::generate now logs prompt/gen token counts and tok/s via log_chat_metrics (symmetric with chat_with_tools) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-22 23:39:37 -04:00
Cameron	e4a3536f87	feat(ai): search_messages tool + RAG reranker Adds a search_messages tool that hits the Django FTS5/semantic/hybrid endpoint for keyword-quality text search over message bodies, and an LLM-based reranker inside tool_search_rag (gated by SEARCH_RAG_RERANK, default on). Reranker pulls ~3x candidates from the vector index, asks the chat model to rank by relevance, and falls back to vector order on parse failure. The reranker shares the active chat turn's OllamaClient so num_ctx and sampling match — otherwise Ollama unloads/reloads the model on every rerank call. (Unverified end-to-end; caught by inspection, awaiting e2e confirmation.) Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com>	2026-04-22 10:56:03 -04:00

1 2 3 4 5 ...

421 Commits