matthew/claw-apply - claw-apply - pq.io

matthew/claw-apply

Author	SHA1	Message	Date
Matthew Jackson	253d1888e9	Add S3-backed storage to prevent data loss - New lib/storage.mjs: async S3 backup on every queue/log save - Versioned S3 bucket (claw-apply-data) keeps every revision - Auto-restore from S3 if local file is missing or corrupt - saveQueue/saveLog now validate data type before writing (prevents the exact bug that corrupted the queue) - IAM role attached to EC2 instance for credential-free S3 access - Config: storage.type = "local" (default) or "s3" Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-06 21:56:37 -08:00
Matthew Jackson	273a32e571	Add sidecar update pattern to prevent concurrent queue writes Secondary processes (standalone classifier, ad-hoc scripts) now write to queue_updates.jsonl via writePendingUpdate() instead of modifying jobs_queue.json directly. Primary processes pick up updates on next loadQueue() call using atomic rename-then-read. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-06 19:46:36 -08:00
Matthew Jackson	e62756c6ca	fix: robustness improvements — atomic writes, timeouts, shell injection, validation errors - Atomic JSON writes (write-to-tmp + rename) prevent queue/log corruption - Per-job (3min) and overall run (45min) timeouts prevent hangs - execFileSync in ai_answer.mjs prevents shell injection with resume paths - Validation error detection after form fill in Easy Apply modal - Config-driven enabled_apply_types (from settings.json) - isRequired() detects required/aria-required/label * patterns - getLabel() strips trailing * from required field labels - Actionable logging on failures ("Action: ..." messages) Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-06 10:01:53 -08:00
Claw	3575f06018	fix: dedupeAfterFilter skips groups with unscored members — wait until all copies are scored	2026-03-06 16:00:23 +00:00
Claw	c9b527c83a	feat: find-all → filter → dedup flow - addJobs: allows same job on multiple tracks (dedup key = track::id) - Cross-track copies get composite id (job.id_track) to avoid batch collisions - dedupeAfterFilter(): after collect, keeps highest-scored copy per URL, marks rest as 'duplicate' - Called automatically at end of collect phase	2026-03-06 15:55:00 +00:00
Claw	2f05a40954	fix: addJobs always reads fresh from disk to prevent searcher clobbering filter scores	2026-03-06 11:24:05 +00:00
Claw	dee6e98603	feat: searcher Phase 2 classifies apply type; applier sorts by priority; already-applied detection	2026-03-06 00:58:22 +00:00
Matthew Jackson	a244a5fddf	chore: clean up dead code, use shared loadConfig, cache queue I/O - Remove unused readFileSync import from job_applier.mjs - Remove unused makeJobId (dead code, nothing imports it) - setup.mjs: use shared loadConfig instead of inline cfg() - queue.mjs: add in-memory cache for queue and log to avoid redundant disk reads during a single run Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-05 16:06:37 -08:00
Matthew Jackson	e71f940687	fix: add config validation and retry logic for failed jobs - Add loadConfig() helper with clear errors for missing/malformed JSON - Replace raw JSON.parse(readFileSync(...)) in both entry points - Track retry_count on jobs; re-queue as 'new' up to max_retries (default 2) - Add max_retries and DEFAULT_MAX_RETRIES constant Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>	2026-03-05 16:03:33 -08:00
Claw	52a56f59f6	feat: claw-apply v0.1 — full implementation - job_searcher.mjs: LinkedIn + Wellfound search, queue population - job_applier.mjs: Easy Apply + Wellfound apply, Mode A/B - lib/form_filler.mjs: config-driven form filling, custom answers.json - lib/linkedin.mjs: two-panel Easy Apply flow - lib/wellfound.mjs: Wellfound search + apply - lib/browser.mjs: Kernel stealth browser factory with local fallback - lib/queue.mjs: jobs_queue.json management - lib/notify.mjs: Telegram notifications - setup.mjs: setup wizard with login verification - Config templates: profile, search_config, answers, settings - SKILL.md: OpenClaw skill definition	2026-03-05 23:24:09 +00:00