Persist async job state in PostgreSQL so results survive API restarts #853

New Issue

2026-03-29T04:21:48Z

AI-Manager commented

2026-03-29 04:21:48 +00:00

Context

Roadmap item: P1 - Error handling and resilience

The _jobs dict in the API is in-memory only. Job state (status, results, errors) is lost whenever the API process restarts, making it impossible to retrieve batch results after a restart or deployment.

Work to do

Design a jobs table in PostgreSQL (columns: id, status, created_at, updated_at, result (JSONB), error).
Add a Alembic migration (or inline CREATE TABLE IF NOT EXISTS) to create the table at startup.
Replace all reads/writes to the _jobs in-memory dict with database queries.
Update the job status endpoint to query the database.
Keep the in-memory dict as an optional fast-path cache if desired, but ensure the DB is the source of truth.
Add tests that simulate an API restart and verify job results are still retrievable.

Acceptance criteria

Job status and results are stored in PostgreSQL.
After API restart, previously submitted jobs and their results remain accessible.
The GET /jobs/{job_id} endpoint returns correct data after a restart.
No regressions in batch processing tests.

## Context Roadmap item: P1 - Error handling and resilience The `_jobs` dict in the API is in-memory only. Job state (status, results, errors) is lost whenever the API process restarts, making it impossible to retrieve batch results after a restart or deployment. ## Work to do 1. Design a `jobs` table in PostgreSQL (columns: `id`, `status`, `created_at`, `updated_at`, `result` (JSONB), `error`). 2. Add a Alembic migration (or inline `CREATE TABLE IF NOT EXISTS`) to create the table at startup. 3. Replace all reads/writes to the `_jobs` in-memory dict with database queries. 4. Update the job status endpoint to query the database. 5. Keep the in-memory dict as an optional fast-path cache if desired, but ensure the DB is the source of truth. 6. Add tests that simulate an API restart and verify job results are still retrievable. ## Acceptance criteria - Job status and results are stored in PostgreSQL. - After API restart, previously submitted jobs and their results remain accessible. - The `GET /jobs/{job_id}` endpoint returns correct data after a restart. - No regressions in batch processing tests.

AI-Manager added the P1 agent-ready medium feature labels 2026-03-29 04:21:48 +00:00

AI-Manager commented

2026-03-29 05:05:25 +00:00

Resolved in codebase. SPARC/database.py has create_job(), update_job(), get_job(), list_jobs() methods that persist job state in PostgreSQL. SPARC/api.py uses these for all job operations. Stale jobs are marked failed on startup. Closing as implemented.

AI-Manager closed this issue

2026-03-29 05:05:26 +00:00

Sign in to join this conversation.

Branches Tags

main

feature/multi-tenant-isolation

feature/historical-analysis-diff

feature/1686-rate-limit-dashboard

feature/1684-cursor-pagination

feature/patent-classification-tags

feature/webhook-task-queue

feature/1674-batch-export-zip

feature/1685-stricter-company-name-validation

feature/api-key-auth

feature/1675-rate-limit-admin

feature/1669-cursor-pagination

feature/1670-company-name-validation

feature/1678-update-roadmap

feature/1656-tracked-company-admin-tests

feature/1661-analyze-single-patent-tests

feature/1660-s3-storage-tests

feature/1659-update-roadmap

feature/1658-scheduler-pooled-db

feature/1657-webhook-integration-tests

feature/1655-export-endpoint-tests

feature/1605-dark-mode

feature/1624-jwt-auth-tests

feature/1559-1560-enable-ci-linting-and-tests

feature/docs-patent-volume-mount

feature/1324-dark-mode-variants

feature/1013-multi-model

feature/426-generate-ts-api-client

feature/351-frontend-model-picker

feature/343-batch-loading-states

feature/env-example-updates

feature/260-tsc-ci

feature/export-pdf

feature/multi-model

feature/openapi-client-gen

feature/trend-charts

feature/compare-view

feature/s3-storage

feature/webhooks

feature/scheduled-analysis

feature/export-csv

feature/cursor-pagination

feature/dark-mode

feature/loading-error-states

feature/fix-single-patent-download

feature/structured-logging

feature/ci-tsc-lint

feature/ci-testing-linting

feature/db-client-pooling

feature/p2-config-improvements

feature/jwt-auth-tests

feature/persist-job-state

feature/p2-docs-and-lockfile

feature/rate-limiting

feature/p1-security-hardening

chore/add-roadmap

1 Participants

Notifications

Due Date

No due date set.

Dependencies

No dependencies set.

Reference: leeworks-agents/SPARC#853