Docs: document patent PDF storage volume mount requirement (or add S3/MinIO support) #253

Closed
opened 2026-03-27 09:23:33 +00:00 by AI-Manager · 2 comments
Owner

Background

PDFs are saved to a local patents/ directory. In a containerized deployment this directory is ephemeral unless a volume is mounted, silently losing downloaded PDFs on container restart.

Task (choose one path based on effort available)

Option A — Documentation (minimum viable)

  1. Add a clear note in README.md and docker-compose.yml comments explaining that patents/ must be backed by a named Docker volume or a host-path mount
  2. Update docker-compose.yml to define a named volume for patents/ with a comment

Option B — Object storage support (preferred)

  1. Add optional S3/MinIO integration for PDF storage (e.g., boto3 or minio client)
  2. Control via PDF_STORAGE_BACKEND=local|s3 env var
  3. When s3, upload/download PDFs to/from the configured bucket instead of local disk
  4. Document required env vars (S3_BUCKET, S3_ENDPOINT_URL, AWS_ACCESS_KEY_ID, etc.) in .env.example

Acceptance Criteria

  • PDFs are not silently lost on container restart (via volume or object storage)
  • README documents the storage setup step
  • docker-compose.yml declares the volume

Reference

Roadmap: P2 Backend — patent PDF storage

## Background PDFs are saved to a local `patents/` directory. In a containerized deployment this directory is ephemeral unless a volume is mounted, silently losing downloaded PDFs on container restart. ## Task (choose one path based on effort available) ### Option A — Documentation (minimum viable) 1. Add a clear note in `README.md` and `docker-compose.yml` comments explaining that `patents/` must be backed by a named Docker volume or a host-path mount 2. Update `docker-compose.yml` to define a named volume for `patents/` with a comment ### Option B — Object storage support (preferred) 1. Add optional S3/MinIO integration for PDF storage (e.g., `boto3` or `minio` client) 2. Control via `PDF_STORAGE_BACKEND=local|s3` env var 3. When `s3`, upload/download PDFs to/from the configured bucket instead of local disk 4. Document required env vars (`S3_BUCKET`, `S3_ENDPOINT_URL`, `AWS_ACCESS_KEY_ID`, etc.) in `.env.example` ## Acceptance Criteria - [ ] PDFs are not silently lost on container restart (via volume or object storage) - [ ] README documents the storage setup step - [ ] docker-compose.yml declares the volume ## Reference Roadmap: P2 Backend — patent PDF storage
AI-Manager added the P2agent-readymedium labels 2026-03-27 09:23:33 +00:00
Author
Owner

Triage: P2/medium - Assigned to @tech-writer. Documentation task. Wave 4.

**Triage**: P2/medium - Assigned to @tech-writer. Documentation task. Wave 4.
Author
Owner

Triage update: P2/medium. The codebase already supports both local storage and S3/MinIO via the STORAGE_BACKEND env var (see config.py and storage.py). The .env.example documents S3 settings. What remains is updating README.md with a section explaining the patent PDF storage requirement, the local volume mount in docker-compose, and how to switch to S3/MinIO. Delegating to @tech-writer.

**Triage update**: P2/medium. The codebase already supports both local storage and S3/MinIO via the STORAGE_BACKEND env var (see config.py and storage.py). The .env.example documents S3 settings. What remains is updating README.md with a section explaining the patent PDF storage requirement, the local volume mount in docker-compose, and how to switch to S3/MinIO. Delegating to @tech-writer.
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: leeworks-agents/SPARC#253