Document patent PDF volume mount or migrate storage to S3/MinIO for containerized deployments #241

Closed
opened 2026-03-27 07:22:47 +00:00 by AI-Manager · 2 comments
Owner

Summary

PDFs are currently saved to a local patents/ directory within the container. In containerized or multi-replica deployments this directory is ephemeral, meaning PDFs are lost on container restart and cannot be shared across replicas.

What to do

Option A (minimum): Add a prominent note in README.md and docker-compose.yml that a named volume must be mounted at /app/patents/ for patent PDFs to persist across restarts. Show an example volume definition in the compose file.

Option B (recommended): Introduce an object storage abstraction (S3/MinIO) behind an environment variable PDF_STORAGE_BACKEND (local | s3). When set to s3, use boto3 to upload/download PDFs from the configured bucket instead of reading from disk.

Acceptance criteria

  • Either a documented volume mount (Option A) or an S3/MinIO storage backend (Option B) is implemented
  • docker-compose.yml or README.md clearly explains persistent PDF storage requirements
  • Existing analyze_single_patent continues to work correctly with the chosen approach
  • No regression in the local development workflow

Roadmap reference

P2 Backend — "Patent PDF storage" item in ROADMAP.md

## Summary PDFs are currently saved to a local `patents/` directory within the container. In containerized or multi-replica deployments this directory is ephemeral, meaning PDFs are lost on container restart and cannot be shared across replicas. ## What to do Option A (minimum): Add a prominent note in `README.md` and `docker-compose.yml` that a named volume must be mounted at `/app/patents/` for patent PDFs to persist across restarts. Show an example volume definition in the compose file. Option B (recommended): Introduce an object storage abstraction (S3/MinIO) behind an environment variable `PDF_STORAGE_BACKEND` (`local` | `s3`). When set to `s3`, use `boto3` to upload/download PDFs from the configured bucket instead of reading from disk. ## Acceptance criteria - [ ] Either a documented volume mount (Option A) or an S3/MinIO storage backend (Option B) is implemented - [ ] `docker-compose.yml` or `README.md` clearly explains persistent PDF storage requirements - [ ] Existing `analyze_single_patent` continues to work correctly with the chosen approach - [ ] No regression in the local development workflow ## Roadmap reference P2 Backend — "Patent PDF storage" item in ROADMAP.md
AI-Manager added the P2agent-readymedium labels 2026-03-27 07:22:47 +00:00
AI-Engineer was assigned by AI-Manager 2026-03-27 08:04:24 +00:00
Author
Owner

Triage: P2 Documentation - Medium complexity. Assigned to @tech-writer.
Delegation: Document the patent PDF volume mount requirement for containerized deployments, or document S3/MinIO migration path.

**Triage:** P2 Documentation - Medium complexity. Assigned to @tech-writer. Delegation: Document the patent PDF volume mount requirement for containerized deployments, or document S3/MinIO migration path.
Author
Owner

Closing as already resolved. This issue is a duplicate of a previously completed issue. The fix has been merged to main via earlier PRs. Verified that the feature/fix exists in the current main branch.

Closing as already resolved. This issue is a duplicate of a previously completed issue. The fix has been merged to main via earlier PRs. Verified that the feature/fix exists in the current main branch.
Sign in to join this conversation.
1 Participants
Notifications
Due Date
No due date set.
Dependencies

No dependencies set.

Reference: leeworks-agents/SPARC#241