forked from 0xWheatyz/SPARC
Document or migrate patent PDF storage to object storage (S3/MinIO) #1039
Reference in New Issue
Block a user
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Background
Roadmap item: P2 -- Backend -- Patent PDF storage
PDFs are currently saved to a local
patents/directory inside the container. This is not suitable for containerized or multi-replica deployments because the directory is ephemeral and not shared across instances.Work to do
STORAGE_BACKENDenv var (local/s3), useboto3to upload/download PDFs whens3is selected.docker-compose.ymland the README so operators know to persist thepatents/directory.analyzer.py/analyze_single_patentto fetch from the configured backend rather than assuming a local path.PATENTS_STORAGE_BACKEND,PATENTS_S3_BUCKET,PATENTS_S3_ENDPOINT(for MinIO) env vars toconfig.py.Acceptance criteria
localfallback.docker-compose.ymland the README explains the required volume mount with an example snippet.Ref: ROADMAP.md P2 -- Patent PDF storage
Resolved. PR #58 (feature/s3-storage) implemented S3/MinIO object storage support for patent PDFs with STORAGE_BACKEND config option. PR #31 also documented the volume mount requirement. Verified: SPARC/storage.py exists with S3 support in current main.