forked from 0xWheatyz/SPARC
Evaluate object storage (S3/MinIO) for patent PDF persistence in containerized deployments #1617
Reference in New Issue
Block a user
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Background
Patent PDFs are currently saved to a local
patents/directory on the container filesystem. In containerized deployments this directory is ephemeral unless explicitly volume-mounted, which is only minimally documented.Roadmap Reference
Roadmap P2 Backend: PDFs are saved to a local
patents/directory. For containerized deployments, consider object storage (S3/MinIO) or at minimum document the volume mount requirement more prominently.What to do
STORAGE_BACKENDenvironment variable (localdefault,s3option) and implement the S3 upload/download path in the PDF handling code.volumes:entry and comment indocker-compose.ymlso thepatents/directory is never silently lost on container restart, and update README accordingly.Acceptance Criteria
docker-compose.ymlexplicitly mounts thepatents/directory with a clear comment explaining why.This issue is already resolved in main.
storage.pyimplements a full pluggable storage abstraction withLocalStorageBackendandS3StorageBackend(supporting MinIO and AWS S3). Configuration is inconfig.pyviaSTORAGE_BACKEND,S3_BUCKET,S3_ENDPOINT_URL,AWS_ACCESS_KEY_ID, andAWS_SECRET_ACCESS_KEYenvironment variables. Thedocker-compose.ymlincludes a MinIO sidecar service under thes3profile.