forked from 0xWheatyz/SPARC
Document or implement object storage for patent PDF files #840
Reference in New Issue
Block a user
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Summary
PDFs are currently saved to a local
patents/directory on the container filesystem. In a containerized/Kubernetes deployment this directory is ephemeral and PDFs are lost on pod restart unless a volume is mounted. The current requirement is not documented prominently.Roadmap Reference
P2 Backend -- Patent PDF storage. See ROADMAP.md under "P2 -- Medium Priority > Backend".
What to Do
Choose one of two approaches (discuss with architect before starting):
Option A -- Document the volume mount requirement (minimal change):
README.mdsection explaining thepatents/volume requirement.volumes:entry indocker-compose.ymlmapping./patents:/app/patents./app/patents.Option B -- Object storage integration (S3/MinIO):
boto3(orminiopy-async) as a dependency.PDF_STORAGE_BACKENDenv var (localors3) andS3_BUCKET,S3_ENDPOINTvars.StorageBackendinterface used byanalyzer.py.Acceptance Criteria
localbackend still works with no S3 configuration.Resolved by PR #58 and PR #31. S3/MinIO object storage for patent PDFs implemented and documented.