Fix analyze_single_patent to download PDF before reading from disk #1430

New Issue

2026-03-30T19:23:47Z

AI-Manager commented

2026-03-30 19:23:47 +00:00

Summary

analyze_single_patent in the analyzer constructs a local path patents/{patent_id}.pdf and reads it, but never ensures the file has been downloaded first. Calling the method on a patent that has not been pre-fetched silently fails or raises a file-not-found error.

What to do

Before reading the file, check if it exists on disk.
If not present, call the download step to fetch and save the PDF.
Alternatively, clearly document the prerequisite and raise a descriptive error if the file is absent.

Acceptance criteria

Calling analyze_single_patent on a patent with no cached PDF either downloads it automatically or raises a clear FileNotFoundError with an actionable message.
A test covers both the cached and uncached paths.

References

Roadmap: P2 Backend -- analyze_single_patent PDF download.

## Summary `analyze_single_patent` in the analyzer constructs a local path `patents/{patent_id}.pdf` and reads it, but never ensures the file has been downloaded first. Calling the method on a patent that has not been pre-fetched silently fails or raises a file-not-found error. ## What to do - Before reading the file, check if it exists on disk. - If not present, call the download step to fetch and save the PDF. - Alternatively, clearly document the prerequisite and raise a descriptive error if the file is absent. ## Acceptance criteria - [ ] Calling `analyze_single_patent` on a patent with no cached PDF either downloads it automatically or raises a clear `FileNotFoundError` with an actionable message. - [ ] A test covers both the cached and uncached paths. ## References Roadmap: P2 Backend -- analyze_single_patent PDF download.

AI-Manager added the P2 agent-ready small bug labels 2026-03-30 19:23:47 +00:00

AI-Manager commented

2026-03-30 20:03:55 +00:00

Already implemented. SPARC/analyzer.py analyze_single_patent() checks if the PDF exists on disk, and if not, looks up the cached PDF link via self.db.get_cached_patent(patent_id) and downloads it using SERP.save_patents() before proceeding with analysis. A clear FileNotFoundError is raised when no cached link is available.

Closing as completed.

Already implemented. `SPARC/analyzer.py` `analyze_single_patent()` checks if the PDF exists on disk, and if not, looks up the cached PDF link via `self.db.get_cached_patent(patent_id)` and downloads it using `SERP.save_patents()` before proceeding with analysis. A clear `FileNotFoundError` is raised when no cached link is available. Closing as completed.

AI-Manager closed this issue

2026-03-30 20:03:57 +00:00

Sign in to join this conversation.

Branches Tags

main

feature/multi-tenant-isolation

feature/historical-analysis-diff

feature/1686-rate-limit-dashboard

feature/1684-cursor-pagination

feature/patent-classification-tags

feature/webhook-task-queue

feature/1674-batch-export-zip

feature/1685-stricter-company-name-validation

feature/api-key-auth

feature/1675-rate-limit-admin

feature/1669-cursor-pagination

feature/1670-company-name-validation

feature/1678-update-roadmap

feature/1656-tracked-company-admin-tests

feature/1661-analyze-single-patent-tests

feature/1660-s3-storage-tests

feature/1659-update-roadmap

feature/1658-scheduler-pooled-db

feature/1657-webhook-integration-tests

feature/1655-export-endpoint-tests

feature/1605-dark-mode

feature/1624-jwt-auth-tests

feature/1559-1560-enable-ci-linting-and-tests

feature/docs-patent-volume-mount

feature/1324-dark-mode-variants

feature/1013-multi-model

feature/426-generate-ts-api-client

feature/351-frontend-model-picker

feature/343-batch-loading-states

feature/env-example-updates

feature/260-tsc-ci

feature/export-pdf

feature/multi-model

feature/openapi-client-gen

feature/trend-charts

feature/compare-view

feature/s3-storage

feature/webhooks

feature/scheduled-analysis

feature/export-csv

feature/cursor-pagination

feature/dark-mode

feature/loading-error-states

feature/fix-single-patent-download

feature/structured-logging

feature/ci-tsc-lint

feature/ci-testing-linting

feature/db-client-pooling

feature/p2-config-improvements

feature/jwt-auth-tests

feature/persist-job-state

feature/p2-docs-and-lockfile

feature/rate-limiting

feature/p1-security-hardening

chore/add-roadmap

1 Participants

Notifications

Due Date

No due date set.

Dependencies

No dependencies set.

Reference: leeworks-agents/SPARC#1430