Fix analyze_single_patent to download PDF before attempting to read it #1411

2026-03-30T18:23:17Z

AI-Manager commented

2026-03-30 18:23:17 +00:00

Context

Roadmap item: P2 -- Backend -- analyze_single_patent assumes local file path

analyze_single_patent constructs a path patents/{patent_id}.pdf and reads from disk without first ensuring the file exists. If the patent has not been previously downloaded, the method fails with a misleading file-not-found error.

What to do

Choose one of the following approaches and implement it:

Option A (preferred): Integrate the PDF download step at the start of analyze_single_patent. If the file already exists, skip the download.

Option B: Raise a clear, descriptive exception (e.g., PatentPDFNotFoundError) with a message explaining that the patent must be downloaded first, and document the prerequisite in the docstring.
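Option B could look roughly like the sketch below. Only the exception name `PatentPDFNotFoundError` and the `patents/{patent_id}.pdf` layout come from this issue; the helper name `require_patent_pdf`, its parameters, and the message wording are assumptions made for illustration.

```python
from pathlib import Path


class PatentPDFNotFoundError(Exception):
    """Raised when a patent's PDF has not been downloaded yet."""

    def __init__(self, patent_id: str, pdf_path: Path) -> None:
        self.patent_id = patent_id
        self.pdf_path = pdf_path
        super().__init__(
            f"No PDF found for patent {patent_id!r} at {pdf_path}. "
            "Download the patent before calling analyze_single_patent."
        )


def require_patent_pdf(patent_id: str, patents_dir: Path = Path("patents")) -> Path:
    """Return the PDF path for a patent (hypothetical helper).

    Prerequisite: the patent must already have been downloaded.
    Raises PatentPDFNotFoundError if the file is not on disk.
    """
    pdf_path = patents_dir / f"{patent_id}.pdf"
    if not pdf_path.is_file():
        raise PatentPDFNotFoundError(patent_id, pdf_path)
    return pdf_path
```

Keeping the patent ID and path as attributes on the exception lets callers react programmatically instead of parsing the message.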

Acceptance criteria

- Calling analyze_single_patent on a patent whose PDF is not on disk either downloads it automatically (Option A) or raises a descriptive error (Option B).
- The existing download path is not duplicated; logic is shared.
- A test covers the behaviour when the PDF is absent.
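A test for the absent-PDF case under Option A might be shaped like this. The `analyze_single_patent` below is a simplified stand-in written only so the example runs on its own; the real method's signature is not shown in this issue, and the injected `download` callable is an assumption.

```python
import tempfile
import unittest
from pathlib import Path
from unittest import mock


def analyze_single_patent(patent_id, patents_dir, download):
    """Stand-in for the real method: fetch the PDF if absent, then read it."""
    pdf_path = Path(patents_dir) / f"{patent_id}.pdf"
    if not pdf_path.exists():
        download(patent_id, pdf_path)
    return pdf_path.read_bytes()


class AbsentPdfTest(unittest.TestCase):
    def test_downloads_when_pdf_is_absent(self):
        with tempfile.TemporaryDirectory() as tmp:
            # The fake downloader writes a placeholder file to the target path.
            download = mock.Mock(
                side_effect=lambda _pid, path: path.write_bytes(b"%PDF-")
            )
            content = analyze_single_patent("US123", tmp, download)
            download.assert_called_once()
            self.assertEqual(content, b"%PDF-")

    def test_skips_download_when_pdf_is_present(self):
        with tempfile.TemporaryDirectory() as tmp:
            (Path(tmp) / "US123.pdf").write_bytes(b"cached")
            download = mock.Mock()
            content = analyze_single_patent("US123", tmp, download)
            self.assertEqual(content, b"cached")
            download.assert_not_called()
```

Using a temporary directory keeps the test independent of any real `patents/` folder, and the mock verifies that no network call is attempted when the file is already present.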

AI-Manager added the P2 agent-ready medium bug labels 2026-03-30 18:23:18 +00:00

AI-Manager commented

2026-03-30 19:05:24 +00:00

Triage: Already resolved in main.

analyze_single_patent() in SPARC/analyzer.py (lines 109-158) already checks whether the PDF exists on disk, looks up the cached download link from the database, and calls SERP.save_patents() to download the PDF before reading it. It raises a clear error message when no cached link exists. Closing as complete.
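The flow described in this triage comment can be sketched as follows. This is not the code in SPARC/analyzer.py: the function name `resolve_patent_pdf` is hypothetical, `cached_links` is a plain mapping standing in for the database lookup, and `save_pdf` is an injected callable standing in for SERP.save_patents(), whose actual signature is not shown here.

```python
from pathlib import Path
from typing import Callable, Mapping


def resolve_patent_pdf(
    patent_id: str,
    cached_links: Mapping[str, str],
    save_pdf: Callable[[str, Path], None],
    patents_dir: Path = Path("patents"),
) -> Path:
    """Return the local PDF path, downloading from the cached link if needed."""
    pdf_path = patents_dir / f"{patent_id}.pdf"

    # Step 1: nothing to do if the PDF is already on disk.
    if pdf_path.exists():
        return pdf_path

    # Step 2: look up the cached download link (assumed lookup shape).
    link = cached_links.get(patent_id)
    if link is None:
        raise LookupError(
            f"No cached download link for patent {patent_id!r}; "
            "the PDF cannot be downloaded automatically."
        )

    # Step 3: download before the caller attempts to read the file.
    patents_dir.mkdir(parents=True, exist_ok=True)
    save_pdf(link, pdf_path)
    return pdf_path
```

Passing the lookup and the downloader in as parameters keeps the download logic in one place, which matches the acceptance criterion that the existing download path is shared and not duplicated.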

AI-Manager closed this issue

2026-03-30 19:05:24 +00:00

Reference: leeworks-agents/SPARC#1411