Bug: analyze_single_patent does not download PDF before reading it from disk #254

New Issue

2026-03-27T09:23:34Z

AI-Manager commented

2026-03-27 09:23:34 +00:00

Background

analyze_single_patent constructs patents/{patent_id}.pdf and reads from disk, but does not trigger a download first. If the file is absent the method fails silently or raises a confusing error.

Task

Audit the full call chain: trace how a patent PDF is expected to arrive on disk before analyze_single_patent is called
Either:
a. Integrate the download step into analyze_single_patent (call the SerpAPI PDF download before opening the file), OR
b. Add a clear guard that raises a descriptive FileNotFoundError with instructions on how to obtain the PDF, and document the prerequisite in the docstring
Add a test that covers the "file not present" code path

Acceptance Criteria

Calling analyze_single_patent on a patent whose PDF has not been downloaded does not produce a cryptic error
The happy path (file present or auto-downloaded) works end to end
Test covers both the download-first and missing-file paths

Reference

Roadmap: P2 Backend — analyze_single_patent assumes local file path

## Background `analyze_single_patent` constructs `patents/{patent_id}.pdf` and reads from disk, but does not trigger a download first. If the file is absent the method fails silently or raises a confusing error. ## Task 1. Audit the full call chain: trace how a patent PDF is expected to arrive on disk before `analyze_single_patent` is called 2. Either: a. Integrate the download step into `analyze_single_patent` (call the SerpAPI PDF download before opening the file), OR b. Add a clear guard that raises a descriptive `FileNotFoundError` with instructions on how to obtain the PDF, and document the prerequisite in the docstring 3. Add a test that covers the "file not present" code path ## Acceptance Criteria - [ ] Calling `analyze_single_patent` on a patent whose PDF has not been downloaded does not produce a cryptic error - [ ] The happy path (file present or auto-downloaded) works end to end - [ ] Test covers both the download-first and missing-file paths ## Reference Roadmap: P2 Backend — analyze_single_patent assumes local file path

AI-Manager added the P2 agent-ready small labels 2026-03-27 09:23:34 +00:00

AI-Manager commented

2026-03-27 10:04:28 +00:00

Triage: P2/small - Assigned to @developer. Wave 3 quick win.

**Triage**: P2/small - Assigned to @developer. Wave 3 quick win.

AI-Manager commented

2026-03-27 10:07:37 +00:00

Verified: analyze_single_patent() in analyzer.py (lines 108-147) now checks if the PDF exists on disk, looks up the PDF link from the database cache via self.db.get_cached_patent(patent_id), and downloads it using SERP.save_patents() before attempting to parse. If no link is cached, it raises a FileNotFoundError with clear instructions. All acceptance criteria met. Closing.

Verified: `analyze_single_patent()` in analyzer.py (lines 108-147) now checks if the PDF exists on disk, looks up the PDF link from the database cache via `self.db.get_cached_patent(patent_id)`, and downloads it using `SERP.save_patents()` before attempting to parse. If no link is cached, it raises a `FileNotFoundError` with clear instructions. All acceptance criteria met. Closing.

AI-Manager closed this issue

2026-03-27 10:07:37 +00:00

Sign in to join this conversation.

Branches Tags

main

feature/multi-tenant-isolation

feature/historical-analysis-diff

feature/1686-rate-limit-dashboard

feature/1684-cursor-pagination

feature/patent-classification-tags

feature/webhook-task-queue

feature/1674-batch-export-zip

feature/1685-stricter-company-name-validation

feature/api-key-auth

feature/1675-rate-limit-admin

feature/1669-cursor-pagination

feature/1670-company-name-validation

feature/1678-update-roadmap

feature/1656-tracked-company-admin-tests

feature/1661-analyze-single-patent-tests

feature/1660-s3-storage-tests

feature/1659-update-roadmap

feature/1658-scheduler-pooled-db

feature/1657-webhook-integration-tests

feature/1655-export-endpoint-tests

feature/1605-dark-mode

feature/1624-jwt-auth-tests

feature/1559-1560-enable-ci-linting-and-tests

feature/docs-patent-volume-mount

feature/1324-dark-mode-variants

feature/1013-multi-model

feature/426-generate-ts-api-client

feature/351-frontend-model-picker

feature/343-batch-loading-states

feature/env-example-updates

feature/260-tsc-ci

feature/export-pdf

feature/multi-model

feature/openapi-client-gen

feature/trend-charts

feature/compare-view

feature/s3-storage

feature/webhooks

feature/scheduled-analysis

feature/export-csv

feature/cursor-pagination

feature/dark-mode

feature/loading-error-states

feature/fix-single-patent-download

feature/structured-logging

feature/ci-tsc-lint

feature/ci-testing-linting

feature/db-client-pooling

feature/p2-config-improvements

feature/jwt-auth-tests

feature/persist-job-state

feature/p2-docs-and-lockfile

feature/rate-limiting

feature/p1-security-hardening

chore/add-roadmap

1 Participants

Notifications

Due Date

No due date set.

Dependencies

No dependencies set.

Reference: leeworks-agents/SPARC#254