docs: update documentation for React frontend and cache mode

Update all documentation to reflect recent changes: - Replace Streamlit dashboard references with React TypeScript dashboard - Update dashboard port from 8501 to 8080 - Add auth.py and database.py to architecture section - Change USE_DATABASE terminology to USE_CACHE - Add JWT_SECRET to environment variables reference - Document default admin credentials and user seeding 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude <noreply@anthropic.com>
2026-03-14 14:30:21 -04:00
parent ebba983a1d
commit ed81ae4569
3 changed files with 96 additions and 78 deletions
@@ -1,16 +1,19 @@
-# Database Mode for Testing and Analytics
+# Database Storage and Caching

-This document explains how to use SPARC's database mode for storing LLM messages for testing and analytics purposes.
+This document explains how SPARC uses PostgreSQL for storing LLM messages, enabling response caching and analytics.

 ## Overview

-SPARC supports two modes of operation:
+SPARC stores all LLM interactions in PostgreSQL, providing:

-1. **API Mode** (default): Messages are sent to OpenRouter's API and you receive real LLM responses
-2. **Database Mode**: Messages are stored in a PostgreSQL database without making API calls, useful for:
-   - Testing the application without consuming API credits
-   - Collecting analytics on message patterns and usage
-   - Development and debugging
+- **Response Caching**: Avoid redundant API calls for previously analyzed patents
+- **Analytics**: Track usage patterns, token consumption, and analysis history
+- **Persistence**: Maintain analysis history across sessions
+
+SPARC supports two cache modes:
+
+1. **Cache Mode** (default, `USE_CACHE=true`): Check database for cached responses before making API calls
+2. **Fresh Mode** (`USE_CACHE=false`): Always make fresh API calls (still stores results in database)

 ## Setup

@@ -45,43 +48,43 @@ cp .env.example .env
 Edit `.env` and set:

 ```env
-# For database mode (testing/analytics)
-USE_DATABASE=true
+# Database connection (required)
 DATABASE_URL=postgresql://postgres:postgres@localhost:5432/sparc

-# For API mode (production)
-USE_DATABASE=false
+# Cache mode: use cached responses when available
+USE_CACHE=true
+
+# API key for fresh LLM calls
 OPENROUTER_API_KEY=your_openrouter_key_here
 ```

 ## Usage

-### Running in Database Mode
+### Running with Cache Mode (Default)

-Set `USE_DATABASE=true` in your `.env` file, then run the application normally:
+Set `USE_CACHE=true` in your `.env` file, then run the application normally:

 ```bash
 python main.py
 ```

-Instead of sending messages to OpenRouter, the application will:
- Store all prompts in the database
- Return a placeholder response
- Log metadata (company name, analysis type, timestamps)
+The application will:
+- Check the database for cached responses matching the request
+- If found, return the cached response (no API call)
+- If not found, make an API call and store the response for future use

-### Running in API Mode
+### Running with Fresh Mode

-Set `USE_DATABASE=false` in your `.env` file, then run the application normally:
+Set `USE_CACHE=false` in your `.env` file to always get fresh responses:

 ```bash
 python main.py
 ```

-The application will send messages to OpenRouter and return real LLM responses.
-
-### Hybrid Mode (Optional)
-
-You can also enable database logging while still using the API by initializing the database client in your code. The `LLMAnalyzer` will automatically log all API calls to the database if a database client is available.
+The application will:
+- Always send messages to OpenRouter for real LLM responses
+- Store all responses in the database
+- Useful when you need the latest analysis or want to refresh cached data

 ## Viewing Analytics

@@ -195,16 +198,16 @@ docker-compose down -v

 ## Toggling Between Modes

-You can easily switch between modes by changing the `USE_DATABASE` environment variable:
+You can easily switch between modes by changing the `USE_CACHE` environment variable:

-### Quick Toggle (temporary, for testing)
+### Quick Toggle (temporary)

 ```bash
-# Run in database mode
-USE_DATABASE=true python main.py
+# Run with caching enabled
+USE_CACHE=true python main.py

-# Run in API mode
-USE_DATABASE=false python main.py
+# Run with fresh API calls
+USE_CACHE=false python main.py
 ```

 ### Persistent Toggle
@@ -212,38 +215,48 @@ USE_DATABASE=false python main.py
 Edit your `.env` file:

 ```env
-# For testing/analytics
-USE_DATABASE=true
+# Use cached responses when available (recommended for most use)
+USE_CACHE=true

-# For production use
-USE_DATABASE=false
+# Always make fresh API calls
+USE_CACHE=false
 ```

 ## Use Cases

-### Testing Without API Costs
+### Cost Optimization with Caching

-During development, enable database mode to test the full application flow without consuming API credits:
+Cache mode reduces API costs by reusing previous analysis results:

 ```bash
-USE_DATABASE=true python main.py
+USE_CACHE=true python main.py
+```
+
+If the same company/patent combination was analyzed before, the cached response is returned instantly.
+
+### Fresh Analysis
+
+When you need the latest LLM analysis (e.g., after model updates):
+
+```bash
+USE_CACHE=false python main.py
 ```

 ### Collecting Usage Analytics

-Enable database mode in a test environment to collect analytics on:
+The database stores all interactions, enabling analytics on:
 - Which companies are analyzed most frequently
 - Types of analyses performed
- Prompt patterns and lengths
- Usage over time
+- Token usage and costs over time
+- Response caching hit rates

 ### Development and Debugging

-Database mode is useful for:
- Testing patent parsing logic without API calls
+Database storage is useful for:
+- Reviewing actual prompts sent to the LLM
+- Analyzing response patterns
 - Debugging the full pipeline end-to-end
- Collecting sample prompts for optimization
- Understanding token usage patterns (when in API mode with logging)
+- Understanding token usage patterns

 ## Troubleshooting

@@ -64,7 +64,7 @@ docker-compose ps
 # You should see:
 # - sparc-postgres (healthy)
 # - sparc-api (running on port 8000)
-# - sparc-dashboard (running on port 8501)
+# - sparc-dashboard (running on port 8080)
 ```

 The database is automatically initialized by the `init-db` service.
@@ -116,11 +116,13 @@ docker-compose up -d postgres
 # Wait for database to be healthy, then initialize
 python scripts/init_database.py

-# Terminal 1: Start FastAPI backend
+# Start FastAPI backend
 uvicorn SPARC.api:app --host 0.0.0.0 --port 8000 --reload

-# Terminal 2: Start Streamlit dashboard
-streamlit run dashboard.py --server.port 8501 --server.address 0.0.0.0
+# For the React frontend (separate terminal)
+cd frontend
+npm install
+npm run dev
 ```

 ---
@@ -141,7 +143,7 @@ Access the services:
 |---------|-----|
 | REST API | http://localhost:8000 |
 | API Documentation (Swagger) | http://localhost:8000/docs |
-| Dashboard (Web UI) | http://localhost:8501 |
+| Dashboard (Web UI) | http://localhost:8080 |

 ---

@@ -149,16 +151,17 @@ Access the services:

 ### Via Dashboard (Web UI)

-1. Open http://localhost:8501
-2. Select **"Company Analysis"** from the sidebar
-3. Enter a company name (e.g., "Intel")
-4. Click **"Analyze"**
+1. Open http://localhost:8080
+2. Register a new account or login (default admin: `admin` / `admin`)
+3. Navigate to **"Analysis"** from the sidebar
+4. Enter a company name (e.g., "Intel")
+5. Click **"Analyze"**

 This will:
 - Query SerpAPI for recent patents
 - Download and parse patent PDFs
 - Send patent content to Claude for analysis
- Store prompt/response in PostgreSQL
+- Store prompt/response in PostgreSQL (with caching)
 - Display results in the dashboard

 ### Via REST API
@@ -233,12 +236,12 @@ docker exec -it sparc-postgres psql -U postgres -d sparc -c \

 | Component | Purpose |
 |-----------|---------|
-| **Dashboard** | Streamlit web UI for interactive analysis |
-| **FastAPI** | REST API for programmatic access |
+| **Dashboard** | React TypeScript web UI with authentication |
+| **FastAPI** | REST API with JWT authentication |
 | **Analyzer** | Orchestrates patent retrieval and LLM analysis |
 | **SerpAPI** | Retrieves patent data from Google Patents |
 | **OpenRouter** | Routes requests to Claude for AI analysis |
-| **PostgreSQL** | Stores prompts, responses, and analytics |
+| **PostgreSQL** | Stores prompts, responses, users, and cached results |

 ---

@@ -248,10 +251,9 @@ docker exec -it sparc-postgres psql -U postgres -d sparc -c \
 |----------|----------|---------|-------------|
 | `API_KEY` | Yes | - | SerpAPI key for patent search |
 | `OPENROUTER_API_KEY` | Yes | - | OpenRouter API key for Claude access |
-| `DATABASE_URL` | Yes* | - | PostgreSQL connection string |
-| `USE_DATABASE` | No | `false` | Set to `true` to enable database storage |
-
-*Required when `USE_DATABASE=true`
+| `DATABASE_URL` | Yes | - | PostgreSQL connection string |
+| `USE_CACHE` | No | `true` | Check database for cached responses before API calls |
+| `JWT_SECRET` | Yes | - | Secret key for JWT authentication (change in production!) |

 ### Database URL Format

@@ -273,9 +275,9 @@ The `docker-compose.yml` includes all services needed for production:
 | Service | Container | Port | Description |
 |---------|-----------|------|-------------|
 | `postgres` | sparc-postgres | 5432 | PostgreSQL database |
-| `init-db` | sparc-init-db | - | One-time database initialization |
-| `api` | sparc-api | 8000 | FastAPI REST API |
-| `dashboard` | sparc-dashboard | 8501 | Streamlit web UI |
+| `init-db` | sparc-init-db | - | One-time database initialization (seeds admin user) |
+| `api` | sparc-api | 8000 | FastAPI REST API with JWT auth |
+| `dashboard` | sparc-dashboard | 8080 | React TypeScript web UI |

 ### Common Docker Compose Commands

@@ -382,11 +384,11 @@ cp .env.example .env
 docker-compose up -d postgres
 python scripts/init_database.py
 uvicorn SPARC.api:app --reload &
-streamlit run dashboard.py
+cd frontend && npm install && npm run dev &

 # Check status
 curl http://localhost:8000/health
-open http://localhost:8501
+open http://localhost:8080

 # View data
 python scripts/view_analytics.py