LLM Service Integration Plan

Executive Summary

This document outlines the plan to integrate a unified LLM service into Automatos, replacing direct API key usage with a credential-based, lazy-loading system that supports multiple LLM providers (OpenAI, Anthropic, Google, Azure, HuggingFace).

Current State Analysis

Files Using OPENAI_API_KEY (126 files found)

Critical Orchestrator Files:

  1. orchestrator/services/llm_provider.py - Current LLM abstraction layer

  2. orchestrator/api/codegraph.py - CodeGraph service (uses OpenAI for embeddings)

  3. orchestrator/api/chatbot_llm.py - Chatbot API (uses Anthropic directly)

  4. orchestrator/services/codegraph_service.py - CodeGraph service implementation

  5. orchestrator/context_engineering/embeddings.py - Embedding generation

  6. orchestrator/api/documents.py - Document processing

  7. orchestrator/config.py - Configuration loader

Files Using ANTHROPIC_API_KEY (37 files found)

Critical Files:

  1. orchestrator/services/llm_provider.py - Already supports Anthropic

  2. orchestrator/api/chatbot_llm.py - Direct Anthropic client usage

  3. orchestrator/services/credential_resolver.py - Credential resolution

Current LLM Architecture

Orchestrator LLM Provider (services/llm_provider.py)

  • Structure: LLMManager → BaseLLMProvider → Provider implementations

  • Providers: OpenAI, Anthropic

  • Features: Async/sync support, tool calling, token tracking

  • Configuration: Environment variables + credential system fallback

  • Limitation: Only supports OpenAI/Anthropic, no Google/Azure/HuggingFace

LLM Service (Reference Implementation)

  • Structure: LLMClient (base) → Provider clients

  • Providers: OpenAI, Anthropic, Google, Azure, Bow (custom)

  • Interface: inference(), inference_stream(), test_connection()

  • Location: /backend/app/ai/llm/

Credential System (PRD-18)

  • Status: ✅ Fully implemented

  • LLM Credential Types: openai_api, anthropic_api, huggingface_api (exists!)

  • Storage: Encrypted database storage

  • Resolver: CredentialResolver with caching and fallback

  • UI: Full credential management UI in Settings

System Settings

  • Location: orchestrator/models/system_settings.py

  • Category: orchestrator_llm, codegraph

  • API: /api/system-settings endpoints

  • Frontend: OrchestratorLLMSettingsTab.tsx, CodeGraphSettingsTab.tsx

Target Architecture

Refactored LLM Provider Structure

Key Decision: Refactor the existing llm_provider.py instead of creating a new service.

Backward Compatibility: services/llm_provider.py will import from llm_provider/ to maintain existing imports.

Key Features

  1. Lazy Loading: Service initializes on first use, not at startup

  2. Credential-Based: Pulls API keys from credential system via settings

  3. Multi-Provider: OpenAI, Anthropic, Google, Azure, HuggingFace

  4. Unified Interface: inference(), inference_stream(), test_connection()

  5. Service-Specific: Orchestrator and CodeGraph can use different providers

  6. Graceful Failure: Returns None/error if credentials not configured
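
Features 1, 2, and 6 combine into one initialization pattern, sketched below. Both callables are hypothetical stand-ins: `resolve_key` for the credential system, `make_client` for a provider client constructor.

```python
from typing import Callable, Optional


class LazyLLMService:
    """Sketch: nothing is constructed at startup, the API key comes from a
    resolver callable, and a missing credential degrades to None instead
    of crashing the application."""

    def __init__(self,
                 resolve_key: Callable[[], Optional[str]],
                 make_client: Callable[[str], object]):
        self._resolve_key = resolve_key
        self._make_client = make_client
        self._client: Optional[object] = None  # not initialized at startup

    def client(self) -> Optional[object]:
        if self._client is None:
            key = self._resolve_key()
            if key is None:
                return None  # graceful failure: caller gets None, app stays up
            self._client = self._make_client(key)  # built on first use
        return self._client
```

Because the client is built on first access and cached, a misconfigured credential surfaces as a per-request error rather than a startup failure.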

Integration Points

1. System Settings Integration

  • Orchestrator LLM Setting: orchestrator_llm.provider, orchestrator_llm.model

  • CodeGraph LLM Setting: codegraph.provider, codegraph.model

  • Settings UI: Select provider from dropdown, linked to credentials

2. Credential Resolution Flow

3. Service-Specific Configuration

  • Orchestrator: Uses orchestrator_llm.provider + orchestrator_llm.model settings

  • CodeGraph: Uses codegraph.provider + codegraph.model settings

  • Other Services: Default to Orchestrator settings
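
The per-service lookup above can be sketched as follows. The flat settings dict keyed as `"<category>.<key>"` is an illustrative stand-in for the real system-settings store.

```python
# Services with a dedicated settings category use it; everything else
# inherits the Orchestrator's, per the list above.
DEDICATED_CATEGORIES = {"orchestrator_llm", "codegraph"}


def llm_settings_category(service: str) -> str:
    return service if service in DEDICATED_CATEGORIES else "orchestrator_llm"


def resolve_llm_config(settings: dict, service: str) -> dict:
    cat = llm_settings_category(service)
    return {
        "provider": settings.get(f"{cat}.provider"),
        "model": settings.get(f"{cat}.model"),
    }
```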

Implementation Plan

Phase 1: Refactor Existing LLM Provider Service

Step 1.1: Create Clients Directory Structure

Step 1.2: Move Existing Base Classes

File: orchestrator/services/llm_provider/clients/base.py

  • Move BaseLLMProvider from llm_provider.py

  • Move LLMConfig, LLMResponse dataclasses

  • Keep existing interface (backward compatible)
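
A rough shape of the two dataclasses after the move; the field names here are assumptions, not the exact contents of llm_provider.py.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class LLMConfig:
    """Sketch of the config dataclass moved into clients/base.py."""
    provider: str
    model: str
    api_key: Optional[str] = None
    temperature: float = 0.7
    max_tokens: int = 4096


@dataclass
class LLMResponse:
    """Sketch of the response dataclass."""
    content: str
    model: str
    input_tokens: int = 0    # token tracking is an existing provider feature
    output_tokens: int = 0
```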

Step 1.3: Split Provider Clients into Separate Files

Files:

  1. clients/openai_client.py - Move OpenAIProvider from llm_provider.py

  2. clients/anthropic_client.py - Move AnthropicProvider from llm_provider.py

  3. clients/google_client.py - NEW - Google Gemini client

  4. clients/azure_client.py - NEW - Azure OpenAI client

  5. clients/huggingface_client.py - NEW - HuggingFace Inference API client

Step 1.4: Refactor Main LLM Service

File: orchestrator/services/llm_provider/manager.py

  • Move LLMManager from llm_provider.py

  • Add lazy loading from system settings

  • Add per-service configuration support (orchestrator, codegraph, etc.)

  • Keep backward compatibility with existing API

Phase 2: Integrate with System Settings

Step 2.1: Update System Settings Model

File: orchestrator/models/system_settings.py

  • Ensure orchestrator_llm category has provider and model settings

  • Ensure codegraph category has provider and model settings

Step 2.2: Update OrchestratorLLMSettingsTab

File: frontend/components/settings/OrchestratorLLMSettingsTab.tsx

  • Provider dropdown: OpenAI, Anthropic, Google, Azure, HuggingFace

  • Model dropdown: Dynamic based on provider selection

  • Link to credential management (show warning if credential not configured)

  • Save settings to orchestrator_llm.provider and orchestrator_llm.model

Step 2.3: Update CodeGraphSettingsTab

File: frontend/components/settings/CodeGraphSettingsTab.tsx

  • Add LLM Provider Configuration section (same as OrchestratorLLM)

  • Provider dropdown: OpenAI, Anthropic, Google, Azure, HuggingFace

  • Model dropdown: Dynamic based on provider

  • Save to codegraph.provider and codegraph.model

Phase 3: Migrate Existing Services

Step 3.1: Update LLM Provider Service

File: orchestrator/services/llm_provider.py

  • Option A: Replace with UnifiedLLMService adapter (maintain backward compatibility)

  • Option B: Refactor to use UnifiedLLMService internally

  • Keep LLMManager interface for backward compatibility

Step 3.2: Update CodeGraph Service

File: orchestrator/services/codegraph_service.py

  • Replace direct OpenAI client with UnifiedLLMService

  • Use codegraph.provider setting for provider selection

  • Lazy load on first use

Step 3.3: Update CodeGraph API

File: orchestrator/api/codegraph.py

  • Remove get_openai_key() function

  • Update get_codegraph_service() to use UnifiedLLMService

Step 3.4: Update Chatbot LLM API

File: orchestrator/api/chatbot_llm.py

  • Replace direct Anthropic client with UnifiedLLMService

  • Use orchestrator_llm.provider setting (or chatbot-specific setting)

Phase 4: Update Other Services

Step 4.1: Document Processing

File: orchestrator/api/document_processing.py

  • Migrate to use UnifiedLLMService with Orchestrator settings

Step 4.2: Embeddings Service

File: orchestrator/context_engineering/embeddings.py

  • Migrate to use UnifiedLLMService for text embeddings

  • Support provider selection via settings

Step 4.3: RAG Service

File: orchestrator/services/rag_service.py

  • Migrate to use UnifiedLLMService

Phase 5: Add HuggingFace Support

Step 5.1: Create HuggingFace Client

File: orchestrator/services/llm_provider/clients/huggingface_client.py (per the Step 1.3 layout)

  • Use huggingface_hub or requests library

  • Support HuggingFace Inference API: https://api-inference.huggingface.co/models/{model}

  • API token from credential: development_huggingface.api_token
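
A minimal sketch of the client using requests (one of the two libraries named above). The `[{"generated_text": ...}]` response shape applies to text-generation models; other Inference API tasks return different payloads.

```python
import requests

HF_API_BASE = "https://api-inference.huggingface.co/models"


class HuggingFaceClient:
    """Minimal Inference API client sketch."""

    def __init__(self, api_token: str, model: str):
        self.model = model
        self.headers = {"Authorization": f"Bearer {api_token}"}

    @property
    def url(self) -> str:
        # https://api-inference.huggingface.co/models/{model}
        return f"{HF_API_BASE}/{self.model}"

    def inference(self, prompt: str, timeout: float = 60.0) -> str:
        resp = requests.post(self.url, headers=self.headers,
                             json={"inputs": prompt}, timeout=timeout)
        resp.raise_for_status()
        return resp.json()[0]["generated_text"]
```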

Step 5.2: Update Credential Type

Status: ✅ Already exists in credential_types_seed.json as huggingface_api

  • Verify credential type is seeded

  • Ensure it has api_token field

Phase 6: Testing & Validation

Step 6.1: Unit Tests

  • Test each client implementation

  • Test credential resolution

  • Test lazy loading

  • Test error handling

Step 6.2: Integration Tests

  • Test OrchestratorLLM settings flow

  • Test CodeGraph settings flow

  • Test credential → service flow

  • Test multi-provider scenarios

Step 6.3: User Journey Tests

  1. User creates OpenAI credential

  2. User selects OpenAI in OrchestratorLLM Settings

  3. User tests LLM functionality

  4. User switches to Anthropic

  5. User tests CodeGraph with different provider

File-by-File Migration Plan

Files Requiring Migration

High Priority (Core Services)

  1. services/llm_provider.py - Migrate to UnifiedLLMService

  2. services/codegraph_service.py - Use UnifiedLLMService

  3. api/codegraph.py - Remove direct API key usage

  4. api/chatbot_llm.py - Use UnifiedLLMService

Medium Priority (Document Services)

  1. api/document_processing.py - Migrate LLM calls

  2. api/documents.py - Migrate LLM calls

  3. services/rag_service.py - Migrate LLM calls

  4. context_engineering/embeddings.py - Migrate LLM calls

Low Priority (Other Services)

  1. services/nl_to_sql_service.py - Check LLM usage

  2. services/database_knowledge_service.py - Check LLM usage

  3. core/llm/orchestrator_llm.py - Check integration

  4. services/agent_factory.py - Check LLM usage

Migration Strategy per File

Pattern 1: Direct API Key Usage
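
The before/after shape of this pattern, sketched with an illustrative dict standing in for the CredentialResolver:

```python
import os
from typing import Optional


def old_api_key() -> str:
    # Before (Pattern 1): read the environment directly; raises KeyError
    # when OPENAI_API_KEY is unset.
    return os.environ["OPENAI_API_KEY"]


def new_api_key(credentials: dict) -> Optional[str]:
    # After: resolve from the credential system first, fall back to the
    # environment variable during the transition (the Risk 2 mitigation).
    return credentials.get("development_openai") or os.environ.get("OPENAI_API_KEY")
```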

Pattern 2: Credential Resolver (Already Migrated)

Pattern 3: LLMManager Usage
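
Pattern 3 callers keep the LLMManager surface while the internals delegate to the unified service (Step 3.1, Option B). In this sketch both classes are illustrative stand-ins, and the `generate()` method name is an assumption about the legacy interface.

```python
from typing import Optional


class UnifiedLLMService:
    """Illustrative stand-in for the refactored service."""

    def inference(self, prompt: str) -> str:
        return f"echo:{prompt}"


class LLMManager:
    """Backward-compat facade: the legacy entry point keeps its interface
    but delegates to the unified service internally."""

    def __init__(self, service: Optional[UnifiedLLMService] = None):
        self._service = service or UnifiedLLMService()

    def generate(self, prompt: str) -> str:
        return self._service.inference(prompt)
```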

Configuration Schema

System Settings Required

Orchestrator LLM Settings

CodeGraph LLM Settings
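
Assumed shape of the settings entries both tabs write; the actual values live in the system-settings store, and the model defaults here are placeholders.

```python
# Category.key → default, mirroring the two settings groups above.
LLM_SETTINGS = {
    # Orchestrator LLM Settings
    "orchestrator_llm.provider": "openai",  # openai|anthropic|google|azure|huggingface
    "orchestrator_llm.model": "gpt-4",
    # CodeGraph LLM Settings
    "codegraph.provider": "openai",
    "codegraph.model": "gpt-4",
}
```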

Credential Naming Convention

  • Development: development_openai, development_anthropic, etc.

  • Production: production_openai, production_anthropic, etc.

  • Pattern: {environment}_{provider_type}
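
The convention reduces to a one-line helper:

```python
def credential_name(environment: str, provider_type: str) -> str:
    """Apply the {environment}_{provider_type} naming convention."""
    return f"{environment}_{provider_type}"
```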

Error Handling Strategy

Lazy Loading Errors

Missing Credentials

  • Show warning in UI: "OpenAI credential not configured. Add it in Settings > Credentials"

  • Graceful degradation: Service returns error, doesn't crash application

Missing Settings

  • Default to openai provider, gpt-4 model

  • Log warning: "Using default LLM configuration"
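
Sketched as a helper, with the settings dict standing in for the system-settings lookup:

```python
import logging

logger = logging.getLogger(__name__)

DEFAULT_PROVIDER = "openai"
DEFAULT_MODEL = "gpt-4"


def effective_llm_settings(settings: dict) -> tuple:
    """Fall back to the documented defaults when settings are missing and
    emit the warning described above."""
    provider = settings.get("provider")
    model = settings.get("model")
    if not provider or not model:
        logger.warning("Using default LLM configuration")
    return provider or DEFAULT_PROVIDER, model or DEFAULT_MODEL
```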

Testing Checklist

Unit Tests

Integration Tests

User Journey Tests

Deployment Checklist

Pre-Deployment

Deployment Steps

  1. Deploy new unified_llm_service module

  2. Migrate existing services (one at a time)

  3. Test each service after migration

  4. Update frontend settings tabs

  5. Verify credential flow

  6. Test end-to-end user journey

Post-Deployment

Success Criteria

Functional

  • ✅ All LLM services use UnifiedLLMService

  • ✅ Orchestrator and CodeGraph can use different providers

  • ✅ Credentials stored securely (PRD-18)

  • ✅ Settings UI allows provider selection

  • ✅ Lazy loading doesn't fail on startup

  • ✅ HuggingFace support added

Non-Functional

  • ✅ No direct API key usage in codebase

  • ✅ Graceful error handling

  • ✅ Backward compatibility maintained (LLMManager still works)

  • ✅ Clear user journey (credential → settings → usage)

Timeline Estimate

  • Phase 1 (Unified Service): 2-3 days

  • Phase 2 (Settings Integration): 1-2 days

  • Phase 3 (Core Services Migration): 2-3 days

  • Phase 4 (Other Services): 1-2 days

  • Phase 5 (HuggingFace): 1 day

  • Phase 6 (Testing): 2-3 days

Total: ~9-14 days

Risks & Mitigations

Risk 1: Breaking Changes

  • Mitigation: Maintain backward compatibility, gradual migration

Risk 2: Credential Resolution Failures

  • Mitigation: Fallback to environment variables during transition

Risk 3: Performance Impact

  • Mitigation: Lazy loading, caching, async operations

Risk 4: Missing Provider Support

  • Mitigation: Start with core providers (OpenAI, Anthropic), add others incrementally

Next Steps

  1. ✅ Review and approve plan

  2. Create UnifiedLLMService structure

  3. Implement provider clients (including HuggingFace)

  4. Integrate with system settings

  5. Migrate core services

  6. Update frontend components

  7. Test and validate

  8. Deploy incrementally


Document Version: 1.0 | Last Updated: 2025-01-25 | Status: Ready for Implementation
