Agents

Purpose and Scope

This document covers the Agent Management System in Automatos AI, including agent creation, configuration, lifecycle management, and capability assignment. Agents are the core AI entities that execute tasks, coordinate workflows, and interact with external tools.

For information about agent execution and orchestration, see Universal Router. For agent-to-agent coordination patterns, see Agent Coordination. For workflow recipe execution using agents, see Workflows & Recipes.

Agent Entity Model

Agents are represented by the Agent SQLAlchemy model with the following core structure:

| Field | Type | Description |
| --- | --- | --- |
| id | Integer | Primary key |
| name | String | Unique agent name within workspace |
| description | Text | Agent purpose and capabilities |
| agent_type | String | Database type (e.g., code_architect, custom) |
| marketplace_category | String | UI category name (e.g., DevOps, Custom) |
| status | String | active, inactive, maintenance |
| configuration | JSONB | Priority, concurrency, resource limits |
| tags | ARRAY | Searchable keywords |
| model_config | JSONB | LLM provider, model, parameters |
| model_usage_stats | JSONB | Token counts, costs, request metrics |
| performance_metrics | JSONB | Success rate, tasks completed |
| workspace_id | UUID | Multi-tenant isolation |
| created_by | String | Creator identifier |
| created_at | Timestamp | Creation time |
| updated_at | Timestamp | Last modification time |

Agent Data Model with Relationships

Diagram: Agent Database Schema with SQLAlchemy Models

Agent Types and Categories

Agents use a dual-type system for flexibility:

Database Types (`agent_type`)

Backend database values stored in the agent_type column:

code_architect - Software architecture and design
security_expert - Security analysis and audits
performance_optimizer - Performance tuning
data_analyst - Data processing and insights
infrastructure_manager - Deployment and infrastructure
custom - User-defined agents
system - System-level operations
specialized - Domain-specific expertise

UI Categories (`marketplace_category`)

User-facing categories displayed in the frontend:

Personal Assistant
Customer Support
DevOps
Social Media
Accounting
E-commerce
Content Creation
HR
Data Analysis
Custom

Mapping Logic: The frontend uses CATEGORY_TO_DB_MAP and DB_TO_CATEGORY_MAP to translate between UI categories and database types. This allows specialized database types (e.g., security_expert) to preserve their identity while displaying as a generic UI category (e.g., Custom).
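
The two maps live in the TypeScript frontend; the Python sketch below only illustrates the round-trip idea. The individual entries and the helper names `to_db_type` and `to_category` are hypothetical. Only the map names CATEGORY_TO_DB_MAP and DB_TO_CATEGORY_MAP come from the description above.

```python
from typing import Optional

# Hypothetical entries; the real frontend maps may differ.
CATEGORY_TO_DB_MAP = {
    "DevOps": "infrastructure_manager",
    "Data Analysis": "data_analyst",
    "Custom": "custom",
}

DB_TO_CATEGORY_MAP = {
    "infrastructure_manager": "DevOps",
    "data_analyst": "Data Analysis",
    "custom": "Custom",
    # Specialized types without a dedicated UI category display as "Custom".
    "security_expert": "Custom",
    "code_architect": "Custom",
}


def to_category(db_type: str) -> str:
    """Translate a database type into the category shown in the UI."""
    return DB_TO_CATEGORY_MAP.get(db_type, "Custom")


def to_db_type(category: str, existing_db_type: Optional[str] = None) -> str:
    """Translate a UI category into a database type.

    If the agent already has a specialized type that displays as this
    category, keep it so its identity survives a round trip through the UI.
    """
    if existing_db_type and to_category(existing_db_type) == category:
        return existing_db_type
    return CATEGORY_TO_DB_MAP.get(category, "custom")
```

With these assumed entries, an agent stored as security_expert is shown as Custom, and saving it again with the Custom category keeps security_expert rather than overwriting it with custom.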

Agent Creation

Diagram: Agent Creation Flow with API Endpoints

Agent Creation API

Endpoint: POST /api/agents

Request Payload:

{
  "name": "Research Assistant",
  "agent_type": "custom",
  "marketplace_category": "Personal Assistant",
  "description": "Helps with research and documentation",
  "tool_ids": [101, 205],
  "tags": ["research", "writing", "pdf"],
  "configuration": {
    "specializations": ["research", "summarization"]
  }
}

Response:

{
  "id": 42,
  "name": "Research Assistant",
  "agent_type": "custom",
  "status": "active",
  "tools": [...],
  "plugins": [],
  "created_at": "2025-01-15T10:30:00Z"
}

Backend Flow:

Name Uniqueness Check - orchestrator/api/agents.py:369-372
Tag Normalization - orchestrator/api/agents.py:374 uses _normalize_tags()
Agent Creation - orchestrator/api/agents.py:377-388
Skill Assignment (if provided) - orchestrator/api/agents.py:392-401
Tool Assignment - orchestrator/api/agents.py:408-421 via AgentAppAssignment
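
The first steps of the backend flow can be sketched with an in-memory store standing in for the database. This is a minimal illustration, not the code in orchestrator/api/agents.py: the behavior of `normalize_tags` (trim, lowercase, de-duplicate) is an assumption about what _normalize_tags() does, and `create_agent`, `DuplicateAgentName`, and the dict-based store are hypothetical.

```python
from typing import Dict, Iterable, List


def normalize_tags(tags: Iterable[str]) -> List[str]:
    """Assumed behavior: trim, lowercase, drop empties, de-duplicate in order."""
    seen = set()
    result = []
    for tag in tags:
        cleaned = tag.strip().lower()
        if cleaned and cleaned not in seen:
            seen.add(cleaned)
            result.append(cleaned)
    return result


class DuplicateAgentName(ValueError):
    """Raised when the name is already taken in the workspace."""


def create_agent(store: Dict[str, dict], payload: dict) -> dict:
    """In-memory stand-in: uniqueness check, tag normalization, then insert."""
    name = payload["name"]
    if name in store:
        raise DuplicateAgentName(name)
    agent = {
        "id": len(store) + 1,
        "name": name,
        # Defaulting to "custom" is an assumption made for this sketch.
        "agent_type": payload.get("agent_type", "custom"),
        "status": "active",
        "tags": normalize_tags(payload.get("tags", [])),
        "tool_ids": list(payload.get("tool_ids", [])),
    }
    store[name] = agent
    return agent
```

Running the uniqueness check before any insert means a rejected request leaves no partial rows behind for skills or tools.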

Agent Configuration

The AgentConfigurationModal organizes agent settings into seven tabs:

Configuration Tabs Structure

Diagram: AgentConfigurationModal Component Hierarchy

General Configuration

Basic agent properties and operational settings:

| Field | Type | Description |
| --- | --- | --- |
| priority_level | Enum | low, medium, high, critical |
| max_concurrent_tasks | Integer | Max parallel task execution (1-10) |
| auto_start | Boolean | Start agent automatically on system boot |
| retry_attempts | Integer | Failed task retry count (0-5) |
| timeout_seconds | Integer | Task timeout in seconds (60-3600) |

Resource Limits:

{
  "resource_limits": {
    "memory_mb": 1024,
    "cpu_percent": 50,
    "network_bandwidth": 100
  }
}

Environment & Logging:

environment: development, staging, production
logging_level: debug, info, warning, error
performance_monitoring: Boolean
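
The ranges listed for the general settings lend themselves to a small validator. The sketch below is illustrative only: the function name `validate_general_config` and the error wording are hypothetical, and whether the backend enforces these bounds in the same way is not stated in this page.

```python
PRIORITY_LEVELS = {"low", "medium", "high", "critical"}

# Inclusive (min, max) bounds taken from the General Configuration fields.
INTEGER_RANGES = {
    "max_concurrent_tasks": (1, 10),
    "retry_attempts": (0, 5),
    "timeout_seconds": (60, 3600),
}


def validate_general_config(config: dict) -> list:
    """Return a list of human-readable problems; an empty list means valid."""
    errors = []
    priority = config.get("priority_level")
    if priority is not None and priority not in PRIORITY_LEVELS:
        errors.append(f"priority_level must be one of {sorted(PRIORITY_LEVELS)}")
    for field_name, (low, high) in INTEGER_RANGES.items():
        value = config.get(field_name)
        if value is None:
            continue  # Absent fields are left to whatever defaults apply.
        # bool is a subclass of int in Python, so reject it explicitly.
        if isinstance(value, bool) or not isinstance(value, int):
            errors.append(f"{field_name} must be an integer")
        elif not low <= value <= high:
            errors.append(f"{field_name} must be between {low} and {high}")
    return errors
```

Returning every problem at once, rather than raising on the first, lets a form show all invalid fields in a single pass.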

Model Configuration (PRD-15)

LLM provider and parameter settings are stored in the Agent.model_config JSONB field:

{
  "provider": "openai",
  "model_id": "gpt-4",
  "temperature": 0.7,
  "max_tokens": 2000,
  "top_p": 1.0,
  "frequency_penalty": 0.0,
  "presence_penalty": 0.0,
  "fallback_model_id": "gpt-3.5-turbo"
}

ModelSelector Component:

The ModelSelector component filters available models by provider and displays model metadata (cost, context window, capabilities).

Update Endpoint: PUT /api/models/agents/{agent_id}/config

Heartbeat Configuration (PRD-55)

Autonomous agent check-ins for proactive monitoring:

{
  "enabled": true,
  "interval_minutes": 60,
  "inherit_active_hours": true,
  "active_hours_start": "08:00",
  "active_hours_end": "20:00",
  "prompt": "Check for critical system alerts and pending tasks",
  "auto_act": false,
  "report_to": "orchestrator"
}

Endpoints:

GET /api/heartbeat/agents/{agent_id}/config - Fetch config
PUT /api/heartbeat/agents/{agent_id}/config - Update config
POST /api/heartbeat/agents/{agent_id}/run - Trigger manual heartbeat
GET /api/heartbeat/agents/{agent_id}/last - Last result
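
How a scheduler might interpret the heartbeat config can be sketched as follows. This is an assumption-laden illustration: the function names are hypothetical, the window is treated as start-inclusive and end-exclusive, and inherit_active_hours is ignored because its exact semantics are not described here.

```python
from datetime import datetime, time, timedelta
from typing import Optional


def _parse_hhmm(value: str) -> time:
    """Parse an "HH:MM" string such as "08:00"."""
    hours, minutes = value.split(":")
    return time(int(hours), int(minutes))


def within_active_hours(config: dict, now: datetime) -> bool:
    """True when `now` falls inside the configured active window."""
    start = _parse_hhmm(config["active_hours_start"])
    end = _parse_hhmm(config["active_hours_end"])
    current = now.time()
    if start <= end:
        return start <= current < end
    # The window wraps past midnight, e.g. 22:00 to 06:00.
    return current >= start or current < end


def is_heartbeat_due(config: dict, now: datetime, last_run: Optional[datetime]) -> bool:
    """Decide whether a heartbeat should fire at `now`."""
    if not config.get("enabled"):
        return False
    if not within_active_hours(config, now):
        return False
    if last_run is None:
        return True  # Never ran before, so run at the first opportunity.
    return now - last_run >= timedelta(minutes=config["interval_minutes"])
```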

Agent Personas (US-021)

Personas define agent identity, voice, and behavior through system prompts.

Persona Modes

Persona API

Fetch Agent Persona: GET /api/agents/{agent_id}/persona

Response:

{
  "persona_id": "uuid-123",
  "persona_name": "Professional Analyst",
  "system_prompt": "You are a professional data analyst...",
  "use_custom_persona": false,
  "suggested_temperature": 0.5
}

Update Persona: PUT /api/agents/{agent_id}/persona

Request (Predefined):

{
  "persona_id": "uuid-123",
  "use_custom": false
}

Request (Custom):

{
  "use_custom": true,
  "custom_prompt": "You are a friendly assistant who explains complex topics in simple terms..."
}

Persona Library

Personas are stored in the system_prompts table (reusing the prompt management infrastructure) and filtered by category to match agent types.

Category Filtering: When creating a DevOps agent, only personas with category = 'devops' are shown. Custom agents see all personas.

Pre-fill Behavior: Switching from "Predefined" to "Custom" mode pre-fills the textarea with the selected persona's system prompt for easy editing.
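
The filtering and pre-fill rules can be expressed in a few lines. The sketch below assumes persona rows are plain dicts with `category` and `system_prompt` keys and that categories are stored in lowercase (as in the 'devops' example). The function names are hypothetical, and the real logic lives in the frontend.

```python
from typing import List, Optional


def visible_personas(personas: List[dict], agent_category: str) -> List[dict]:
    """Custom agents see every persona; other agents see only their category."""
    wanted = agent_category.lower()
    if wanted == "custom":
        return list(personas)
    return [p for p in personas if p["category"] == wanted]


def prefill_custom_prompt(selected: Optional[dict]) -> str:
    """Text placed in the textarea when switching from Predefined to Custom."""
    return selected["system_prompt"] if selected else ""
```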

Capability Assignment

Agents gain functionality through three mechanisms: Skills, Plugins, and Tools.

Assignment Architecture

Skills Assignment

Add Skill: POST /api/agents/{agent_id}/skills

Request:

{
  "skill_ids": [10, 25, 42]
}

Skills are stored in a many-to-many relationship via the agent_skills join table. At runtime, the SkillLoader fetches skill content progressively (metadata → core → resources) to optimize token usage.

Plugins Assignment

Fetch Workspace Plugins: GET /api/workspaces/{workspace_id}/plugins

Fetch Agent Plugins: GET /api/agents/{agent_id}/plugins

Update Assignment: PUT /api/agents/{agent_id}/plugins

Request:

{
  "plugin_ids": ["plugin-uuid-1", "plugin-uuid-2"]
}

Three-Tier Enablement:

Global Approval - Admin approves plugin in marketplace (marketplace_plugins.status = 'published')
Workspace Enable - Workspace admin enables plugin (workspace_plugins entry)
Agent Assignment - Agent assigned specific plugins (agent_assigned_plugins entry)

Only workspace-enabled plugins appear in the agent configuration UI. Plugin content is cached in Redis with a 1-hour TTL.
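
The three tiers behave like successive set intersections. The sketch below models each tier as a set of plugin IDs; the function names are hypothetical and the real checks are database queries against the tables named above.

```python
from typing import Set


def assignable_plugins(published: Set[str], workspace_enabled: Set[str]) -> Set[str]:
    """Plugins the agent configuration UI may offer: tiers 1 and 2 both pass."""
    return published & workspace_enabled


def effective_plugins(
    published: Set[str], workspace_enabled: Set[str], agent_assigned: Set[str]
) -> Set[str]:
    """Plugins usable by an agent: all three tiers must pass."""
    return assignable_plugins(published, workspace_enabled) & agent_assigned
```

Modeling it this way makes one consequence explicit: if a workspace admin disables a plugin, it drops out of the effective set even when the agent-level assignment row still exists.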

Token Estimation: The UI displays the total token estimate for the assigned plugins:

const assignedTokenEstimate = workspacePlugins
  .filter(p => assignedPluginIds.has(p.plugin_id))
  .reduce((sum, p) => sum + (p.token_estimate || 0), 0)

Tools Assignment

Tools are Composio integrations (Slack, GitHub, Gmail, etc.). Only connected tools are assignable to agents.

Tool Resolution Logic:

Stable Tool ID: The frontend uses the stableId() hash function to generate negative IDs for apps that have no database cache entry. The backend matches these hashes via _stable_tool_id().

Update Tools: PUT /api/agents/{agent_id}

Request:

{
  "tool_ids": [101, 205]
}

The backend converts the IDs to app names, then inserts or updates AgentAppAssignment rows:

INSERT INTO agent_app_assignments (agent_id, app_name, is_active, priority, config)
VALUES (42, 'SLACK', true, 0, '{}')

Agent Lifecycle & Status

Status States

| Status | Description | Icon | Color |
| --- | --- | --- | --- |
| active | Agent running, accepting tasks | CheckCircle | Success green |
| inactive | Agent stopped, no task execution | Clock | Muted gray |
| maintenance | Agent undergoing updates/repairs | Settings | Warning yellow |
| paused | Agent paused, tasks queued | Pause | Primary blue |

Status Control Flow

Impact Analysis

When changing agent status, the system analyzes:

Active Workflows - Workflows currently using the agent
Queued Tasks - Tasks waiting for the agent
Dependent Agents - Other agents that depend on this agent
System Impact - Performance degradation estimate

Shutdown Options:

Immediate - Stop agent immediately (may interrupt tasks)
Graceful - Finish current tasks before stopping
Scheduled - Schedule shutdown after workflow completion

Required Confirmations:

Acknowledge active workflows will be affected
Confirm backup/recovery plan in place
Confirm dependent systems have been notified

Agent Runtime Architecture

Agents are instantiated through the AgentFactory class, which creates AgentRuntime instances with complete execution context.

Agent Factory Initialization

The AgentFactory class (orchestrator/modules/agents/factory/agent_factory.py) manages agent lifecycle:

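from typing import Dict, Optional

from sqlalchemy.orm import Session  # assumed source of the Session type


class AgentFactory:
    def __init__(self, db_session: Optional[Session] = None):
        self.db_session = db_session
        # Runtime instances keyed by agent ID.
        self.active_agents: Dict[int, AgentRuntime] = {}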

Key Methods:

activate_agent(agent_id, use_system_llm=False) - Creates runtime instance
_build_tools_prompt(required_tools) - Generates tool usage instructions
_build_skill_tool_schemas(agent_skills) - Extracts skill tools
_ensure_llm_provider() - Lazy LLM initialization

AgentRuntime Dataclass

The AgentRuntime dataclass represents an active agent instance:

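from dataclasses import dataclass, field
from typing import Any, Dict, List, Optional


@dataclass
class AgentRuntime:
    agent_id: int
    metadata: AgentMetadata
    llm_manager: LLMManager
    lifecycle_state: AgentLifecycle
    execution_count: int = 0
    total_tokens_used: int = 0
    # Fields after a defaulted field need defaults too, and mutable
    # defaults require default_factory. Empty containers are assumed here.
    performance_metrics: Dict[str, Any] = field(default_factory=dict)
    memory: List[Dict[str, Any]] = field(default_factory=list)
    tools: List[Dict[str, Any]] = field(default_factory=list)
    tool_executor: Any = None  # UnifiedToolExecutor
    workspace_id: Optional[Any] = None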

Runtime Assembly Process

Diagram: Agent Activation Flow with Code Entities

Tool Schema Generation

Agents receive tool schemas through two mechanisms:

1. Skill-Based Tools (_build_skill_tool_schemas)

Extracts executable tools from assigned skills' tools_schema JSONB field:

def _build_skill_tool_schemas(agent_skills: List) -> List[Dict]:
    tools = []
    for skill in agent_skills:
        if not hasattr(skill, 'tools_schema') or not skill.tools_schema:
            continue
        skill_tools = skill.tools_schema.get('tools', [])
        for tool_def in skill_tools:
            tools.append({
                "type": "function",
                "function": {
                    "name": tool_def.get('name'),
                    "description": tool_def.get('description'),
                    "parameters": tool_def.get('parameters')
                }
            })
    return tools

2. Composio Integration Tools

Via ComposioToolService.get_tools_for_step():

Strategy A (Primary): SDK semantic search for relevant actions
Strategy B (Fallback): Hint-based mega-tool with composio_execute function

System Prompt Assembly in Chat Service

The StreamingChatService.stream_response_with_agent() method assembles the final context:

Diagram: System Prompt Construction Pipeline

Prompt Assembly Sequence:

Base System Prompt - From SmartChatIntegration.prepare()
Agent Identity - Name, description, agent_type
Persona - Custom or predefined system prompt
Skill Summaries - Brief descriptions from agent.agent_skills
Plugin Content - From PluginContentCache (if assigned)
Tool Definitions - OpenAI function calling schemas
Execution Policy - Multi-step task handling instructions
Composio Scope - Available app names for direct action calls
Memory Context - Retrieved from Mem0
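
The ordering above can be pictured as a fixed list of named sections joined in sequence. This is a simplified sketch, not the logic of stream_response_with_agent(): the section keys and the function `assemble_system_prompt` are hypothetical, and in practice tool definitions are usually passed to the LLM as structured schemas rather than as prompt text.

```python
# Section order taken from the Prompt Assembly Sequence; key names are hypothetical.
SECTION_ORDER = [
    "base_system_prompt",
    "agent_identity",
    "persona",
    "skill_summaries",
    "plugin_content",
    "tool_definitions",
    "execution_policy",
    "composio_scope",
    "memory_context",
]


def assemble_system_prompt(sections: dict) -> str:
    """Join the non-empty sections in the documented order."""
    parts = []
    for key in SECTION_ORDER:
        text = (sections.get(key) or "").strip()
        if text:  # Skip sections that are missing, None, or blank.
            parts.append(text)
    return "\n\n".join(parts)
```

Keeping the order in one list means a section such as plugin content can be absent for an agent without shifting the position of anything that follows it.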

Special Case: CTO Agent Prompt Override (PRD-67)

For agents with slug='auto-cto':

if _is_cto_agent:
    from consumers.chatbot.cto_prompt_builder import CtoPromptBuilder
    _cto_prompt = CtoPromptBuilder.build(
        soul_document=_soul,
        architecture_context=_arch_ctx,
        memories=_cto_memories,
        tool_names=[...],
        platform_state=_platform_state,
    )
    llm_messages[0]["content"] = _cto_prompt

Tool Execution Pipeline

Diagram: Tool Call Execution Flow with Code Classes

ToolExecutionTracker Deduplication:

The ToolExecutionTracker class prevents infinite loops through:

Exact Matching - Detects repeated calls by hashing (tool_name, args_hash)
Semantic Similarity - For search tools, compares query strings against a 75% similarity threshold
Retry Limits - Caps executions per tool (e.g., composio_execute: 2, read_file: 3)

SEARCH_TOOLS = {
    'search_knowledge', 'semantic_search', 'search_codebase',
    'search_tables', 'search_images', 'search_formulas'
}

TOOL_RETRY_LIMITS = {
    'composio_execute': 2,
    'search_knowledge': 2,
    'read_file': 3,
    'default': 3
}
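
The three checks can be combined in a small tracker. The sketch below is not the ToolExecutionTracker implementation: the class name `ToolCallTracker` is hypothetical, the constants are trimmed copies of the ones above, and difflib's SequenceMatcher stands in for whatever similarity measure the real tracker uses.

```python
import hashlib
import json
from collections import Counter
from difflib import SequenceMatcher

SEARCH_TOOLS = {"search_knowledge", "semantic_search", "search_codebase"}
TOOL_RETRY_LIMITS = {"composio_execute": 2, "search_knowledge": 2, "read_file": 3, "default": 3}
SIMILARITY_THRESHOLD = 0.75


class ToolCallTracker:
    """Hypothetical stand-in showing the three deduplication checks."""

    def __init__(self):
        self._seen = set()       # (tool_name, args_hash) pairs already run
        self._counts = Counter() # executions per tool
        self._queries = {}       # tool_name -> earlier search queries

    def should_execute(self, tool_name: str, args: dict) -> bool:
        # 1. Retry limit: stop once the per-tool budget is spent.
        limit = TOOL_RETRY_LIMITS.get(tool_name, TOOL_RETRY_LIMITS["default"])
        if self._counts[tool_name] >= limit:
            return False
        # 2. Exact match: same tool with the same arguments.
        payload = json.dumps(args, sort_keys=True).encode()
        key = (tool_name, hashlib.sha256(payload).hexdigest())
        if key in self._seen:
            return False
        # 3. Near-duplicate search queries.
        query = str(args.get("query", "")).lower()
        if tool_name in SEARCH_TOOLS and query:
            for previous in self._queries.get(tool_name, []):
                if SequenceMatcher(None, previous, query).ratio() >= SIMILARITY_THRESHOLD:
                    return False
            self._queries.setdefault(tool_name, []).append(query)
        self._seen.add(key)
        self._counts[tool_name] += 1
        return True
```

Serializing the arguments with sort_keys=True before hashing makes the exact-match check independent of key order, so {"a": 1, "b": 2} and {"b": 2, "a": 1} count as the same call.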

Agent Analytics & Usage Tracking (PRD-54)

Every agent execution generates usage records for cost tracking and optimization.

Usage Tracking Flow

Usage Record Schema

CREATE TABLE llm_usage (
  id SERIAL PRIMARY KEY,
  workspace_id UUID NOT NULL,
  agent_id INTEGER REFERENCES agents(id),
  execution_id VARCHAR,
  model_id VARCHAR NOT NULL,
  provider VARCHAR NOT NULL,
  tier VARCHAR, -- 'direct', 'router', 'fallback'
  request_type VARCHAR DEFAULT 'chat',
  input_tokens INTEGER NOT NULL,
  output_tokens INTEGER NOT NULL,
  total_tokens INTEGER NOT NULL,
  input_cost NUMERIC(12,8) NOT NULL,
  output_cost NUMERIC(12,8) NOT NULL,
  total_cost NUMERIC(12,8) NOT NULL,
  latency_ms INTEGER,
  status VARCHAR DEFAULT 'success',
  is_byok BOOLEAN DEFAULT false,
  error_message TEXT,
  created_at TIMESTAMP DEFAULT NOW()
);

Agent Usage Statistics

Aggregated statistics are stored in the Agent.model_usage_stats JSONB field:

{
  "total_tokens": 145820,
  "total_cost": 3.42,
  "total_requests": 238,
  "avg_tokens_per_request": 612,
  "last_used_at": "2025-01-15T14:32:00Z",
  "by_model": {
    "gpt-4": {
      "requests": 180,
      "tokens": 110400,
      "cost": 2.76
    },
    "gpt-3.5-turbo": {
      "requests": 58,
      "tokens": 35420,
      "cost": 0.66
    }
  }
}
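
The figures in the sample are internally consistent: the per-model requests, tokens, and costs sum to the totals, and 145820 tokens over 238 requests floors to 612. One way such an aggregate could be maintained is sketched below. The function `record_usage` is hypothetical, and the use of integer division for the average is an assumption inferred from the sample.

```python
def record_usage(stats: dict, model_id: str, tokens: int, cost: float, used_at: str) -> dict:
    """Fold one LLM call into an agent's aggregated usage statistics."""
    stats["total_tokens"] = stats.get("total_tokens", 0) + tokens
    stats["total_cost"] = round(stats.get("total_cost", 0.0) + cost, 8)
    stats["total_requests"] = stats.get("total_requests", 0) + 1
    # Integer division matches the sample: 145820 // 238 == 612.
    stats["avg_tokens_per_request"] = stats["total_tokens"] // stats["total_requests"]
    stats["last_used_at"] = used_at
    per_model = stats.setdefault("by_model", {}).setdefault(
        model_id, {"requests": 0, "tokens": 0, "cost": 0.0}
    )
    per_model["requests"] += 1
    per_model["tokens"] += tokens
    per_model["cost"] = round(per_model["cost"] + cost, 8)
    return stats
```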

Zero-Impact Design: The UsageTracker uses a separate database session (SessionLocal()) to ensure tracking failures never break agent execution.
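
The isolation idea amounts to wrapping the tracking call so that no exception escapes into the execution path. The sketch below shows only that wrapper: `track_safely` is a hypothetical name, and the real UsageTracker additionally opens its own database session, which is omitted here.

```python
import logging

logger = logging.getLogger("usage_tracker")


def track_safely(record_fn, *args, **kwargs) -> bool:
    """Run a tracking callable and swallow any failure it raises.

    Returns True when tracking succeeded and False when it failed, so the
    caller can continue with agent execution either way.
    """
    try:
        record_fn(*args, **kwargs)
        return True
    except Exception:
        # Log with traceback, but never propagate into agent execution.
        logger.exception("usage tracking failed; continuing")
        return False
```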

Agent API Reference

Core Endpoints

| Method | Endpoint | Description |
| --- | --- | --- |
| GET | /api/agents | List agents (filtered by workspace) |
| GET | /api/agents/{agent_id} | Get single agent with relationships |
| POST | /api/agents | Create new agent |
| PUT | /api/agents/{agent_id} | Update agent configuration |
| DELETE | /api/agents/{agent_id} | Delete agent and relationships |
| GET | /api/agents/types | List available agent types |
| GET | /api/agents/stats | Workspace-wide agent statistics |

Status & Execution

| Method | Endpoint | Description |
| --- | --- | --- |
| GET | /api/agents/{agent_id}/status | Current status and workload |
| POST | /api/agents/{agent_id}/execute | Trigger agent execution |
| POST | /api/agents/bulk | Bulk agent creation |

Relationships

| Method | Endpoint | Description |
| --- | --- | --- |
| GET | /api/agents/{agent_id}/skills | List agent skills |
| POST | /api/agents/{agent_id}/skills | Add skills to agent |
| GET | /api/agents/{agent_id}/plugins | List assigned plugins |
| PUT | /api/agents/{agent_id}/plugins | Update plugin assignments |
| GET | /api/agents/{agent_id}/persona | Get agent persona |
| PUT | /api/agents/{agent_id}/persona | Update persona |

Query Parameters

List Agents (GET /api/agents):

| Parameter | Type | Description |
| --- | --- | --- |
| skip | Integer | Pagination offset (default: 0) |
| limit | Integer | Page size (default: 100, max: 1000) |
| status | Enum | Filter by status (active, inactive, maintenance) |
| agent_type | Enum | Filter by type (code_architect, custom, etc.) |
| priority_level | Enum | Filter by priority (low, medium, high, critical) |
| search | String | Search in name or description |

Response Format:

All endpoints return workspace-filtered results. The workspace_id is extracted from the JWT token or X-Workspace-ID header.
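
A client can build the list request from the documented parameters with the standard library alone. In this sketch the helper `build_list_agents_url` and the host name are hypothetical, and clamping limit on the client side is a convenience assumed here; the server's own handling of out-of-range values is not described on this page.

```python
from urllib.parse import urlencode

MAX_LIMIT = 1000  # Documented maximum page size.
ALLOWED_FILTERS = ("status", "agent_type", "priority_level", "search")


def build_list_agents_url(base_url: str, *, skip: int = 0, limit: int = 100, **filters) -> str:
    """Build a GET /api/agents URL from the documented query parameters."""
    params = {
        "skip": max(skip, 0),
        "limit": min(max(limit, 1), MAX_LIMIT),
    }
    for key in ALLOWED_FILTERS:
        value = filters.get(key)
        if value:  # Omit filters that were not supplied.
            params[key] = value
    return f"{base_url.rstrip('/')}/api/agents?{urlencode(params)}"
```

For example, requesting limit=5000 with status="active" against https://example.test yields a URL whose query string carries limit=1000 and status=active.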

Sources:

orchestrator/api/agents.py:32-744

Frontend Components

Component Hierarchy

React Query Hooks

All API interactions use React Query for caching and state management:

// Query hooks (auto-caching)
useAgents() // List all agents
useAgent(agentId) // Single agent (polls every 10s)
useAgentStats() // Workspace stats (polls every 30s)
useAgentTypes() // Agent types (cached 5 min)
useAgentConfig(agentId) // Configuration
useAgentSkills(agentId) // Skills (polls every 30s)

// Mutation hooks (auto-invalidation)
useCreateAgent() // Create + invalidate agents list
useUpdateAgent() // Update + invalidate agent
useDeleteAgent() // Delete + remove from cache
useUpdateAgentConfig() // Update config
useAddSkillToAgent() // Add skill
useRemoveSkillFromAgent() // Remove skill
useStartAgent() // Start agent
useStopAgent() // Stop agent

Cache Keys: All queries use workspace-scoped cache keys to prevent cross-workspace data leakage:

agentQueryKeys.agents // ['agents']
agentQueryKeys.agent(id) // ['agents', '42']
agentQueryKeys.agentConfig(id) // ['agents', '42', 'configuration']

Invalidation Strategy: Mutations automatically invalidate related queries:

// Creating an agent invalidates:
queryClient.invalidateQueries({ queryKey: ['agents'] })
queryClient.invalidateQueries({ queryKey: ['agents', 'stats'] })

Multi-Tenancy & Workspace Isolation

All agent operations are workspace-scoped via the workspace_id foreign key.

Workspace Context Flow

Workspace Resolution:

JWT Claims - workspace_id extracted from Clerk JWT
Header Override - X-Workspace-ID header (validated against user permissions)
Admin Override - Special __all__ sentinel for platform-wide queries

Query Filtering: All agent queries include workspace filter:

query = db.query(Agent).filter(Agent.workspace_id == ctx.workspace_id)

Admin Access: Admin users with X-Workspace-ID: __all__ header can query across all workspaces (e.g., for platform analytics).
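
The resolution order can be sketched as a single function that returns the effective workspace or refuses the request. Everything named here is hypothetical (`resolve_workspace`, `WorkspaceAccessDenied`, the `permitted` set, the `is_admin` flag) and the agents are plain dicts rather than SQLAlchemy rows. Only the header name and the __all__ sentinel come from the description above.

```python
from typing import Iterable, List, Optional, Set

ALL_WORKSPACES = "__all__"


class WorkspaceAccessDenied(PermissionError):
    """Raised when the requested workspace is not allowed for this user."""


def resolve_workspace(
    jwt_workspace_id: str,
    header_workspace_id: Optional[str],
    permitted: Set[str],
    is_admin: bool,
) -> str:
    """Pick the effective workspace: JWT claim, header override, or admin sentinel."""
    if header_workspace_id is None:
        return jwt_workspace_id
    if header_workspace_id == ALL_WORKSPACES:
        if not is_admin:
            raise WorkspaceAccessDenied("__all__ requires admin access")
        return ALL_WORKSPACES
    if header_workspace_id not in permitted:
        raise WorkspaceAccessDenied(header_workspace_id)
    return header_workspace_id


def filter_agents(agents: Iterable[dict], workspace_id: str) -> List[dict]:
    """Apply the workspace filter; the sentinel disables it."""
    if workspace_id == ALL_WORKSPACES:
        return list(agents)
    return [a for a in agents if a["workspace_id"] == workspace_id]
```

Checking the sentinel before the permission set keeps the admin path explicit, so a non-admin can never reach the unfiltered branch by being granted a workspace that happens to be named __all__.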


Last updated 23 days ago
