🤖 Agent Management
Complete guide to creating, configuring, and managing AI agents in Automatos AI
Master the art of creating and managing specialized AI agents
📖 Table of Contents
Overview
What is Agent Management?
The Agent Management page is your control center for creating, configuring, and monitoring AI agents. Think of agents as specialized AI employees - each with their own expertise, tools, and performance metrics.
Access: Navigate to Agents from the main sidebar

What Can You Do Here?
✅ Create specialized agents with custom models, skills, and tools
✅ Monitor agent performance with real-time metrics
✅ Configure agent behavior down to temperature and token limits
✅ Assign skills and tools for specialized capabilities
✅ Track coordination between multiple agents
✅ Analyze performance trends over time
Page Layout
The Agents page has 5 main tabs:
👥 Roster - View and manage all your agents
🧠 Skills - Browse and assign skills to agents
⚙️ Configuration - Fine-tune agent settings
🤝 Coordination - Multi-agent collaboration
📊 Performance - Metrics and analytics
Quick Start
Creating Your First Agent (3 Minutes)
Goal: Create a Code Review agent in under 3 minutes
Steps:
Click "Create Agent" button (top right)
Select "Code Architect" template
Name it: "Code Reviewer Pro"
Choose model: GPT-4 Turbo
Click "Create Agent"
✅ Done! Your agent is ready to review code.
Agent Roster Tab
Overview
The Roster tab shows all your agents at a glance with key metrics.

Statistics Cards
At the top, you'll see 4 metric cards:
📊 Total Agents 💡 Tooltip: "Total number of agents you've created. Each agent is a specialized AI entity."
What it shows: Count of all agents (active + inactive)
Example: "12 agents"
⚡ Active Agents 💡 Tooltip: "Agents currently ready to execute tasks. Inactive agents need to be started."
What it shows: Agents with status = "active"
Percentage: Shows % of total agents that are active
Example: "8 agents (67% online)"
⚙️ Agent Types 💡 Tooltip: "Different agent specializations (Code Architect, Security Expert, etc.)"
What it shows: Unique agent type count
Example: "5 types"
📈 Avg Performance 💡 Tooltip: "Average success rate across all agents based on task completion quality"
What it shows: Performance score (0-100%)
Status: "Excellent" (>90%), "Needs optimization" (<90%)
Example: "94.2%"

Search and Filter
Search Bar 💡 Tooltip: "Search agents by name, type, or description"
Type agent name or keywords
Filters list in real-time
Case-insensitive
Status Filter 💡 Tooltip: "Filter agents by their current status"
All: Show everything
Active: Only running agents
Idle: Agents waiting for tasks
Learning: Agents currently updating knowledge
Type Filter 💡 Tooltip: "Filter by agent specialization"
All Types: No filter
Code Architect: Code review, system design
Security Expert: Security audits
Data Analyst: Data analysis
Custom: User-created types

Agent Cards
Each agent is displayed as a card showing:
Header:
Agent name (e.g., "CodeMaster Pro")
Status badge: Active (green), Idle (gray), Busy (yellow), Error (red)
Type: Agent specialization
Model: LLM being used (GPT-4, Claude, etc.)

Metrics Row:
💡 Tooltip: "Quick performance snapshot for this agent"
Tasks: Total tasks executed
Success: Success rate percentage
Quality: Average quality score
Cost: Total or average cost
Action Buttons:
View Details: Opens Agent Details Modal
Execute Task: Run a task immediately
Edit: Opens Agent Configuration Modal
⋮ Menu: More actions (delete, duplicate, export)
Creating an Agent
Click the "Create Agent" button to open the Create Agent Modal.
Skills Tab
Overview
The Skills tab manages the expertise and capabilities agents can have.

Sub-Tabs
The Skills tab has 3 sub-tabs:
1. All Skills
💡 Tooltip: "Complete library of available skills. Add these to agents to enhance their capabilities."
What it shows:
All skills available in the system
Skill categories (Code Analysis, Security, Data Processing, etc.)
How many agents have each skill
Skill descriptions

Search Skills:
Type skill name or category
Filter by category dropdown
Results update in real-time
Skill Cards Show:
Skill Name: E.g., "Code Analysis"
Category: E.g., "Development"
Description: What the skill does
Agent Count: How many agents have this skill
Add to Agent button: Quick assignment
2. Agent Skills
💡 Tooltip: "View and manage skills for a specific agent. Add or remove skills to customize agent expertise."
What it shows:
Skills assigned to the selected agent
Skill proficiency levels
When each skill was added
Quick remove option

How to Use:
Select an agent from the dropdown
View their current skills
Click "Add Skill" to assign new skills
Click "✕" to remove a skill
Skill Proficiency Indicators:
⭐⭐⭐⭐⭐ Expert: High success rate with this skill
⭐⭐⭐⭐ Advanced: Good performance
⭐⭐⭐ Intermediate: Developing capability
⭐⭐ Beginner: New skill, learning
3. Skill Categories
💡 Tooltip: "Skills organized by domain. Browse categories to find relevant capabilities."
Categories Include:
🔧 Development: Code analysis, system design, API design, refactoring
🛡️ Security: Security audit, penetration testing, compliance
📊 Data: Data processing, statistics, visualization, ML
⚡ Performance: Performance analysis, optimization, profiling
☁️ Cloud: Cloud architecture, Kubernetes, CI/CD, monitoring

Using Categories:
Click a category to expand
View all skills in that category
Click "Add to Agent" on any skill
Select target agent from dropdown
Creating Custom Skills
Click "Create Skill" to open the Create Skill Modal.
Configuration Tab
Overview
The Configuration tab lets you fine-tune agent behavior and settings.
💡 Tooltip: "Advanced agent configuration. Modify model settings, resource limits, and behavior parameters."

Agent Selector
Dropdown: Select which agent to configure
💡 Tooltip: "Choose an agent to view and modify its configuration"
Lists all agents by name
Shows agent type and status
Auto-selects first agent on tab load
Configuration Sections
Quick Stats Panel:
Shows current configuration at a glance:
Model: Which LLM (GPT-4, Claude, etc.)
Temperature: Creativity level (0.0-2.0)
Max Tokens: Output limit
Skills: Count of assigned skills
Tools: Count of connected tools

Edit Configuration Button:
Opens the Agent Configuration Modal with 5 detailed tabs.
Configuration Display
The main area shows read-only view of:
General Settings:
Name, description, agent type
Status (active/inactive)
Creation date
Last updated
Model Configuration:
Provider (OpenAI, Anthropic, HuggingFace)
Model ID (gpt-4-turbo-preview, claude-3-opus, etc.)
Temperature setting
Max tokens
Fallback model (if configured)
Skills Assigned:
List of all skills with badges
Skill categories color-coded
Add/remove skill buttons
Tools Assigned:
Connected MCP tools
Tool status (active/inactive)
Permission levels
Test connection buttons

Coordination Tab
Overview
The Coordination tab manages multi-agent collaboration and communication.
💡 Tooltip: "Set up agents to work together on complex tasks. Agents can share knowledge and coordinate actions."

Sub-Tabs
1. Agent Coordination
What it shows:
Active agent teams
Communication patterns between agents
Shared memory spaces
Coordination metrics
Features:
Agent Teams Panel 💡 Tooltip: "Groups of agents that work together. Create teams for recurring collaboration patterns."
View existing teams
Create new teams
Assign agents to teams
Set team leader (optional)
Communication Flow 💡 Tooltip: "Visualize how agents communicate during workflow execution"
Network graph showing agent connections
Message frequency indicators
Communication success rates
Bottleneck identification

Shared Context 💡 Tooltip: "Memory and knowledge shared between team members. Enables agents to build on each other's work."
View shared memory items
See knowledge transfer
Monitor context synchronization
2. Collaborative Reasoning
What it shows:
Multi-agent problem-solving sessions
Consensus-building results
Ensemble decision outcomes
Collaboration quality scores
Features:
Active Collaborations 💡 Tooltip: "Ongoing multi-agent problem-solving sessions"
Currently running collaborative sessions
Participating agents
Problem being solved
Progress indicators
Collaboration History 💡 Tooltip: "Past collaborative sessions with outcomes and lessons learned"
Historical collaboration data
Success rates by agent combination
Quality improvements from collaboration
Lessons learned

Performance Tab
Overview
Track and analyze agent performance with detailed metrics and trends.
💡 Tooltip: "Monitor agent efficiency, quality, and resource usage. Identify optimization opportunities."

Sub-Tabs
The Performance tab has 4 sub-tabs:
1. Resource Usage
What it shows:
Token consumption over time
Cost analysis
Memory utilization
API call frequency
Metrics Cards:
💰 Total Cost 💡 Tooltip: "Total spend on this agent across all executions. Based on model pricing."
Shows total dollars spent
Cost trend (increasing/decreasing)
Average cost per task
🎯 Token Usage 💡 Tooltip: "Tokens are units of text processed by AI models. Lower usage = better efficiency."
Total tokens consumed
Input vs output ratio
Token efficiency trend
⏱️ Execution Time 💡 Tooltip: "Average time to complete tasks. Helps identify performance bottlenecks."
Average task duration
Fastest/slowest tasks
Time trend

Charts:
Token Usage Over Time: Line chart showing daily token consumption
Cost Breakdown: Pie chart of costs by task type
Efficiency Trend: Line chart showing tokens per task over time
2. Task History
💡 Tooltip: "Complete record of all tasks this agent has executed"
Task List shows:
Each task row displays:
Task ID: Unique identifier
Description: What the task was
Status: ✅ Success, ❌ Failed, ⏸️ Paused
Quality Score: 0.0-1.0 rating
Duration: Time taken
Tokens: Tokens used
Cost: Estimated cost
Timestamp: When executed

Filtering:
Filter by status (all/success/failed)
Filter by date range
Search by task description
Sort by any column
Actions:
Click row to see full task details
Retry failed tasks
View execution logs
Download task results
3. Performance Trends
💡 Tooltip: "Visualize how agent performance changes over time. Identify improvements or degradation."
Charts Include:
Success Rate Trend Shows success percentage over time (last 7/30/90 days)
Green line = good trend (improving)
Red line = concerning trend (declining)
Target line at 85%
Quality Score Distribution Histogram showing distribution of quality scores
Most tasks should be >0.8
Few tasks should be <0.5
Normal distribution is healthy
Execution Time Trend How long tasks take over time
Downward trend = agent getting faster
Upward trend = may need optimization

Token Usage Patterns
Daily token consumption
Peak usage times
Efficiency improvements
Cost Analysis
Daily/weekly/monthly cost trends
Cost per task type
Budget vs actual
4. Activity Logs
💡 Tooltip: "Detailed log of all agent actions. Useful for debugging and auditing."
Log Entries Show:
Timestamp: Exact time
Action: What the agent did
Result: Success/failure
Details: Additional context
Duration: How long it took

Log Types:
Task Started: Agent began execution
Tool Called: Agent used an external tool
Memory Access: Agent retrieved/stored memory
Communication: Agent sent/received messages
Error: Something went wrong
Learning: Agent updated knowledge
Filtering:
Filter by log type
Filter by time range
Search log messages
Export logs to CSV
Creating Agents
Create Agent Modal
Click "Create Agent" to open the 4-step wizard.

Step 1: Agent Type
💡 Tooltip: "Choose a template or start from scratch. Templates come pre-configured with recommended skills and settings."
Template Options:
🏗️ Code Architect
Best for: Code review, system design, refactoring
Pre-configured with: Code analysis, API design, system design skills
Recommended model: GPT-4 Turbo
Use cases: Pull request reviews, architecture planning
🛡️ Security Expert
Best for: Security audits, vulnerability scanning, compliance
Pre-configured with: Security audit, OWASP, threat modeling skills
Recommended model: Claude 3 Opus
Use cases: Security reviews, penetration testing
📊 Data Analyst
Best for: Data analysis, statistics, insights generation
Pre-configured with: Data processing, statistics, visualization skills
Recommended model: GPT-4
Use cases: Business intelligence, data exploration
⚡ Performance Optimizer
Best for: Performance analysis, optimization, scalability
Pre-configured with: Performance analysis, profiling, optimization skills
Recommended model: GPT-4 Turbo
Use cases: Performance reviews, bottleneck identification
🚀 Infrastructure Manager
Best for: Cloud infrastructure, DevOps, deployment
Pre-configured with: Cloud architecture, Kubernetes, CI/CD skills
Recommended model: Claude 3 Sonnet
Use cases: Infrastructure planning, deployment automation
⚙️ Custom Agent
Best for: Specialized use cases
Pre-configured with: Nothing (you configure everything)
Use cases: Domain-specific agents

Choosing the Right Template:
💡 Beginner Tip: Start with a template that matches your use case. You can customize everything later.
🔧 Advanced: Custom agents give you complete control but require more configuration.
Step 2: Configuration
Basic Information:
Agent Name (required) 💡 Tooltip: "Give your agent a descriptive name. Examples: 'Python Security Reviewer', 'API Documentation Expert'"
Enter a clear, descriptive name
Good: "Code Reviewer - Python Security"
Bad: "Agent 1" or "Test"
Description (optional) 💡 Tooltip: "Explain what this agent does. Helps team members understand its purpose."
Describe the agent's role
Mention specific use cases
Note any special configurations
Agent Type (pre-filled from Step 1) 💡 Tooltip: "Agent specialization. Determines default skills and recommended settings."
Auto-filled based on template selection
Can be changed if needed

Priority Level 💡 Tooltip: "When multiple agents need resources, higher priority agents execute first."
High: Critical agents, production tasks
Medium: Standard operations (default)
Low: Background tasks, experimental agents
Max Concurrent Tasks 💡 Tooltip: "Maximum tasks this agent can handle simultaneously. Higher = more parallel execution."
Range: 1-10 tasks
Recommended: 3-5 for most agents
Higher for simple tasks, lower for complex analyses
Auto-Start 💡 Tooltip: "Automatically activate this agent when platform starts. Recommended for critical agents."
✅ Enabled: Agent starts automatically
❌ Disabled: Manual start required (default)
Step 3: Model Selection
Choose the AI model that powers your agent.
💡 Tooltip: "Different models have different strengths. GPT-4 excels at code, Claude excels at analysis."

Provider Selection:
OpenAI
GPT-4 Turbo (best for code, large context)
GPT-4 (excellent quality, smaller context)
GPT-3.5 Turbo (fast, cost-effective)
Anthropic
Claude 3 Opus (excellent analysis, 200K context)
Claude 3 Sonnet (balanced performance)
Claude 3 Haiku (fast, economical)
Model Details Card:
When you select a model, you'll see:
📊 Model Information:
Display Name: Human-readable name
Model Family: Model series
Provider Badge: OpenAI/Anthropic/HuggingFace
📏 Specifications:
Context Window: How much text it can process (e.g., "128K tokens")
Max Output: Maximum response length (e.g., "4K tokens")
💰 Costs:
Input Cost: Price per 1,000 input tokens
Output Cost: Price per 1,000 output tokens
Example: "$0.01/1K input, $0.03/1K output"
⚡ Capabilities:
Reasoning: Excellent/Good/Fair
Coding: Excellent/Good/Fair
Analysis: Excellent/Good/Fair
✨ Features:
✅ Function Calling (can use tools)
✅ Streaming (real-time responses)
✅ Vision (can understand images) - future feature
🎯 Recommended For:
Code analysis
Complex reasoning
System design
etc.
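To make the pricing concrete, here is a rough per-task cost estimate using the example rates shown above ($0.01 per 1K input tokens, $0.03 per 1K output tokens); the token counts are illustrative, not measured values.

```python
# Rough per-task cost estimate using the example rates shown above.
# Rates and token counts are illustrative only.
INPUT_RATE = 0.01 / 1000   # $ per input token
OUTPUT_RATE = 0.03 / 1000  # $ per output token

input_tokens = 3_000   # e.g., a medium-sized code file plus instructions
output_tokens = 1_200  # e.g., a detailed review

cost = input_tokens * INPUT_RATE + output_tokens * OUTPUT_RATE
print(f"Estimated cost per task: ${cost:.3f}")  # -> $0.066
```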

Model Configuration:
Temperature Slider 💡 Tooltip: "Controls randomness. Lower (0.0-0.5) = focused/consistent. Higher (0.7-2.0) = creative/varied."
Range: 0.0 to 2.0
0.0-0.3: Deterministic, consistent (good for code)
0.5-0.7: Balanced (good for most tasks)
0.8-1.2: Creative (good for ideation)
1.5-2.0: Very creative (experimental)
Max Tokens Slider 💡 Tooltip: "Maximum length of agent responses. Higher = longer responses but more expensive."
Range: 100 to 4096 tokens
500-1000: Short responses
1500-2500: Medium responses (recommended)
3000-4096: Long, detailed responses
Fallback Model (optional) 💡 Tooltip: "Backup model used if primary model fails or hits rate limits. Ensures reliability."
Select a cheaper/faster fallback
Common: GPT-3.5 Turbo as fallback for GPT-4
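As a compact sketch, the settings above might be expressed together like this; the field names are assumptions for illustration, not the platform's documented schema.

```python
# Hypothetical model-configuration payload; field names are assumed
# for illustration and may differ from the actual Automatos AI schema.
model_config = {
    "provider": "openai",
    "model": "gpt-4-turbo-preview",
    "temperature": 0.3,                  # low: focused, consistent (good for code)
    "max_tokens": 2000,                  # medium-length responses
    "fallback_model": "gpt-3.5-turbo",   # cheaper backup if the primary fails
}
```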
Step 4: Skills & Tools
Skills Selection:
💡 Tooltip: "Choose expertise areas for this agent. Skills enhance prompts with domain knowledge."

How to Add Skills:
Browse available skills (grouped by category)
Click "+ Add" next to desired skills
Selected skills appear in "Assigned Skills" list
Remove by clicking "✕" on assigned skills
Recommended Skills (based on agent type):
Shows suggested skills for your template
Click "Add All Recommended" for quick setup
Tools Selection:
💡 Tooltip: "External integrations this agent can use. Connect GitHub, Slack, databases, and 400+ other tools."
Available Tools by Category:
💻 Developer Tools: GitHub, GitLab, Bitbucket
💬 Communication: Slack, Discord, Email
☁️ Cloud: AWS, Azure, GCP
🗄️ Databases: PostgreSQL, MySQL, MongoDB
🔧 Utilities: File operations, shell commands

How to Add Tools:
Browse tools by category
Check boxes for tools you want
Tools appear in "Assigned Tools" list
Configure tool permissions (read/write/execute)
⚠️ Important: Some tools require credentials to be configured first. See Credentials Guide.
Review & Create
Final Review Screen:
Shows summary of all configurations:
Agent name and type
Selected model
Temperature and token settings
Assigned skills (count)
Assigned tools (count)
Estimated monthly cost (if high usage)

Buttons:
← Back: Return to previous step to modify
Create Agent: Finalize creation
Save as Template: Save configuration for reuse
Creation Process:
After clicking "Create Agent":
Creating database record... ✓ (1-2 seconds)
Initializing LLM connection... ✓ (2-3 seconds)
Loading skills... ✓ (<1 second)
Connecting tools... ✓ (1-2 seconds)
Verifying capabilities... ✓ (1-2 seconds)
Agent ready! 🎉

Success:
Agent appears in Roster tab
Status is set to "Active"
Ready to execute tasks immediately
Agent Modals Reference
Agent Details Modal
How to open: Click "View Details" on any agent card
💡 Tooltip: "Comprehensive overview of agent status, performance, and capabilities"

Tab 1: Overview
Displays:
Agent Information:
Full name and description
Agent type with icon
Current status (active/idle/busy)
Creation date
Last active timestamp
Model configuration
Current Activity:
Currently executing task (if any)
Task progress percentage
Estimated time remaining
Cancel task button (if applicable)
Quick Stats:
Total tasks executed
Success rate
Average quality score
Total cost to date
Assigned Resources:
Skills list with proficiency
Tools list with usage counts
Memory size (MB)

Tab 2: Performance
Real-time Performance Metrics:
Success Rate Chart 💡 Tooltip: "Percentage of tasks completed successfully. Target: >90%"
Last 7/30/90 days
Trend line showing improvement/decline
Benchmark comparison
Quality Score Chart 💡 Tooltip: "Average quality rating of agent outputs. Range: 0.0 (poor) to 1.0 (excellent)"
Distribution histogram
Average quality line
Quality trend over time
Execution Time Chart 💡 Tooltip: "How long tasks take. Lower is better for efficiency."
Average time per task type
Time trend (getting faster/slower)
Outlier detection

Performance Breakdown by Task Type:
Table showing:
Task type name
Count of executions
Success rate %
Avg quality score
Avg duration
Avg cost
Tab 3: Workload
💡 Tooltip: "Current and historical task load for this agent"
Current Workload:
Active Tasks: Currently executing (with progress)
Queued Tasks: Waiting for execution
Capacity: Used/Total (e.g., "2/5 slots")
Workload Chart:
Tasks over time (hourly/daily)
Peak usage hours
Idle time percentage
Task Queue:
List of pending tasks
Priority order
Estimated queue time
Cancel/reorder options

Tab 4: Skills
Skills Management:
Assigned Skills List:
All skills currently assigned
Proficiency level for each
Tasks completed using each skill
Success rate per skill
Add Skills Button:
Opens skill selector
Shows available skills not yet assigned
Add multiple skills at once
Skill Performance:
Chart showing which skills are used most
Quality scores broken down by skill
Recommendations for skill additions

Actions:
Remove underperforming skills
Add complementary skills
Adjust skill priorities
Agent Configuration Modal
How to open: Click "Edit" on agent card, or "Edit Configuration" in Configuration tab
💡 Tooltip: "Deep configuration of all agent parameters. Advanced users can fine-tune every aspect."

Tab 1: General
Basic Settings:
Name 💡 Tooltip: "Agent display name. Use descriptive names for team clarity."
Description 💡 Tooltip: "Explain this agent's purpose and use cases"
Agent Type 💡 Tooltip: "Specialization category. Affects default skills and behavior."
Status 💡 Tooltip: "Active agents can execute tasks. Inactive agents are disabled."
Active / Inactive toggle
Reason for status (if inactive)

Metadata:
Tags 💡 Tooltip: "Organize agents with tags. Example: 'production', 'experimental', 'python'"
Add multiple tags
Filter/search by tags later
Suggested tags based on skills
Created By
Shows who created the agent
Creation timestamp
Read-only field
Last Modified
Last edit timestamp
Who made the edit
Change history link
Tab 2: Performance
Execution Settings:
Priority Level 💡 Tooltip: "Determines queue position. High priority agents execute before low priority."
High / Medium / Low
Affects resource allocation
Max Concurrent Tasks 💡 Tooltip: "How many tasks this agent can handle simultaneously"
Range: 1-10
Higher = more parallelism
Consider model rate limits
Timeout Settings 💡 Tooltip: "Maximum time allowed for task execution before auto-cancel"
Task timeout: 30s to 30 minutes
Default: 5 minutes
Adjust based on task complexity

Retry Settings:
Auto-Retry Failed Tasks 💡 Tooltip: "Automatically retry tasks that fail due to temporary errors"
Enable/disable toggle
Max retries: 1-5
Retry delay: Exponential backoff
Which errors to retry (timeout, rate limit, etc.)
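For intuition, a generic exponential-backoff retry loop is sketched below; it illustrates the behavior described above and is not the platform's actual retry implementation.

```python
import time

# Generic exponential-backoff retry sketch (not Automatos AI's internal code).
RETRYABLE = ("timeout", "rate_limit")

def run_with_retries(execute_task, max_retries=3, base_delay=2.0):
    for attempt in range(max_retries + 1):
        try:
            return execute_task()
        except Exception as err:
            error_kind = getattr(err, "kind", "unknown")
            if error_kind not in RETRYABLE or attempt == max_retries:
                raise  # permanent error, or retries exhausted
            # Wait longer after each failed attempt: 2s, 4s, 8s, ...
            time.sleep(base_delay * (2 ** attempt))
```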
Performance Optimization:
Caching 💡 Tooltip: "Cache similar task results to save time and cost. Recommended for repetitive tasks."
Enable result caching
Cache duration: 5min to 24 hours
Similarity threshold for cache hits
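A minimal sketch of result caching with a TTL is shown below, keyed on the normalized task text; the platform's similarity-threshold matching for near-duplicate tasks is not modeled in this toy version.

```python
import time

# Toy result cache: exact match on normalized task text, with a TTL.
_cache: dict[str, tuple[float, str]] = {}
CACHE_TTL = 30 * 60  # 30 minutes

def cached_execute(task_description: str, execute) -> str:
    key = " ".join(task_description.lower().split())
    hit = _cache.get(key)
    if hit and time.time() - hit[0] < CACHE_TTL:
        return hit[1]                      # cache hit: no tokens spent
    result = execute(task_description)     # cache miss: run the agent
    _cache[key] = (time.time(), result)
    return result
```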
Tab 3: Resources
Resource Limits:
💡 Tooltip: "Set limits to control costs and prevent runaway executions"
Token Budget:
Daily Limit: Max tokens per day
Per Task Limit: Max tokens per single task
Warning Threshold: Alert when approaching limit

Cost Budget:
Daily Budget: Max spend per day (USD)
Monthly Budget: Max spend per month
Alert Email: Who to notify when limit reached
Memory Limits:
Working Memory: Max items in active memory (default: 7)
Short-term Memory: Retention period (default: 24 hours)
Long-term Memory: Max storage (MB)
🔧 Advanced: Rate Limiting
Configure API rate limits per model:
Requests per minute
Requests per hour
Requests per day
Concurrent request limit
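For intuition on what a per-minute request limit does, here is a generic sliding-window limiter sketch; it illustrates the concept only and is not the platform's rate-limiting code.

```python
import time
from collections import deque

# Generic sliding-window limiter: allow at most `limit` calls per 60 seconds.
class PerMinuteLimiter:
    def __init__(self, limit: int):
        self.limit = limit
        self.calls: deque[float] = deque()

    def acquire(self) -> None:
        now = time.time()
        # Drop timestamps older than the 60-second window.
        while self.calls and now - self.calls[0] > 60:
            self.calls.popleft()
        if len(self.calls) >= self.limit:
            time.sleep(60 - (now - self.calls[0]))  # wait for the window to free up
        self.calls.append(time.time())
```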
Tab 4: Skills
Skills Configuration:
💡 Tooltip: "Add or remove skills. Each skill enhances the agent's system prompt with domain expertise."

Assigned Skills Panel:
Shows each skill with:
Skill name and category
Proficiency level (calculated from performance)
Usage count
Remove button
Add Skills Panel:
Browse available skills
Search by name or category
Multi-select for batch adding
Preview skill descriptions
Skill Ordering:
Drag to reorder skills
Higher priority skills emphasized in prompts
Affects agent specialization
Tab 5: Model
Model Configuration:

Provider & Model:
💡 Tooltip: "Select the LLM that powers this agent. See model comparison for guidance."
Provider dropdown (OpenAI, Anthropic, HuggingFace)
Model dropdown (filtered by provider)
Model details card (same as Step 3)
Advanced Model Settings:
🔧 Advanced
Temperature 💡 Tooltip: "Sampling temperature controls randomness in responses"
Slider: 0.0 to 2.0
Decimal precision: 0.1 increments
Live preview of behavior
Top P (Nucleus Sampling) 💡 Tooltip: "Alternative to temperature. Limits token selection to top probability mass."
Range: 0.0 to 1.0
Default: 1.0 (disabled)
Adjust either Top P or temperature, not both
Frequency Penalty 💡 Tooltip: "Reduces repetition by penalizing frequently used tokens"
Range: -2.0 to 2.0
Default: 0.0
Positive values reduce repetition
Presence Penalty 💡 Tooltip: "Encourages exploring new topics by penalizing tokens already used"
Range: -2.0 to 2.0
Default: 0.0
Positive values increase topic diversity
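These sampling parameters correspond to standard fields in LLM provider APIs. A hedged sketch of an OpenAI-style request is shown below; the parameter names follow the OpenAI Chat Completions API and the values are illustrative only.

```python
from openai import OpenAI  # assumes the OpenAI Python SDK (v1+) is installed

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# Illustrative values only: adjust temperature OR top_p, not both.
response = client.chat.completions.create(
    model="gpt-4-turbo-preview",
    messages=[{"role": "user", "content": "Review this function for bugs."}],
    temperature=0.3,        # focused, consistent output
    top_p=1.0,              # left at default so temperature controls sampling
    frequency_penalty=0.2,  # mild discouragement of repeated phrasing
    presence_penalty=0.0,   # no extra push toward new topics
    max_tokens=1500,
)
print(response.choices[0].message.content)
```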
Fallback Model:
Secondary model if primary fails
Auto-selected based on primary
Can customize
Model Verification:
Test model connection
Verify API key works
Check rate limits
Estimate response time
Completing Agent Creation
After configuring all steps:
Click "Create Agent" button
Watch progress indicators
Wait for verification (5-10 seconds)
Success message appears
Agent is now in Roster tab

Agent Status Control Modal
How to open: Click agent status badge or use "⋮" menu → "Change Status"
💡 Tooltip: "Control agent lifecycle state. Start, pause, or stop agents as needed."

Status Options:
🟢 Activate:
Makes agent available for tasks
Agent joins execution pool
Can receive task assignments
🟡 Pause:
Temporarily disable
Completes current tasks
Refuses new tasks
Preserves all state
🔴 Deactivate:
Fully disable agent
Stops all current tasks
Cannot receive assignments
Good for maintenance
Status Transition Rules:
Active → Pause → Deactivate
Can't deactivate while tasks running
Paused agents can be activated directly
Reason Field 💡 Tooltip: "Note why you're changing status. Helps team understand agent lifecycle."
Optional text field
Example: "Upgrading to new model"
Logged in agent history
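Status changes can presumably also be made through the update endpoint listed in the API Reference at the end of this guide (PUT /api/agents/{id}); the payload fields below (status, reason) are assumptions for illustration, not a documented schema.

```python
import requests

BASE_URL = "https://your-automatos-host"  # placeholder host
AGENT_ID = "agent-123"                    # placeholder ID

# Hypothetical request body: field names are assumed, not a documented schema.
resp = requests.put(
    f"{BASE_URL}/api/agents/{AGENT_ID}",
    json={"status": "inactive", "reason": "Upgrading to new model"},
    headers={"Authorization": "Bearer <api-token>"},  # assumed auth format
    timeout=30,
)
resp.raise_for_status()
```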
Agent Confirm Delete Modal
How to open: Click "⋮" menu → "Delete Agent"
⚠️ Warning: Deleting an agent is permanent and cannot be undone.

Safety Checks:
Before deletion allowed:
✅ Agent must be inactive (no running tasks)
✅ Type agent name to confirm
✅ Acknowledge data loss warning
What Gets Deleted:
❌ Agent configuration
❌ Performance history
❌ Task execution logs
❌ Skills assignments
❌ Tool connections
✅ Memories preserved (optional checkbox)
Alternatives to Deletion:
Instead of deleting, consider:
Deactivate: Keep agent but disable
Archive: Move to archived agents (future feature)
Export: Save configuration before deleting
Create Skill Modal
How to open: Click "Create Skill" button in Skills tab
💡 Tooltip: "Define custom skills for domain-specific expertise"

Fields:
Skill Name (required) 💡 Tooltip: "Descriptive name for the skill. Example: 'Python Security Audit'"
Clear, specific names
Include technology if relevant
Use title case
Category (required) 💡 Tooltip: "Organize skills by domain for easy browsing"
Select from: Development, Security, Data, Performance, Cloud, Custom
Affects skill icon and color
Description (required) 💡 Tooltip: "Explain what this skill does. Used to enhance agent prompts."
What expertise does this provide?
When should agents use it?
Example use cases
Prompt Enhancement (advanced) 💡 Tooltip: "Custom prompt text added to agents with this skill. Use to inject domain knowledge."
🔧 Advanced:
Example Enhancement:
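As an illustration only, an enhancement for a hypothetical "Python Security Audit" skill might read: "You specialize in Python security review. Check every change for injection risks, unsafe deserialization, hard-coded secrets, and missing input validation, and reference the relevant OWASP category in each finding."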
Skill Configuration Modal
How to open: Click "⚙️" icon on skill card in Skills tab
💡 Tooltip: "Modify skill parameters and enhancement text"

Configurable Settings:
Skill Priority 💡 Tooltip: "Higher priority skills are emphasized more in agent prompts"
High / Medium / Low
Affects prompt weighting
Enable/Disable
Temporarily disable without removing
Useful for testing skill impact
Prompt Enhancement:
Edit custom prompt text
Preview how it affects agent
Save versions for A/B testing
Performance Tracking:
View tasks using this skill
Success rate with skill
Quality scores
Disable if underperforming
Common Tasks
Task 1: Creating a Code Review Agent
Scenario: You want an agent to review Python pull requests
Steps:
Go to Agents page → Click "Create Agent"
Step 1: Select "Code Architect" template
Step 2:
Name: "Python Code Reviewer"
Description: "Reviews Python PRs for security and quality"
Priority: High
Step 3:
Provider: OpenAI
Model: GPT-4 Turbo
Temperature: 0.7
Max Tokens: 4000
Step 4:
Skills: ✅ Code Analysis, ✅ Security Audit, ✅ Python
Tools: ✅ GitHub, ✅ CodeGraph, ✅ File Operations
Click "Create Agent"
⏱️ Time: 3 minutes 🎯 Result: Agent ready to review PRs
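The same agent could presumably be created programmatically through the create endpoint listed in the API Reference (POST /api/agents); the JSON field names below are assumptions for illustration, not the documented request schema.

```python
import requests

BASE_URL = "https://your-automatos-host"  # placeholder host

# Hypothetical request body: field names are assumed for illustration.
payload = {
    "name": "Python Code Reviewer",
    "description": "Reviews Python PRs for security and quality",
    "agent_type": "code_architect",
    "priority": "high",
    "model": {"provider": "openai", "model": "gpt-4-turbo-preview",
              "temperature": 0.7, "max_tokens": 4000},
    "skills": ["code_analysis", "security_audit", "python"],
    "tools": ["github", "codegraph", "file_operations"],
}

resp = requests.post(
    f"{BASE_URL}/api/agents",
    json=payload,
    headers={"Authorization": "Bearer <api-token>"},  # assumed auth format
    timeout=30,
)
resp.raise_for_status()
print(resp.json())
```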
Task 2: Executing a Task on an Agent
Scenario: Test your new agent with a code review
Steps:
Find your agent in Roster tab
Click "Execute Task" button
Task Description: "Review this authentication middleware for security issues"
Add Context (optional):
Click "Execute"
Watch real-time progress
View results when complete
⏱️ Time: 5-30 seconds execution 🎯 Result: Detailed security analysis
Task 3: Assigning a Tool to an Agent
Scenario: Give your agent access to Slack notifications
Steps:
Select agent in Configuration tab
Click "Edit Configuration"
Go to Tab 5: Model (or tools section)
Find "Slack" in available tools
Click "+ Add"
Configure Slack credentials (if not set)
Set permissions: ✅ Read ✅ Write
Click "Save"
⏱️ Time: 2 minutes 🎯 Result: Agent can now send Slack messages
Task 4: Monitoring Agent Performance
Scenario: Check if your agent is performing well
Steps:
Go to Performance tab
Select agent from dropdown
Review Resource Usage:
Is cost within budget?
Are tokens reasonable?
Check Task History:
Are most tasks successful?
Any patterns in failures?
View Performance Trends:
Is quality improving?
Is execution time decreasing?
⏱️ Time: 5 minutes 🎯 Result: Understanding of agent health
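The same health check can be approximated from code using the runs endpoint listed in the API Reference; the response field names used below are assumptions, not a documented schema.

```python
import requests

BASE_URL = "https://your-automatos-host"  # placeholder host
AGENT_ID = "agent-123"                    # placeholder ID

resp = requests.get(
    f"{BASE_URL}/api/runs",
    params={"agent_id": AGENT_ID, "limit": 50},
    headers={"Authorization": "Bearer <api-token>"},  # assumed auth format
    timeout=30,
)
runs = resp.json()  # assumed to be a list of run objects

# Field names below are illustrative guesses, not a documented schema.
successes = sum(1 for r in runs if r.get("status") == "success")
print(f"Success rate over last {len(runs)} runs: {successes / max(len(runs), 1):.0%}")
```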
Task 5: Improving Agent Quality
Scenario: Your agent's quality score is 0.65 (target: >0.8)
Steps to Improve:
Add relevant skills:
Go to Configuration → Edit
Tab 4: Skills
Add missing domain skills
Use better model:
Tab 5: Model
Upgrade to GPT-4 (from GPT-3.5)
Or Claude 3 Opus for analysis tasks
Increase max tokens:
Tab 2: Performance
Increase max tokens (more detailed responses)
From 1000 → 2500 tokens
Add more context:
Use CodeGraph tool for code tasks
Upload relevant documents to knowledge base
Enable RAG for document reference
Monitor improvements:
Check quality scores after changes
Compare before/after metrics
Fine-tune further as needed
⏱️ Time: 10 minutes 🎯 Result: Quality score improvement to >0.8
Advanced Features
Multi-Agent Teams
🔧 Advanced
Create agent teams that work together on complex tasks.
How to Create a Team:
Go to Coordination tab
Click "Create Team"
Team Name: "Security Review Team"
Add Agents:
CodeArchitect-001 (lead)
SecurityExpert-003 (specialist)
DocumentationExpert-005 (documenter)
Configure Collaboration:
Strategy: Hierarchical (lead coordinates)
Shared Memory: Enabled
Communication: Real-time
Click "Create Team"
Using Teams in Workflows:
When creating workflows, select team instead of individual agents for collaborative execution.
See Coordination Tab for details →
Agent Memory Management
🔧 Advanced
Agents maintain hierarchical memory across sessions.
Memory Tiers:
Working Memory (Redis, 5 min TTL) 💡 Tooltip: "Active task context. Cleared after task completion."
Current task details
Temporary variables
Capacity: 7 items (Miller's Law)
Short-term Memory (PostgreSQL, 24 hours) 💡 Tooltip: "Recent experiences and learnings. Consolidated nightly."
Recent task executions
Temporary insights
Session interactions
Long-term Memory (PostgreSQL + pgvector, Permanent) 💡 Tooltip: "Consolidated knowledge and learned patterns. Permanent storage."
Validated patterns
Domain knowledge
Success strategies
Collective Memory (Shared, Permanent) 💡 Tooltip: "Knowledge shared across all agents in your organization."
Cross-agent patterns
Organizational wisdom
Best practices
Viewing Agent Memory:
Open Agent Details Modal
Go to Advanced section (if available)
View memory statistics
Browse memory items
Manually add important memories
Custom Agent Types
🔧 Advanced
Developers can create completely custom agent types.
Requirements:
Understanding of agent architecture
Familiarity with prompting
Testing capability
See Developer Guide for details →
Agent Performance Optimization
🔧 Advanced
Optimization Strategies:
For Speed:
Use GPT-3.5 Turbo or Claude Haiku
Reduce max tokens
Enable caching
Limit tool usage
For Quality:
Use GPT-4 or Claude Opus
Increase max tokens
Add more relevant skills
Enable RAG/CodeGraph
For Cost:
Use cheaper models for simple tasks
Enable aggressive caching
Set strict token limits
Monitor and adjust
Benchmarking:
Use Performance tab to:
Compare agent variations
A/B test configurations
Track optimization impact
Validate improvements
Tips & Best Practices
Agent Naming
Good naming patterns:
✅ "Python Security Reviewer"
✅ "API Documentation Expert - v2"
✅ "CodeReviewer-Prod-001"
Poor naming patterns:
❌ "Agent 1"
❌ "Test"
❌ "My Agent"
💡 Tip: Include the agent's purpose and technology in the name.
Model Selection
Choose model based on task:
Code review
GPT-4 Turbo
Excellent code understanding, large context
Security audit
Claude 3 Opus
Superior analytical reasoning
Data analysis
GPT-4
Balanced performance
Simple tasks
GPT-3.5 Turbo
Fast and cost-effective
Documentation
Claude 3 Sonnet
Good at structured writing
💡 Tip: Start with GPT-4 Turbo for most tasks, then optimize based on results.
Skill Assignment
Best practices:
✅ Assign 3-7 skills per agent (not too few, not too many)
✅ Choose complementary skills (e.g., "Python" + "Security Audit")
✅ Avoid redundant skills
✅ Remove underused skills after analysis
💡 Tip: Check Performance tab to see which skills are actually used.
Resource Limits
Recommended limits (token budget / cost budget):
Light (testing): 50,000 tokens, $1
Medium (development): 200,000 tokens, $5
Heavy (production): 1,000,000 tokens, $25
💡 Tip: Start conservative, increase based on actual usage.
Performance Monitoring
Check weekly:
✅ Success rate (target: >90%)
✅ Quality score (target: >0.8)
✅ Cost per task (track trends)
✅ Execution time (look for increases)
💡 Tip: Set up alerts for performance degradation.
Troubleshooting
Agent Won't Execute Tasks
Symptom: Agent shows "idle" but tasks aren't starting
Solutions:
Check agent status:
Must be "Active" to receive tasks
Go to Roster → Check status badge
Activate if needed
Verify model configuration:
Open Configuration tab
Click "Edit Configuration"
Tab 5: Test model connection
Ensure API key is valid
Check resource limits:
Tab 3: Resources
Verify not at token/cost limits
Increase limits if needed
Review error logs:
Performance tab → Activity Logs
Look for error messages
Address specific errors
Low Quality Scores
Symptom: Quality scores consistently <0.7
Solutions:
Improve model:
Switch to GPT-4 or Claude Opus
Increase temperature for creative tasks
Decrease for analytical tasks
Add skills:
Review what skills are missing
Add domain-specific skills
Check skill usage in Performance tab
Provide better context:
Use CodeGraph for code tasks
Upload relevant documents
Enable RAG retrieval
Check task descriptions:
Are instructions clear?
Are examples provided?
Is context sufficient?
High Costs
Symptom: Agent costing more than expected
Solutions:
Use cheaper model:
GPT-3.5 Turbo instead of GPT-4
Claude Haiku instead of Opus
Reduce max tokens:
Lower from 4000 to 2000
Shorter responses = lower cost
Enable caching:
Configuration → Tab 2
Enable result caching
Set appropriate TTL
Limit tool calls:
Remove unnecessary tools
Some tools make expensive API calls
Monitor tool usage
Agent Not Appearing in List
Symptom: Created agent but don't see it in Roster
Solutions:
Refresh the page: Browser cache may be stale
Check filters: May be filtered out by status/type
Clear search: Search bar may be filtering
Check creation success: Look for success message
View all agents: Remove all filters
Model Connection Failed
Symptom: "LLM verification error" during creation
Solutions:
Check API key:
Go to Settings → Credentials
Verify OpenAI/Anthropic API key exists
Test connection
Verify key validity:
API key may be expired
Regenerate key in provider dashboard
Update in Credentials tab
Check rate limits:
You may have hit API rate limits
Wait a few minutes
Consider rate limit settings
Network issues:
Check internet connection
Verify firewall not blocking
Test with curl to API endpoint
Related Guides
Workflows Guide: Use agents in workflows
Tools Guide: Assign tools to agents
Settings Guide: Configure credentials for models
Performance Guide: Deep analytics on agent performance
Keyboard Shortcuts
💡 Pro Tip: Speed up agent management with keyboard shortcuts
Ctrl/Cmd + K: Quick agent search
Ctrl/Cmd + N: Create new agent
Ctrl/Cmd + E: Edit selected agent
Ctrl/Cmd + D: View agent details
Tab: Navigate between tabs
Esc: Close modals
FAQ
How many agents can I create?
Unlimited for most plans. However, each agent:
Uses tokens when executing (costs money)
Maintains memory (uses storage)
Requires model API access
💡 Recommendation: Start with 3-10 agents, expand as needed.
What's the difference between agent types?
Agent types are templates with pre-configured skills and recommendations:
Same underlying system
Different default skills
Different recommended models
Optimized for specific use cases
You can customize any agent regardless of type.
Can agents learn and improve?
Yes! Agents have:
Memory systems: Remember past successes/failures
Performance tracking: Monitor what works
Adaptive prompting: Improve based on feedback
Pattern recognition: Identify successful strategies
Quality typically improves 20-30% over first 100 tasks.
How do I choose between GPT-4 and Claude?
General guidance (GPT-4 vs Claude):
Code tasks: GPT-4 ⭐⭐⭐⭐⭐ (better) vs Claude ⭐⭐⭐⭐ (good)
Analysis: GPT-4 ⭐⭐⭐⭐ (good) vs Claude ⭐⭐⭐⭐⭐ (better)
Speed: GPT-4 ⭐⭐⭐⭐ (fast) vs Claude ⭐⭐⭐ (moderate)
Context: GPT-4 128K tokens vs Claude 200K tokens
Cost: GPT-4 $$ vs Claude $$$
💡 Tip: Use GPT-4 Turbo as default, switch to Claude for complex analysis.
Can multiple agents work together?
Yes! Use the Coordination tab to:
Create agent teams
Enable shared memory
Configure collaboration strategies
Monitor team performance
See Coordination Tab for details →
What happens if an agent fails?
Depends on configuration:
With Auto-Retry Enabled:
Agent retries automatically (up to configured max)
Uses exponential backoff (waits longer each retry)
Marks as failed if all retries exhausted
Logs error for review
Without Auto-Retry:
Task marked as failed immediately
Error logged
Notification sent (if configured)
Manual retry available
Glossary
Agent: An AI entity that executes tasks using an LLM, skills, and tools
Model: The underlying LLM (GPT-4, Claude, etc.) that powers the agent
Skills: Areas of expertise that enhance agent capabilities
Tools: External integrations the agent can use (GitHub, Slack, etc.)
Temperature: Controls randomness in responses (0=focused, 2=creative)
Tokens: Units of text processed by AI models (1 token ≈ 4 characters)
Quality Score: Rating of task output quality (0.0-1.0)
Working Memory: Temporary memory during task execution
Long-term Memory: Permanent knowledge and learned patterns
Next: Workflows Guide →
Master workflows to orchestrate multi-agent automation
API Reference
Authentication: all API calls require authentication headers.
Create
POST /api/agents
Edit / Delete
Update policy/tools/limits or remove unused agents.
GET /api/agents (list)
GET /api/agents/{id} (detail)
PUT /api/agents/{id} (update)
DELETE /api/agents/{id} (delete)
Runs
Inspect recent runs for a given agent.
GET /api/runs?agent_id={id}&limit=50
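As a sketch only, the example below assumes a bearer-token Authorization header and shows the documented list call; the actual header names and response shape may differ.

```python
import requests

BASE_URL = "https://your-automatos-host"            # placeholder host
HEADERS = {"Authorization": "Bearer <api-token>"}   # assumed header name/format

# List agents (documented endpoint); the response shape is not specified here,
# so it is simply printed as returned.
resp = requests.get(f"{BASE_URL}/api/agents", headers=HEADERS, timeout=30)
resp.raise_for_status()
print(resp.json())
```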
Tips
Start with a conservative budget.
Keep tool lists minimal per agent to reduce ambiguity.