Automatos Platform — Definitive Capability Report

Date: 2026-03-29 Purpose: Correct inaccurate findings from Mission Zero readiness review and provide authoritative reference of all platform capabilities. Status: All code phases complete, deployed, migration applied.

Corrections to Mission Zero Findings

The ATLAS-led readiness review (2026-03-28) identified gaps and constraints. Several findings were incorrect or outdated at time of review, and additional gaps have since been closed. This section corrects the record.

Finding: "Board limited to 5-column workflow"

WRONG. The board now has 6 columns: Inbox, Assigned, In Progress, Review, Blocked, Done.

frontend/types/board.ts:62-69 — BOARD_COLUMNS array defines all 6
orchestrator/api/board_tasks.py:28 — VALID_STATUSES includes "blocked"
orchestrator/services/orchestration_board_bridge.py:41 — blocked tasks sync to blocked (not review)
Blocked cards show red accent, ShieldAlert icon, and blocked reason text
Commit: 044068b22

Finding: "No built-in dependency visualization for DAGs"

PARTIALLY WRONG. Parent-child task hierarchy is fully implemented:

BoardTask.parent_task_id column exists in DB (core/models/core.py:1524)
Board bridge populates parent_task_id when syncing mission sub-tasks
Frontend groups child tasks under parent cards with expand/collapse (Layers icon + count badge)
frontend/hooks/use-board-tasks.ts — builds childCountMap, filters topLevelTasks
Mission tasks render as a tree: mission parent card with N subtask badges
Full DAG visualization (graphical arrows) is a future enhancement, but the hierarchy is navigable now

Finding: "No blocked column on the board"

FIXED. See correction #1 above. Blocked is a first-class status with:

blocked_at timestamp (auto-set on transition)
blocked_reason text (shown on card)
Auto-escalation after 24 hours → creates escalation inbox task
Files: escalation_service.py:24-90, board_tasks.py, board-card.tsx

Finding: "Heartbeat service — basic health checks only; lacks CTO-level orchestration"

WRONG. The heartbeat system is fully featured:

Configurable interval (minutes or cron expression)
Active hours with timezone support
Proactive levels (0-100 scale)
Auto-reporting: generates structured report from heartbeat results
platform_configure_agent_heartbeat tool exists for agents to self-configure
12 API endpoints for heartbeat management (config, run, history, analytics, toggle)
Auto's heartbeat prompt can call ANY platform tool — analytics, board, reports, monitoring — enabling full CTO-level orchestration
Files: orchestrator/api/heartbeat.py, orchestrator/services/heartbeat_service.py

Finding: "No pre-built Report-Validator agent"

NOT NEEDED AS AN AGENT. Report validation is built into the submission tool:

platform_submit_report accepts optional required_sections parameter
Validates markdown content contains required ##/### headings (case-insensitive)
Returns specific error: "Report missing required sections: Summary, Metrics"
Playbook templates can enforce standardized structure without a separate agent
Files: handlers_reports.py:33-40, actions_reports.py:59-62
Commit: 778b4a4a4

Finding: "Reports — no advanced search capabilities"

WRONG. Reports have:

Filter by agent, report_type, date range, status
Full-text content in markdown (searchable)
Per-agent report history endpoint
Cross-agent access via platform_get_latest_report
Report stats endpoint (count by type, agents, dates)
Star rating / grading system (POST /reports/{id}/grade)
Files: orchestrator/api/reports.py

Finding: "Limited to 7-day rolling windows; no long-term trend analysis"

PARTIALLY WRONG. The enhanced analytics suite provides configurable time ranges:

platform_get_cost_per_execution — 30-day daily breakdown
platform_get_llm_usage — configurable days parameter
platform_get_cost_breakdown — configurable period
platform_workspace_stats — today/7d/30d periods
NL2SQL (platform_query_data) can query ANY time range from the database directly
Long-term trend analysis IS possible via direct database queries

Finding: "Playbooks — limited to predefined steps; no dynamic adjustments"

WRONG. Playbooks support:

10 CRUD tools (create, update, add/update/delete steps, execute, get execution)
Execution config: mode (manual/auto/approval), max_retries, timeout_per_step, quality_threshold
Schedule config: manual, cron expressions, or trigger-based
Input variables passed at runtime (dynamic parameterization)
Step types include AI generation (LLM calls with dynamic prompts), HTTP requests, conditionals, integrations
Agents can create AND modify playbooks at runtime via platform tools
Files: actions_playbooks.py, orchestrator/api/workflow_recipes.py

Finding: "Requires human approval before [mission] execution"

CONFIGURABLE, NOT MANDATORY. Missions support multiple approval modes:

review_mode: "human" | "llm" | "auto"
Board tasks can be set to auto_approve for autonomous execution
auto_execution flag on tasks enables agents to auto-claim and run
Authority enforcement via blueprints: enforce_mode: "advisory" (warn only) or "strict" (block)
Human approval is the conservative default, but fully configurable per task/mission

Finding: "No unified dashboard for cross-system monitoring"

WRONG. The Command Centre dashboard provides exactly this:

12 widgets covering: Active Agents, Task Queue, Recent Activity, System Health, Errors, Cost Tracker, Agent Performance, Playbook Metrics, Approval Gates, and more
Widget picker: show/hide toggles, drag-to-reorder, persisted in localStorage
Widget registry pattern ready for marketplace extensions
KPI endpoints: /api/kpi/cost-overview, /api/kpi/agent-performance, /api/kpi/playbook-metrics, /api/kpi/approval-gates
Analytics dashboard: workflow trends, agent performance, skill demand, execution reports

Finding: "Authority boundaries described but not codified into platform-enforced policies"

FIXED. Authority enforcement is now live:

AgentBlueprint model with configurable rules: min_tools, require_system_prompt, required_tags, allowed_models
enforce_mode: "strict" blocks dispatch, "advisory" logs warnings
check_authority() called by dispatcher before every task claim
Strict mode failures: task → FAILED with authority_denied reason
6 governance tools: list/get/create/update blueprints, validate agent, check budget
Files: blueprint_validator.py, dispatcher.py, actions_governance.py
Commit: 4f03deee0

Finding: "No escalation framework"

FIXED. Three escalation types are operational:

Blocked >24h → auto-creates high-priority inbox task for human review (idempotent)
Budget exceeded → creates urgent inbox task with spend details + resume instructions
Repeated stalls (2+) → creates high-priority task flagging the stuck agent

All escalations are tagged for deduplication (escalation, blocked:{id}, budget:{id}, stalled:{id})
Files: escalation_service.py (203 lines), wired into reconciler.py and available for coordinator
Commit: 4f03deee0

Finding: "Budget — currently advisory only"

FIXED. Budget enforcement is now end-to-end:

budget_config on OrchestrationRun: max_cost, max_tokens, alert_at_pct
budget_spent tracking: cost, tokens, api_calls
Dispatcher transitions run to PAUSED when budget exceeded
platform_check_budget returns status: ok/warning/exceeded with remaining amounts
Resume API: POST /api/v1/missions/{run_id}/resume accepts additional_tokens and additional_cost to increase budget before resuming
Budget exceeded notification creates urgent board task
Files: missions.py (resume endpoint), dispatcher.py (pause logic), escalation_service.py (notification)

Finding: "No formal review cadence playbooks exist"

INFRASTRUCTURE EXISTS. The playbook + scheduler system fully supports this:

Cron-triggered playbooks operational (e.g., "0 8 * * *" for daily 8am)
Multi-step playbooks with agent-per-step, scratchpad data passing, quality thresholds
Auto needs to configure (not build) these: Daily CEO Briefing, Weekly Business Review, Monthly KB Audit
This is a Phase 4 configuration task, not a code gap

Finding: "Workspace filesystem — no version control"

PARTIALLY ADDRESSED. workspace_git tool provides:

Full git CLI access (status, log, diff, commit, push, branch, etc.)
Agents can version-control their workspace files
Not automatic versioning of every file, but git-based version control is available

Finding: "No integration with external calendars"

AVAILABLE VIA COMPOSIO. Google Calendar integration is possible:

composio_execute(app_name='GOOGLE_CALENDAR', ...) — requires OAuth connection
Can read events, create events, check availability
Not deeply integrated as a native feature, but functional via the Composio integration layer

Complete Platform Capability Inventory

98 Platform Tools Across 18 Domains

Domain

Count

Tools

Agents

list, get, create, update, delete, configure_heartbeat

Analytics (base)

llm_usage, cost_breakdown, workspace_stats, activity_feed, query_data (NL2SQL)

Analytics (enhanced)

success_rate, completion_time, error_rates, queue_depth, efficiency_score, cost_per_execution, peak_hours, bottlenecks, predictive_alerts, agent_ranking, sla_compliance

Assignments

assign_tool, assign_skill, assign_plugin

Board Tasks

create, list, summary, get, assign, update_status

Blog

publish, list, get, update

Documents

list, delete, reprocess

Field (shared context)

query, inject, stability

Governance

list/get/create/update blueprints, validate_agent, check_budget

Marketplace

browse plugins/agents/skills, list workspace plugins/skills/models, install plugin/skill/model

Missions

create, list, get

Monitoring

loki_logs, prometheus, alerts, railway_logs, list_services, system_health

Playbooks

list, get, create, update, add/update/delete step, execute, get_execution, delete

Reports

submit (with required_sections), get_latest

Scheduling

schedule, list, cancel

Search

chat_history, memory

Tools/LLMs

list_tools, list_llms, list_datasources

Workspace

info, memory_stats, store_memory, browse/delete_memories, connected_apps, system_health

Workspace I/O

read_file, write_file, list_dir, grep, exec, git

API Surface: 100+ Endpoints

Key API routers:

/api/v1/agents — Agent CRUD + config
/api/v1/tasks — Board task management (6 columns, drag-drop, bulk ops)
/api/v1/missions — Mission lifecycle (create, list, get, resume)
/api/v1/heartbeat — 12 endpoints (config, run, history, analytics, toggle)
/api/v1/reports — Submit, list, get, grade, download, stats
/api/v1/recipes — Playbook CRUD + execution + marketplace
/api/v1/documents — Upload, search, reprocess, analytics
/api/v1/analytics — Workflow trends, agent performance, skill demand
/api/v1/kpi — Cost overview, agent performance, playbook metrics, approval gates
/api/v1/composio — App connections, agent features, OAuth
/api/v1/memory — Store, search, consolidate, backup/restore
/api/v1/blog — Publish, list, get, update
/api/v1/scheduled-tasks — Cron scheduling
/api/v1/blueprints — Governance rules

Core Systems

System

Capability

Status

Board (Kanban)

6-column workflow, parent-child hierarchy, SLA deadlines, blocked escalation, drag-drop, review modes (human/llm/auto)

Fully operational

Mission Orchestrator

Goal decomposition, DAG execution, parallel dispatch, budget tracking, pause/resume, replanning, stall detection, authority checks

Fully operational

Escalation Engine

Blocked >24h auto-escalate, budget exceeded notification, repeated stall escalation (2+), idempotent dedup

Fully operational

Governance (Blueprints)

Configurable rules, strict/advisory enforcement, pre-dispatch authority check, agent readiness badge

Fully operational

Analytics Engine

16 analytics tools, KPI dashboard, workflow + mission efficiency, agent ranking, SLA compliance, predictive alerts

Fully operational

Reports

6 types, required_sections validation, auto-report from heartbeat, grading, cross-agent access

Fully operational

Heartbeat

Configurable interval/cron, active hours, proactive levels, auto-reporting, 12 API endpoints

Fully operational

Playbooks

10 CRUD tools, cron scheduling, multi-step with agent-per-step, scratchpad data passing, quality thresholds

Fully operational

Workspace I/O

Read/write files, directory listing, regex grep, sandboxed exec, git operations

Fully operational

Memory

Store, search (semantic), browse, delete, daily logs, smart consolidation, backup/restore

Fully operational

Monitoring

Loki logs, Prometheus metrics, infrastructure alerts, Railway deploy logs, system health

Fully operational

Integrations

100+ Composio apps (Slack, Telegram, Gmail, Jira, GitHub, Google Calendar, etc.)

Fully operational

Documents/RAG

Upload, chunk, embed, semantic search, cloud sync, reprocess

Fully operational

NL2SQL

Natural language to SQL, multi-table joins, aggregations, any time range

Fully operational

Blog

Publish, list, get, update, cover images, categories, tags

Fully operational

Shared Field

Per-mission semantic field for cross-agent context, query/inject/stability

Fully operational

Marketplace

Browse agents/plugins/skills, install to workspace, inventory management

Fully operational

Scheduler

One-shot (ISO datetime) and recurring (cron), max_runs limit, per-agent scheduling

Fully operational

Frontend Features

Feature

Status

6-column Kanban board (Inbox, Assigned, In Progress, Review, Blocked, Done)

Live

Parent-child task grouping with expand/collapse badges

Live

SLA deadline indicators (amber approaching, red overdue)

Live

Blocked card styling (red accent, shield icon, reason text)

Live

Widget picker (12 widgets, show/hide toggles, drag-to-reorder)

Live

Cost Tracker widget (sparkline, top spenders, period change)

Live

Agent Performance widget (color-coded bars by success rate)

Live

Playbook Metrics widget (runs, success %, avg duration)

Live

Approval Gates widget (pending count, avg time, waiting list)

Live

Agent Readiness Badge (green/yellow/red shield from blueprint validation)

Live

Reports tab with grid, filters, viewer, grading form

Live

Mobile navigation (Activity + Workspace in burger menu)

Live

Mission control (create, monitor, task list)

Live

Chat modes (standard, mission mode)

Live

What Auto (CTO) Can Do Right Now

Self-Monitor

"What's our success rate?" → platform_get_success_rate (real data from executions)
"Any bottlenecks?" → platform_get_bottlenecks (failure rates, queue buildup, slow executions)
"Are costs going up?" → platform_get_cost_per_execution (30-day daily breakdown with trend)
"Who's the top performer?" → platform_get_agent_ranking (composite scoring)
"Are we meeting SLAs?" → platform_get_sla_compliance (completion rate + response time)
"Any predicted issues?" → platform_get_predictive_alerts (cost spikes, capacity, rate limits)
"What happened today?" → platform_get_activity_feed (chats, recipes, errors)

Manage the Board

Create tasks with priority, assignment, SLA deadlines, parent-child links
Move tasks through 6-column workflow
See board summary (counts by status/priority, busiest agents, failed tasks)
Blocked tasks auto-escalate after 24h — no manual monitoring needed

Govern Quality

Create blueprints: "All agents must have a system prompt and at least 2 tools"
Validate agents against blueprints before dispatch (strict mode blocks, advisory warns)
Check mission budgets and get alerts at configurable thresholds
Auto-pause missions that exceed budget, resume with increased budget

Orchestrate Missions

Decompose goals into parallel/sequential task DAGs
Dispatch tasks to best-fit agents automatically
Track progress, detect stalls, recover or escalate
Budget-aware execution with hard pause on exceeded

Run Recurring Operations

Schedule playbooks with cron: daily briefings, weekly reviews, monthly audits
Each playbook can chain multiple agents with scratchpad data passing
Quality thresholds ensure output meets standards before delivery

Communicate

Post to Slack channels, send Telegram messages, compose emails via Gmail
100+ Composio integrations available for external system interaction
Report findings via structured reports (6 types, grading, required sections)

Monitor Infrastructure

Query Loki logs by service, severity, keyword (up to 7 days)
Query Prometheus metrics (health, error_rate, latency, postgres, redis)
Check firing/resolved alerts with severity filtering
Pull Railway deployment logs per service

Manage Knowledge

Store and search memories (semantic)
Upload documents to RAG, semantic search across knowledge base
Read/write workspace files, run commands, use git
Query any database table via natural language (NL2SQL)

What Needs Configuration (Not Code)

These are Phase 4 items — all use existing platform tools, no code changes required:

Item

Tool to Use

Status

Auto heartbeat: 15min, proactive, active hours

platform_configure_agent_heartbeat

Ready to configure

Daily CEO Briefing playbook (cron 0 8 * * *)

platform_create_playbook + steps

Ready to configure

Weekly Business Review playbook (cron 0 9 * * 1)

platform_create_playbook + steps

Ready to configure

Monthly KB Audit playbook (cron 0 10 1 * *)

platform_create_playbook + steps

Ready to configure

Channel matrix document

workspace_write_file

Ready to configure

Report template with required sections

workspace_write_file

Ready to configure

Authority boundary documentation

workspace_write_file

Ready to configure

Metric baselines snapshot

workspace_write_file

Ready to configure

Auto failover runbook

workspace_write_file

Ready to configure

Summary

The platform has 98 registered tools, 100+ API endpoints, 16 core systems — all fully operational. The Mission Zero review identified gaps that were either already solved, have since been fixed, or require only configuration (not code). The system is ready for a clean GO verdict.

PreviousMISSION-ZERO NextPlatform Readiness Report — Mission Zero Enablement

Last updated 3 days ago

Good afternoon

hashtagCorrections to Mission Zero Findings

hashtagFinding: "Board limited to 5-column workflow"

hashtagFinding: "No built-in dependency visualization for DAGs"

hashtagFinding: "No blocked column on the board"

hashtagFinding: "Heartbeat service — basic health checks only; lacks CTO-level orchestration"

hashtagFinding: "No pre-built Report-Validator agent"

hashtagFinding: "Reports — no advanced search capabilities"

hashtagFinding: "Limited to 7-day rolling windows; no long-term trend analysis"

hashtagFinding: "Playbooks — limited to predefined steps; no dynamic adjustments"

hashtagFinding: "Requires human approval before [mission] execution"

hashtagFinding: "No unified dashboard for cross-system monitoring"

hashtagFinding: "Authority boundaries described but not codified into platform-enforced policies"

hashtagFinding: "No escalation framework"

hashtagFinding: "Budget — currently advisory only"

hashtagFinding: "No formal review cadence playbooks exist"

hashtagFinding: "Workspace filesystem — no version control"

hashtagFinding: "No integration with external calendars"

hashtagComplete Platform Capability Inventory

hashtag98 Platform Tools Across 18 Domains

hashtagAPI Surface: 100+ Endpoints

hashtagCore Systems

hashtagFrontend Features

hashtagWhat Auto (CTO) Can Do Right Now

hashtagSelf-Monitor

hashtagManage the Board

hashtagGovern Quality

hashtagOrchestrate Missions

hashtagRun Recurring Operations

hashtagCommunicate

hashtagMonitor Infrastructure

hashtagManage Knowledge

hashtagWhat Needs Configuration (Not Code)

hashtagSummary

Corrections to Mission Zero Findings

Finding: "Board limited to 5-column workflow"

Finding: "No built-in dependency visualization for DAGs"

Finding: "No blocked column on the board"

Finding: "Heartbeat service — basic health checks only; lacks CTO-level orchestration"

Finding: "No pre-built Report-Validator agent"

Finding: "Reports — no advanced search capabilities"

Finding: "Limited to 7-day rolling windows; no long-term trend analysis"

Finding: "Playbooks — limited to predefined steps; no dynamic adjustments"

Finding: "Requires human approval before [mission] execution"

Finding: "No unified dashboard for cross-system monitoring"

Finding: "Authority boundaries described but not codified into platform-enforced policies"

Finding: "No escalation framework"

Finding: "Budget — currently advisory only"

Finding: "No formal review cadence playbooks exist"

Finding: "Workspace filesystem — no version control"

Finding: "No integration with external calendars"

Complete Platform Capability Inventory

98 Platform Tools Across 18 Domains

API Surface: 100+ Endpoints

Core Systems

Frontend Features

What Auto (CTO) Can Do Right Now

Self-Monitor

Manage the Board

Govern Quality

Orchestrate Missions

Run Recurring Operations

Communicate

Monitor Infrastructure

Manage Knowledge

What Needs Configuration (Not Code)

Summary