AI Operations Agent

AI Agent for Kubernetes.
Approval-Gated by Design.

Self-hosted control layer for Kubernetes and CI/CD. Converts natural language into typed, auditable tool execution inside your cluster.

Open SourceSelf-HostedNo TelemetryMulti-LLM
The Post-Code Bottleneck

Code Ships Faster Than Ever.
Your Operations Haven't Kept Up.

AI coding tools accelerated development. But deploying, operating, and keeping production alive? That's still manual, fragmented, and dangerously risky.

01

Brittle Scripts & Manual kubectl

Your deployment process is held together by tribal knowledge and shell scripts that break at 2 AM.

Tribal knowledge
02

Fragmented Visibility

Prometheus in one tab, Grafana in another, kubectl in the terminal, Slack on fire. Five tools, zero unified context when it matters most.

Zero unified context
03

Every Mutation Is a Blind Risk

kubectl apply with fingers crossed. No dry-run, no rollback plan, no verification that the change did what you intended.

No safety net
Deterministic Control Loop

Not a Black Box.
A Deterministic Control Loop.

Every infrastructure change follows four auditable steps. The agent plans, you decide, the outcome is verified.

01

Plan

Agent analyzes your intent, discovers resources, and generates a concrete action plan.

Intent → Action plan
02

Approve

Every mutating tool call pauses for explicit human approval. Read operations flow freely. The gate is enforced by the engine.

Human gate on mutations
03

Execute

Typed tools run via MCP. Schema-validated inputs. Sandboxed execution. Full audit trail for every operation.

Typed, sandboxed execution
04

Verify

Agent validates the outcome against your original intent. If state drifts from the plan, it flags the discrepancy and suggests remediation.

Post-action verification

The approval gate is enforced at the engine level. Not a setting. Not configurable off.

See It in Action

Real Operations.
Against a Live Cluster.

Not mockups or scripted demos. Watch Skyflo handle the workflows your team runs every day.

Faster Diagnosis. Safer Changes.
Auditable Operations.

Architecture is table stakes. These are the operational outcomes that matter to your team.

Faster Incident Diagnosis

Agent correlates logs, events, and resource state in a single pass. No more context-switching across dashboards.

Consistent, Auditable Deployments

No more ad-hoc kubectl runs or untracked mutations. Every change is repeatable and auditable.

Approval Gates on Writes, Not Reads

Read operations flow freely. Mutating tool calls require explicit approval. Your developers move fast. Your infrastructure stays safe.

Used in production at

Storylane
Get Started

Your Cluster. Your Agent.
Running in Minutes.

Deploy on your cluster with your own LLM. No Skyflo telemetry or phone-home.

Apache 2.0Self-HostedApproval-Gated
Skyflo for Teams

Ready for Your Team?
Scale with Confidence

Team adds collaboration, governance, and integrations. Same agent. Same control loop. Same approval gates.

Chat Integration

Operate from Slack, Microsoft Teams, and more

SCM Integration

Persist changes to GitHub, GitLab, Bitbucket

AI Alerting Agent

Anomaly correlation and proactive detection

RBAC & Governance

Team permissions, audit trails, SSO

RBAC & Team Management
Audit Trails
Self-Hosted
SSO Compatible
Core Capabilities

An Execution Runtime.
Not a Chat Wrapper.

Every capability maps to an operational outcome.

01

Natural Language to Typed Execution

Describe what you need in plain English. Skyflo converts intent into schema-validated tool calls.

Intent → Execution
02

Unified Cluster Context

Logs, events, resource state, and configuration correlated in one place. Diagnose a CrashLoopBackOff without switching between five terminals.

One interface
03

Graph-Based Workflow Engine

A LangGraph-powered workflow with distinct phases. Deterministic. Replayable. Not a monolithic LLM call.

Deterministic flow
04

Live Agent Reasoning

Agent thoughts, tool progress, and results streamed in real time via SSE.

Real-time streaming
05

Post-Action Verification

The agent validates outcomes against your original intent. Drifts are flagged with suggested remediation.

Outcome validation
06

Extensible via MCP

Every tool follows the Model Context Protocol. Typed inputs, sandboxed execution, defined safety model.

Open standard

Every capability ships with open source.

MCP Tool Ecosystem

64 Infrastructure Tools.
Every One Typed and Auditable.

Schema-validated inputs. Sandboxed execution. Defined safety model per tool.

Kubernetes

Orchestration
22 tools

Discovery, logs, exec, apply, diff

  • Discover resources across namespaces
  • Stream pod logs and exec into containers
  • Drain and cordon nodes safely
  • Preview changes with diff before apply
Read: Auto
Write: Human Approval

Helm

Package Management
16 tools

Search, install, upgrade, rollback

  • Install charts with custom values
  • Upgrade releases with dry-run preview
  • Roll back to any previous revision
  • Manage chart repositories
Read: Auto
Write: Dry-run + Diff

Argo Rollouts

Progressive Delivery
13 tools

Pause, resume, promote, abort

  • Run canary and blue-green deployments
  • Promote or abort with human gate
  • Monitor analysis runs and experiments
  • Track full rollout history and status
Read: Auto
Write: Human Gate

Jenkins

CI/CD
13 tools

Jobs, builds, logs, SCM, identity

  • Manage and trigger build jobs
  • Stream build logs in real time
  • Inspect SCM configurations
  • Authenticate via Kubernetes Secrets
Read: Auto
Write: Secure Auth + CSRF

On the Roadmap

AWS
GCP
Azure
GitHub Actions
GitLab CI

Same typed, sandboxed pattern. All open source.

Open Source. Self-Hosted.

An AI Agent in Your Cluster
Should Be Yours to Audit.

Apache 2.0 licensed. The agent, the control loop, and the safety model are all inspectable and under your control.

01

Full Source Transparency

Every tool call, every decision path, every safety check is in the source.

Apache 2.0 licensed
02

Self-Hosted, In-Cluster

Runs inside your Kubernetes cluster. LLM calls go only to the provider you configure.

Your data stays yours
03

Bring Your Own LLM, No Lock-in

OpenAI, Anthropic, Gemini, Groq, or self-hosted models. Switch providers without changing workflows.

Bring your own LLM
04

Safety Is Not a Premium Feature

Approval gates ship with open source. No feature gates on safety. No usage limits.

Free forever

No black-box agent decisions. No Skyflo telemetry.

Built in the Open

Transparent, auditable, and built for operators managing production Kubernetes.

Open Source

Full source code available under the Apache 2.0 license. Audit every line. No black boxes in your production stack.

View on GitHub

Join Our Channels

Connect with operators and developers building on Skyflo.

Frequently Asked Questions

Common questions about Skyflo and approval-gated operations.

Install and Run Your First Operation

Install Skyflo on your cluster and run your first operation today.

terminal
$curl -fsSL https://skyflo.ai/install.sh | bash