Open Source · Free Tier with full analysis stack

When Kubernetes Breaks,
Fix It in 4 Minutes, Not 40.

AI root-cause analysis + auto-fix pull requests. Your team reviews and merges.

Most Kubernetes incidents involve 30–60 minutes of manual investigation. KI-Ops eliminates the diagnosis. Claude AI analyzes your cluster in seconds. You get specific action items or auto-generated PRs.

ki-ops · production-cluster

FREE Cluster Analysis

$ ki-ops analyze --namespace production

▸ kubectl get pods · Grafana queries · Loki logs

⚠ pod/api-gateway CrashLoopBackOff (OOMKilled 3x)

★ LLM recommendation: increase memory limit from 256Mi to 512Mi

   + set HPA minReplicas from 1 to 2

PRO Auto-Fix

$ ki-ops fix --auto-pr

✓ deployment.yaml updated (memory: 256Mi → 512Mi)

✓ hpa.yaml updated (minReplicas: 1 → 2)

✓ kubectl dry-run validated · helm template OK

▸ PR #347 opened: "fix: api-gateway OOM + HPA scaling"

DevOps Teams Lose 40% of Their Time to Manual Incident Investigation

Your on-call engineer gets an alert. Then comes the detective work: kubectl describe → grep logs → check Grafana → diff YAML → Stack Overflow. 30 minutes later, the root cause is found. The fix takes 5 minutes.

🔥

30-min investigations, 5-min fixes

Your on-call engineer spends 30–45 minutes grepping logs, diffing YAMLs, and Googling fixes. The actual fix? 5 minutes. Repeat 3–4 times per week.

🔄

No learning, only repetition

kubectl get pods, kubectl describe, grep logs, check Grafana. The same diagnostic steps every incident. No automation. No learning. Just Groundhog Day.

💸

Tools cost more than fixing

Datadog: €300/user/year. PagerDuty: €250/user/year. For a 5-person team: €2,750+/year in tools — and you’re still manually diagnosing every incident.

FREE

Full Kubernetes Diagnostics. Zero Cost.

The free tier is not a demo. You get the complete incident analysis stack: cluster diagnostics, log analysis, and AI-powered root-cause identification. You bring your own LLM key (BYOK). No artificial limits.

🔍

Kubernetes cluster analysis

Full diagnostics across Pods, Deployments, Services, StatefulSets, and Events. No config needed — just point KI-Ops at your kubeconfig. Instant results.

run_kubectl · get_pods · describe_node
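
To illustrate the kind of check this step automates, here is a minimal sketch that scans the JSON returned by `kubectl get pods -o json` and flags containers that are in CrashLoopBackOff or were last terminated with OOMKilled. The function name `find_unhealthy` and the sample data are hypothetical; this is not KI-Ops's actual implementation, only an approximation of the manual triage it replaces.

```python
def find_unhealthy(pod_list: dict) -> list:
    """Return one finding per container that is crash-looping or was OOM-killed.

    `pod_list` is the parsed output of `kubectl get pods -o json`.
    """
    findings = []
    for pod in pod_list.get("items", []):
        pod_name = pod["metadata"]["name"]
        for cs in pod.get("status", {}).get("containerStatuses", []):
            # Current state (e.g. waiting in CrashLoopBackOff) and the reason
            # the previous container instance terminated (e.g. OOMKilled).
            waiting = cs.get("state", {}).get("waiting") or {}
            last = cs.get("lastState", {}).get("terminated") or {}
            reasons = {waiting.get("reason"), last.get("reason")}
            if reasons & {"CrashLoopBackOff", "OOMKilled"}:
                findings.append({
                    "pod": pod_name,
                    "container": cs["name"],
                    "restarts": cs.get("restartCount", 0),
                    "reasons": sorted(r for r in reasons if r),
                })
    return findings


# Hypothetical sample that mirrors the api-gateway example on this page.
sample = {
    "items": [
        {
            "metadata": {"name": "api-gateway"},
            "status": {
                "containerStatuses": [
                    {
                        "name": "gateway",
                        "restartCount": 3,
                        "state": {"waiting": {"reason": "CrashLoopBackOff"}},
                        "lastState": {"terminated": {"reason": "OOMKilled"}},
                    }
                ]
            },
        },
        {
            "metadata": {"name": "worker"},
            "status": {
                "containerStatuses": [
                    {"name": "worker", "restartCount": 0, "state": {"running": {}}}
                ]
            },
        },
    ]
}

findings = find_unhealthy(sample)
for f in findings:
    print(f"{f['pod']}/{f['container']}: {', '.join(f['reasons'])} ({f['restarts']} restarts)")
```

In practice the input would come from `json.loads` applied to the real `kubectl` output rather than a hand-written dictionary.
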
📊

Grafana & Loki log analysis

Query your dashboards and logs directly. AI surfaces anomalies, error spikes, and correlation patterns you’d manually find in 30 minutes.

query_loki · query_grafana
🌐

Network & DNS troubleshooting

DNS resolution, service reachability, network path analysis. Catch misconfigurations before they cascade into production incidents.

check_dns · check_service
🧠

AI-powered root-cause analysis

Claude AI generates specific, validated action items for every incident. Not generic advice — context-aware recommendations based on your actual cluster state.

BYOK
📚

Kubernetes best practices library

Built-in runbooks for Kubernetes, Helm, Terraform, and Docker. AI cross-references best practices in every analysis.

💚

Cluster health tracking & trends

Real-time health status, scheduler metrics, node health, and historical trends. Catch degradation early before incidents happen.

PRO

Stop Writing Fixes Manually.
Let AI Generate Them.

Pro adds auto-remediation to diagnosis. Every fix is validated (kubectl dry-run, Helm template, Terraform plan) before the PR is created. Your team reviews and merges. Done.

🔧

Auto-generated fix PRs

Claude generates Kubernetes manifests and opens PRs automatically. No manual YAML editing. Ready to merge. Multi-file support for complex fixes.

propose_file_change · create_new_file · create_pr
📁

GitHub/GitLab integration

Connect all your repos. KI-Ops reads, searches, and modifies files across repositories. Automatic branch creation, PR opening, and post-merge cleanup.

list_repos · read_repo_file · search_repo

Pre-merge validation

Every fix passes validation before PR creation: kubectl dry-run, Helm template check, Terraform plan, Dockerfile security lint. Merge with confidence.

validate_yaml · kubectl_dry_run · helm_template
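
As a rough sketch of what a pre-merge validation step can look like, the snippet below builds the standard `kubectl apply --dry-run=client` and `helm template` commands and runs them in sequence. The helper names `validation_commands` and `run_checks` and the file paths are hypothetical, and this is an assumption about the general approach rather than KI-Ops's actual code.

```python
import subprocess
from typing import Callable, List, Optional, Tuple


def validation_commands(manifest_path: str, chart_dir: Optional[str] = None) -> List[List[str]]:
    """Build the CLI invocations used to validate a proposed fix."""
    # Client-side dry run: checks that the manifest parses and is well-formed
    # without changing anything in the cluster.
    commands = [["kubectl", "apply", "--dry-run=client", "-f", manifest_path]]
    if chart_dir is not None:
        # Render the chart locally to catch templating errors.
        commands.append(["helm", "template", chart_dir])
    return commands


def run_checks(commands: List[List[str]],
               runner: Callable = subprocess.run) -> List[Tuple[List[str], bool]]:
    """Run each command and report whether it exited with status 0."""
    results = []
    for cmd in commands:
        try:
            proc = runner(cmd, capture_output=True, text=True, check=False)
            ok = proc.returncode == 0
        except FileNotFoundError:
            # The CLI binary is not installed on this machine.
            ok = False
        results.append((cmd, ok))
    return results
```

A caller would run `run_checks(validation_commands("deployment.yaml", "charts/api-gateway"))` and open the PR only if every check reports success. The `runner` parameter exists so the logic can be exercised without `kubectl` or `helm` installed.
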
🚀

Full GitOps workflow

Feature branch creation → PR with incident context → auto-cleanup after merge. Zero manual git work. One-click incident response.

create_branch · cleanup_branches

How it works

From Install to First Analysis in 5 Minutes

Self-hosted, BYOK, no vendor lock-in. You bring your Claude API key. We bring the incident response toolchain. Your data never leaves your infrastructure.

🔑1

Add your API key

Enter your own Claude API key in the setup wizard. Your data, your key, your control. Under 2 minutes.

🔍2

Connect your cluster

Point KI-Ops at your kubeconfig, optionally connect Grafana and Loki. First analysis runs immediately.

🎯3

Get fixes, not just alerts

AI root-cause analysis with specific action items. With Pro: auto-generated PRs ready to merge in under 4 minutes.

Pricing

One price. Your whole team.

No per-user pricing. No subscription stress. One license key, one year, all features.

Community

For individuals and small teams

0 € forever
  • Cluster analysis (kubectl, Pods, Nodes, Events)
  • Grafana & Loki log analysis
  • DNS & network checks
  • LLM recommendations (text output)
  • Knowledge base & best practices
  • Health dashboard & scheduler
  • BYOK (bring your own Claude API key)
  • Unlimited analyses
Get Started Free

BYOK: bring your own Claude API key

Pro

POPULAR

For teams that want auto-remediation

250 €/year

~21 €/month · whole team, not per user

  • Everything in Community, plus:
  • Git integration & multi-repo
  • Auto-fix pull requests
  • YAML / kubectl dry-run validation
  • Helm & Terraform validation
  • Dockerfile lint
  • Branch cleanup service
  • PR creation (multi-file)

1 license · 1 year · Instant activation

Enterprise

For large organizations

Contact Us
  • Everything in Pro, plus:
  • SSO (SAML / OIDC)
  • Audit log & compliance
  • Multi-cluster support
  • Priority support & SLA
  • Custom integrations
  • On-premise deployment
  • Dedicated account manager
Get in Touch

The real cost of competing tools

Datadog Incident Mgmt: €300/user/yr × 5 = €1,500/yr
PagerDuty: €250/user/yr × 5 = €1,250/yr
Komodor: €400+/user/yr × 5 = €2,000+/yr
KI-Ops Pro: €250/yr whole team

Plus: Those tools only monitor. KI-Ops also diagnoses and auto-remediates.

Manage subscription

Enter your license key to manage payments and your subscription.

See Your ROI Live

How Many Incidents Until KI-Ops Pays for Itself?

Adjust your team's incident frequency and troubleshooting time. Most teams break even in 3–7 days.

Your team

5 engineers
8 incidents/week
35 min per incident
85 €/h

Your savings with KI-Ops Pro

Time saved per year: 1,075 h (based on 8 incidents/week)
Money saved per year: 91,347 € (at 85 €/h)
Net savings: 91,097 € (after KI-Ops Pro cost)
ROI multiplier: 365x return on investment

KI-Ops Pro: 250 €/year pays for itself in about 1 day. Flat team license, no per-user pricing.

KI-Ops reduces mean time to resolution (MTTR) to ~4 min per incident via automated root-cause analysis, log triage, and auto-fix PRs. Calculations assume 5 engineers handling 8 incidents/week across 52 working weeks.
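
The figures above can be reproduced with a short calculation. The sketch below is an inferred reconstruction, not the calculator's published formula: it assumes the time saved per incident is the troubleshooting time minus the ~4 min MTTR, and that the saving is multiplied by the number of engineers. Under those assumptions, the default inputs (5 engineers, 8 incidents/week, 35 min, 85 €/h) yield the numbers shown. The names `Roi` and `estimate_roi` are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Roi:
    hours_saved: float
    money_saved: float
    net_savings: float
    multiplier: float
    payback_days: float


def estimate_roi(engineers: int, incidents_per_week: int, minutes_per_incident: float,
                 hourly_rate_eur: float, mttr_min: float = 4, weeks: int = 52,
                 license_eur: float = 250) -> Roi:
    """Estimate yearly savings; the formula is an assumption inferred from the page."""
    # Minutes saved per incident once resolution drops to the assumed MTTR.
    minutes_saved = max(minutes_per_incident - mttr_min, 0)
    hours_saved = engineers * incidents_per_week * weeks * minutes_saved / 60
    money_saved = hours_saved * hourly_rate_eur
    # Days of savings needed to cover the yearly license fee.
    payback_days = license_eur / money_saved * 365 if money_saved > 0 else float("inf")
    return Roi(
        hours_saved=hours_saved,
        money_saved=money_saved,
        net_savings=money_saved - license_eur,
        multiplier=money_saved / license_eur,
        payback_days=payback_days,
    )


r = estimate_roi(engineers=5, incidents_per_week=8, minutes_per_incident=35, hourly_rate_eur=85)
print(f"Time saved per year:  {r.hours_saved:,.0f} h")
print(f"Money saved per year: {r.money_saved:,.0f} €")
print(f"Net savings:          {r.net_savings:,.0f} €")
print(f"ROI multiplier:       {r.multiplier:.0f}x")
print(f"Payback period:       {r.payback_days:.1f} days")
```

Changing any input shows how sensitive the result is to the assumption that every engineer is tied up by every incident; dividing by the team size gives a more conservative estimate.
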

FAQ

Common Questions

Everything you need to know before getting started.

What is KI-Ops?

KI-Ops is an AI-powered Kubernetes troubleshooting tool that analyzes clusters, logs, and metrics using LLMs and delivers actionable recommendations. The free tier includes full diagnostics; Pro adds auto-fix pull requests.

Is my cluster data safe?

Yes. KI-Ops is fully self-hosted. Your data never leaves your infrastructure. We use BYOK (Bring Your Own Key) so you control the LLM connection. No telemetry is sent to us.

What does the Claude API cost?

You bring your own Claude API key (BYOK). Typical usage costs $5–15/month depending on cluster size and analysis frequency. KI-Ops itself is free (Community) or 250€/year (Pro).

Does KI-Ops work with EKS, GKE, and AKS?

Yes. KI-Ops works with any Kubernetes cluster accessible via kubeconfig — EKS, GKE, AKS, on-premise, k3s, kind, and more. Optional Grafana/Loki integration is available for enhanced diagnostics.

How long does setup take?

Under 5 minutes. Install via Helm chart, add your Claude API key, and point to your kubeconfig. The first analysis runs immediately.

Can I use KI-Ops without a Pro license?

Absolutely. The free Community tier includes full cluster analysis, log/metric analysis, DNS checks, LLM recommendations, knowledge base, and a health dashboard. Pro adds auto-fix PRs, Git integration, and validation.

Free vs. Pro: the full breakdown

Feature | Community | Pro
kubectl cluster analysis | ✓ | ✓
Grafana / Loki log analysis | ✓ | ✓
DNS & network checks | ✓ | ✓
LLM recommendations (text) | ✓ | ✓
Knowledge base | ✓ | ✓
Health dashboard | ✓ | ✓
Git repo access (read/search) | - | ✓
Auto-fix file changes | - | ✓
Pull request creation | - | ✓
YAML / Helm / Terraform validation | - | ✓
Branch lifecycle management | - | ✓
Multi-repo support | - | ✓

Trusted by DevOps Teams

Open source, self-hosted, and built for teams who value data sovereignty.

250€/yr
Entire team, not per user
Self-Hosted
BYOK: your data stays with you
Open Source
Fully transparent, no vendor lock-in
GDPR Compliant
🇩🇪Made in Germany
Your Data Stays On-Prem
Open Source (MIT)

Open Source & Growing

Built in the open. Trusted by DevOps teams worldwide.

Star us on GitHub

100%

Open Source

< 5 min

Setup time

250€/yr

Whole team, not per user

Your Next Kubernetes Incident
Resolved in Under 5 Minutes.

Start with free diagnostics. See the AI analysis. When you're ready for auto-fix PRs, unlock Pro: 250 €/year, whole team.

Self-hosted · BYOK (Bring Your Own Key) · No vendor lock-in · Open Source (MIT)