Your AI SRE,
on every server.
Install one lightweight agent. Ask questions in plain English. OpsPilot diagnoses problems, runs safe read-only commands, and explains the root cause — no 2am phone calls.
Works with Ollama (air-gapped), Claude, and AWS Bedrock. Linux · macOS · Windows.
Need a human engineer right now? Emergency support →
/var/log/nginx/access.log is 38 GB. Log rotation configured but hasn't run since the nginx restart.truncate -s 0 /var/log/nginx/access.logSound familiar?
These aren't edge cases. They happen every week.
The 3am disk alert
You SSH in. Run df. Run du. Grep through logs. Forty-five minutes later you find a stale nginx log nobody rotated. You go back to sleep. It happens again next week.
OpsPilot solves this.
The new on-call engineer
Something's broken on prod. They know it's broken. They don't know which logs to check, which commands are safe to run, or which service is downstream of what. They page the senior dev.
OpsPilot solves this.
The compliance audit
Auditor asks: show us every command run on prod last month. You have rsync timestamps and tribal knowledge. You have nothing.
OpsPilot solves this.
Three steps to AI-powered ops
From blank server to intelligent diagnostics in under two minutes.
Install the agent
One curl command. The agent registers itself, starts sending heartbeats, and is ready in under 60 seconds.
$ curl -sSL "https://cp.911devops.com/install/linux?api_key=YOUR_KEY" | sudo bash ✓ Downloading agent v0.9.4... ✓ Verifying SHA256... ✓ Installing systemd service... ✓ Registered: web01.acme.com → control plane Agent online. Heartbeat active.
Ask in plain English
No commands to memorize. Describe the problem exactly as you'd say it to a senior SRE.
opspilot > why is web01 running out of disk? Thinking... analyzing web01
AI reasons and explains
OpsPilot runs diagnostic steps, observes each result, and returns a plain-English root cause with a recommended fix.
→ Checking disk usage... df -h ✓
→ Finding large directories... du -sh /* ✓
→ Checking recent writes... stat ... ✓
⚡ Root cause identified:
/var/log/nginx/access.log 38 GB
Last modified: 8 days ago. No active writes.
Fix: truncate -s 0 /var/log/nginx/access.log
(or configure logrotate — see /etc/logrotate.d/)Every feature ships working
Not a demo. 1,695 tests passing. Every capability is live today.
Plain-English diagnostics
Ask exactly what you'd ask a senior SRE. OpsPilot translates it into the right safe command sequence and explains findings in full sentences.
Safe by design
Hard-coded DENY list + YAML policy + model self-assessed risk. rm, kill, reboot, dd never run — regardless of what the AI suggests. Three independent layers.
Any AI backend
Local Ollama (air-gapped, nothing leaves your network), Claude API, AWS Bedrock. Switch in config.yaml — no code changes, no restart.
Full audit trail
Every command logged: actor, capability, risk level, duration, exit code. SOC 2 / HIPAA ready from day one. Query the log via API.
Cross-platform agents
Linux (systemd), macOS (launchd), Windows (NSSM). Same control plane. Same dashboard. Same API. One install command per OS.
Auto-updating agents
On startup and every hour, agents check for updates, download, verify SHA256, and restart in place. Zero manual updates across your fleet.
Pay for what you use
Start free with up to 5 servers. Upgrade when you need cloud AI or more scale.
Basic
Pro
Elite
Basic is free while we're in early access. Avg. response times vary with AI backend and server load.
Need a human engineer?
OpsPilot handles routine diagnostics 24/7. For architecture reviews, migrations, and emergencies — 911 DevOps engineers are on call.
Ready to stop guessing
what's wrong?
Install in 60 seconds. Get your first AI diagnosis in under 5 minutes. No credit card required to start.