Main FeaturesΒΆ
Robusta integrates HolmesGPT to provide AI-powered root cause analysis for Kubernetes alerts and issues.
What HolmesGPT DoesΒΆ
- Automatic Investigation
When alerts fire, HolmesGPT automatically:
Analyzes pod logs and events
Examines resource metrics and limits
Checks recent deployments and changes
Investigates related resources and dependencies
Provides actionable recommendations
- Multi-Source Analysis
HolmesGPT pulls data from:
Kubernetes API (pods, nodes, events, logs)
Prometheus metrics
Cloud provider APIs (AWS, Azure, GCP)
Application monitoring (Datadog, New Relic)
Log aggregation systems (Loki, OpenSearch, Coralogix)
And more via extensible toolsets
How to Use ItΒΆ
- In Robusta UI
Click the
Root Cause
tab on any alert to see the AI investigation.- Via @holmes in Slack
Ask natural language questions about your clusters:
@holmes what apps are crashing in prod-cluster?
@holmes why is my alert firing on staging?
@holmes investigate high memory usage in dev-cluster
Example InvestigationΒΆ
Here's what HolmesGPT found for a CrashLoopBackOff alert:

The AI identified: - The exact error from pod logs - The root cause (missing environment variable) - Recommended fix with example YAML - Related configuration issues
Why Use HolmesGPT?ΒΆ
- Save Time
Stop manually checking logs, metrics, and events across multiple tools. HolmesGPT does it in seconds.
- Reduce MTTR
Get to root cause faster with AI that understands Kubernetes patterns and common issues.
- Learn as You Go
Each investigation explains what was checked and why, helping your team learn Kubernetes troubleshooting.
- 24/7 Coverage
AI investigations run automatically on every alert, even at 3 AM.
Next StepsΒΆ
Getting Started - Set up HolmesGPT in 5 minutes
Available Data Sources - See all supported integrations (use Robusta Helm Chart configuration method)
Helm Configuration Reference - Advanced HolmesGPT settings for Robusta deployments