Insights Engine
24 built-in rules that detect common Kubernetes issues and provide actionable recommendations.
Rule Definitions
Pod in CrashLoopBackOff with restarts >3/hour
Container terminated with OOMKilled (exit 137)
Deployment with 0 available replicas
Node condition Ready ≠ True
Pod in ImagePullBackOff state
Rollout stalled past progressDeadlineSeconds
Container can't start: referenced ConfigMap/Secret not found
Helm release left in a failed state after install/upgrade
CPU usage >80% of limit sustained
Memory usage >85% of limit
HPA current replicas == max replicas
PVC in Pending state for >5 minutes
Pod with >5 restarts in 24 hours (non crash-loop)
Pods evicted from node due to pressure
Pod Running but not Ready for >2 minutes
Liveness probe failing repeatedly before restarts pile up
Service has zero ready endpoints
NetworkPolicy podSelector matches no pods
PodDisruptionBudget selector matches no pods
Helm release stuck pending >5 min (hook never completed)
cert-manager Certificate expired or within 14 days of expiry
ArgoCD Application OutOfSync or Degraded
Requests <40% of actual usage
Namespace has running pods but no NetworkPolicy
Each insight includes the affected resource, a human-readable message, and a specific suggestion with remediation steps or kubectl commands.