Google Kubernetes Engine (GKE)
SmartSRE scans Google Kubernetes Engine clusters for scaling, resource, health, and cost issues, and offers fixes for the problems it finds.
What SmartSRE Scans
| Category | Checks |
|---|---|
| Scaling | Node pool utilization, HPA configuration |
| Resources | Pod resource requests/limits |
| Health | Node conditions, pod restarts |
| Cost | Over-provisioned nodes, spot opportunities |
Findings
| Issue Type | Severity | Description |
|---|---|---|
| node_pool_high_utilization | High | CPU/memory > 85% on node pool |
| pod_memory_limits_missing | Medium | Pods without memory limits |
| node_not_ready | Critical | Node in NotReady state |
| high_pod_restart_rate | High | Pods restarting frequently |
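
As an illustration of the `node_pool_high_utilization` rule, the sketch below flags node pools whose CPU or memory utilization exceeds 85%. It is not SmartSRE's implementation: the `NodePoolUsage` type and the `high_utilization_pools` function are hypothetical names, utilization is assumed to arrive as a fraction between 0 and 1, and "CPU/memory" is assumed to mean that either metric alone is enough to trigger the finding.

```python
from dataclasses import dataclass

# Threshold from the findings table, applied as a strict "greater than".
UTILIZATION_THRESHOLD = 0.85


@dataclass
class NodePoolUsage:
    """Hypothetical container for per-pool metrics (not a SmartSRE type)."""
    name: str
    cpu_utilization: float     # fraction of CPU in use, 0.0 to 1.0
    memory_utilization: float  # fraction of memory in use, 0.0 to 1.0


def high_utilization_pools(pools):
    """Return names of pools where CPU or memory exceeds the threshold."""
    return [
        pool.name
        for pool in pools
        # Assumption: either metric above the threshold triggers the finding.
        if max(pool.cpu_utilization, pool.memory_utilization) > UTILIZATION_THRESHOLD
    ]


pools = [
    NodePoolUsage("default-pool", cpu_utilization=0.91, memory_utilization=0.40),
    NodePoolUsage("batch-pool", cpu_utilization=0.50, memory_utilization=0.85),
    NodePoolUsage("web-pool", cpu_utilization=0.30, memory_utilization=0.88),
]
flagged = high_utilization_pools(pools)
print(flagged)
```

Because the comparison is strict, `batch-pool` at exactly 85% memory is not flagged here. Whether SmartSRE treats the boundary the same way is not stated in this page.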
Available Fixes
| Operation | Description | Impact |
|---|---|---|
| scale_node_pool | Add/remove nodes | Medium |
| enable_node_autoscaling | Enable cluster autoscaler | Low |
| set_pod_resources | Update pod resource limits | Medium |
| drain_node | Safely drain node | High |
Required Permissions
For Scanning
- roles/container.clusterViewer
- roles/monitoring.viewer
For Remediation
- roles/container.clusterAdmin
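
A quick way to confirm that a service account covers the roles listed above is to compare its granted roles against each set. The sketch below is a minimal, offline illustration: the `missing_roles` helper is a hypothetical name, and the granted roles are assumed to have been collected already (for example, from the project's IAM policy). It compares role names literally, so a broader role that carries the same permissions would still be reported as missing.

```python
# Role sets copied from the Required Permissions section.
SCAN_ROLES = {"roles/container.clusterViewer", "roles/monitoring.viewer"}
REMEDIATION_ROLES = {"roles/container.clusterAdmin"}


def missing_roles(granted, required):
    """Return the required roles absent from `granted`, sorted by name."""
    return sorted(set(required) - set(granted))


# Example: an account that was only given the cluster viewer role.
granted = {"roles/container.clusterViewer"}

print("Missing for scanning:", missing_roles(granted, SCAN_ROLES))
print("Missing for remediation:", missing_roles(granted, REMEDIATION_ROLES))
```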
Best Practices
- Enable the cluster autoscaler: nodes are added and removed automatically
- Set resource requests: the scheduler relies on them to place pods accurately
- Use node taints: isolate workloads on dedicated nodes
- Monitor pod restarts: frequent restarts are an early warning of problems
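
The resource-request practice above can be checked before deployment. The following sketch walks a pod manifest that has already been parsed into a Python dictionary and reports containers that do not declare CPU and memory requests or a memory limit. The `containers_missing_resources` function is a hypothetical helper, not part of SmartSRE. It looks only at what the manifest literally declares under `spec.containers`, so it ignores init containers and any defaults applied by the cluster, such as a LimitRange.

```python
def containers_missing_resources(pod):
    """Return (container_name, missing_fields) pairs for a parsed pod manifest."""
    findings = []
    for container in pod.get("spec", {}).get("containers", []):
        # `resources`, `requests`, and `limits` may be absent or null.
        resources = container.get("resources") or {}
        requests = resources.get("requests") or {}
        limits = resources.get("limits") or {}

        missing = []
        if "cpu" not in requests:
            missing.append("requests.cpu")
        if "memory" not in requests:
            missing.append("requests.memory")
        if "memory" not in limits:
            missing.append("limits.memory")

        if missing:
            findings.append((container.get("name", "<unnamed>"), missing))
    return findings


# Example manifest: one fully specified container, one with no resources.
pod = {
    "metadata": {"name": "web"},
    "spec": {
        "containers": [
            {
                "name": "app",
                "resources": {
                    "requests": {"cpu": "250m", "memory": "256Mi"},
                    "limits": {"memory": "512Mi"},
                },
            },
            {"name": "sidecar"},
        ]
    },
}

for name, missing in containers_missing_resources(pod):
    print(f"{name}: missing {', '.join(missing)}")
```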