πPart 4 Optimize Kubernetes Monitoring: A Complete Guide to Prometheus and Grafana Integration


π Advanced Kubernetes Monitoring with Prometheus, Grafana, Node Exporter & cAdvisor
1οΈβ£ Overview
Monitoring Kubernetes effectively requires collecting metrics from both the cluster and individual nodes. This guide extends our monitoring setup by adding:
β
Prometheus β Collects Kubernetes cluster metrics
β
Grafana β Visualizes metrics in dashboards
β
Node Exporter β Captures node-level CPU, memory, and disk metrics
β
cAdvisor β Monitors container-level resource usage
By the end of this guide, you'll have deep visibility into your Kubernetes environment. π
2οΈβ£ Deploying Prometheus
π Ensure you have created a monitoring namespace:
kubectl create namespace monitoring
β Sample Output:
namespace/monitoring created
πΉ Apply Prometheus Configurations
curl -o prometheus-config.yaml https://raw.githubusercontent.com/Vikas-DevOpsPractice/EasyShop/feature/kindcluster/K8s/14-prometheus-config.yaml
curl -o prometheus-deployment.yaml https://raw.githubusercontent.com/Vikas-DevOpsPractice/EasyShop/feature/kindcluster/K8s/15-prometheus-deployment.yaml
kubectl apply -f prometheus-config.yaml -n monitoring
kubectl apply -f prometheus-deployment.yaml -n monitoring
β Check Prometheus Deployment:
kubectl get pods -n monitoring
β Sample Output:
NAME READY STATUS RESTARTS AGE
prometheus-5f9d77c86f-xyz12 1/1 Running 0 1m
3οΈβ£ Deploying Node Exporter
πΉ Why Node Exporter?
πΉ Collects CPU, Memory, Disk, and Network usage of each Kubernetes node
πΉ Provides hardware and OS metrics
πΉ Step 1: Create a DaemonSet for Node Exporter
# node-exporter-daemonset.yaml
apiVersion: apps/v1
kind: DaemonSet
metadata:
name: node-exporter
namespace: monitoring
spec:
selector:
matchLabels:
app: node-exporter
template:
metadata:
labels:
app: node-exporter
spec:
hostNetwork: true
containers:
- name: node-exporter
image: prom/node-exporter:v1.5.0
ports:
- containerPort: 9100
hostPort: 9100
β Apply the DaemonSet:
kubectl apply -f node-exporter-daemonset.yaml -n monitoring
β Verify Deployment:
kubectl get pods -n monitoring
β Sample Output:
NAME READY STATUS RESTARTS AGE
node-exporter-xyz12 1/1 Running 0 1m
4οΈβ£ Deploying cAdvisor
πΉ Why cAdvisor?
πΉ Provides per-container resource usage (CPU, memory, disk, network)
πΉ Helps in troubleshooting slow or resource-hungry containers
πΉ Step 1: Create a DaemonSet for cAdvisor
# cadvisor-daemonset.yaml
apiVersion: apps/v1
kind: DaemonSet
metadata:
name: cadvisor
namespace: monitoring
spec:
selector:
matchLabels:
app: cadvisor
template:
metadata:
labels:
app: cadvisor
spec:
hostNetwork: true
containers:
- name: cadvisor
image: gcr.io/cadvisor/cadvisor:v0.47.0
ports:
- containerPort: 8080
hostPort: 8080
β Apply the DaemonSet:
kubectl apply -f cadvisor-daemonset.yaml -n monitoring
β Verify Deployment:
kubectl get pods -n monitoring
β Sample Output:
NAME READY STATUS RESTARTS AGE
cadvisor-xyz12 1/1 Running 0 1m
5οΈβ£ Integrating Node Exporter & cAdvisor with Prometheus
πΉ Update Prometheus Configuration
Add the following scrape jobs to prometheus-config.yaml:
scrape_configs:
- job_name: 'node-exporter'
static_configs:
- targets: ['node-exporter.monitoring.svc.cluster.local:9100']
- job_name: 'cadvisor'
static_configs:
- targets: ['cadvisor.monitoring.svc.cluster.local:8080']
β Reapply Prometheus Configuration:
kubectl apply -f prometheus-config.yaml -n monitoring
kubectl rollout restart deployment prometheus -n monitoring
6οΈβ£ Deploying Grafana
curl -o grafana-deployment.yaml https://raw.githubusercontent.com/Vikas-DevOpsPractice/EasyShop/feature/kindcluster/K8s/16-grafana-deployment.yaml
kubectl apply -f grafana-deployment.yaml -n monitoring
β Verify Grafana:
kubectl get pods -n monitoring
β Sample Output:
NAME READY STATUS RESTARTS AGE
grafana-78b6c9c76f-xyz12 1/1 Running 0 1m
7οΈβ£ Setting Up Dashboards in Grafana
πΉ Add Prometheus as a Data Source
π Go to Grafana β Configuration β Add Data Source
πΉ Select Prometheus
πΉ Set URL to:
http://prometheus.monitoring.svc.cluster.local:9090
πΉ Click Save & Test
β Integration Successful!
πΉ Import Prebuilt Kubernetes Dashboards
π Go to Grafana Dashboard β Click Dashboards β Import
πΉ Use Dashboard ID: 11074 (Node Exporter)
πΉ Use Dashboard ID: 13689 (cAdvisor)
πΉ Select Prometheus as the data source β Click Import
β Sample Node Metrics Dashboard:
β Sample Container Metrics Dashboard:
8οΈβ£ Setting Up Alerts in Grafana
π Open Grafana β Click Alerts β Create Alert Rule
πΉ Condition: Alert when CPU Usage > 80% for 5 minutes
πΉ Notification: Email, Slack, PagerDuty
πΉ Click Save & Enable Alerting
β Now, alerts will trigger on resource spikes!
9οΈβ£ Troubleshooting & Best Practices
πΉ Prometheus Not Collecting Metrics?
kubectl logs -l app=prometheus -n monitoring
π Ensure the scrape configs in prometheus-config.yaml
are correct
πΉ Node Exporter Not Running?
kubectl describe pod node-exporter-xyz12 -n monitoring
π Ensure hostPort 9100 is available
πΉ cAdvisor Metrics Not Appearing?
kubectl logs -l app=cadvisor -n monitoring
π Ensure cadvisor.monitoring.svc.cluster.local:8080
is correct in Prometheus config
π― Conclusion
π Advanced Kubernetes Monitoring is now fully set up!
β
Prometheus collects Kubernetes & node metrics
β
Node Exporter tracks hardware performance
β
cAdvisor monitors per-container usage
β
Grafana visualizes & alerts on key metrics
Subscribe to my newsletter
Read articles from Vikas Surve directly inside your inbox. Subscribe to the newsletter, and don't miss out.
Written by
