πŸ“ŒPart 4 Optimize Kubernetes Monitoring: A Complete Guide to Prometheus and Grafana Integration

Vikas SurveVikas Surve
4 min read

πŸ“Œ Advanced Kubernetes Monitoring with Prometheus, Grafana, Node Exporter & cAdvisor

1️⃣ Overview

Monitoring Kubernetes effectively requires collecting metrics from both the cluster and individual nodes. This guide extends our monitoring setup by adding:

βœ… Prometheus – Collects Kubernetes cluster metrics
βœ… Grafana – Visualizes metrics in dashboards
βœ… Node Exporter – Captures node-level CPU, memory, and disk metrics
βœ… cAdvisor – Monitors container-level resource usage

By the end of this guide, you'll have deep visibility into your Kubernetes environment. πŸš€


2️⃣ Deploying Prometheus

πŸ“Œ Ensure you have created a monitoring namespace:

kubectl create namespace monitoring

βœ… Sample Output:

namespace/monitoring created

πŸ”Ή Apply Prometheus Configurations

curl -o prometheus-config.yaml https://raw.githubusercontent.com/Vikas-DevOpsPractice/EasyShop/feature/kindcluster/K8s/14-prometheus-config.yaml
curl -o prometheus-deployment.yaml https://raw.githubusercontent.com/Vikas-DevOpsPractice/EasyShop/feature/kindcluster/K8s/15-prometheus-deployment.yaml

kubectl apply -f prometheus-config.yaml -n monitoring
kubectl apply -f prometheus-deployment.yaml -n monitoring

βœ… Check Prometheus Deployment:

kubectl get pods -n monitoring

βœ… Sample Output:

NAME                         READY   STATUS    RESTARTS   AGE
prometheus-5f9d77c86f-xyz12  1/1     Running   0          1m

3️⃣ Deploying Node Exporter

πŸ”Ή Why Node Exporter?

πŸ”Ή Collects CPU, Memory, Disk, and Network usage of each Kubernetes node
πŸ”Ή Provides hardware and OS metrics

πŸ”Ή Step 1: Create a DaemonSet for Node Exporter

# node-exporter-daemonset.yaml
apiVersion: apps/v1
kind: DaemonSet
metadata:
  name: node-exporter
  namespace: monitoring
spec:
  selector:
    matchLabels:
      app: node-exporter
  template:
    metadata:
      labels:
        app: node-exporter
    spec:
      hostNetwork: true
      containers:
      - name: node-exporter
        image: prom/node-exporter:v1.5.0
        ports:
        - containerPort: 9100
          hostPort: 9100

βœ… Apply the DaemonSet:

kubectl apply -f node-exporter-daemonset.yaml -n monitoring

βœ… Verify Deployment:

kubectl get pods -n monitoring

βœ… Sample Output:

NAME                         READY   STATUS    RESTARTS   AGE
node-exporter-xyz12          1/1     Running   0          1m

4️⃣ Deploying cAdvisor

πŸ”Ή Why cAdvisor?

πŸ”Ή Provides per-container resource usage (CPU, memory, disk, network)
πŸ”Ή Helps in troubleshooting slow or resource-hungry containers

πŸ”Ή Step 1: Create a DaemonSet for cAdvisor

# cadvisor-daemonset.yaml
apiVersion: apps/v1
kind: DaemonSet
metadata:
  name: cadvisor
  namespace: monitoring
spec:
  selector:
    matchLabels:
      app: cadvisor
  template:
    metadata:
      labels:
        app: cadvisor
    spec:
      hostNetwork: true
      containers:
      - name: cadvisor
        image: gcr.io/cadvisor/cadvisor:v0.47.0
        ports:
        - containerPort: 8080
          hostPort: 8080

βœ… Apply the DaemonSet:

kubectl apply -f cadvisor-daemonset.yaml -n monitoring

βœ… Verify Deployment:

kubectl get pods -n monitoring

βœ… Sample Output:

NAME                         READY   STATUS    RESTARTS   AGE
cadvisor-xyz12               1/1     Running   0          1m

5️⃣ Integrating Node Exporter & cAdvisor with Prometheus

πŸ”Ή Update Prometheus Configuration

Add the following scrape jobs to prometheus-config.yaml:

scrape_configs:
  - job_name: 'node-exporter'
    static_configs:
      - targets: ['node-exporter.monitoring.svc.cluster.local:9100']

  - job_name: 'cadvisor'
    static_configs:
      - targets: ['cadvisor.monitoring.svc.cluster.local:8080']

βœ… Reapply Prometheus Configuration:

kubectl apply -f prometheus-config.yaml -n monitoring
kubectl rollout restart deployment prometheus -n monitoring

6️⃣ Deploying Grafana

curl -o grafana-deployment.yaml https://raw.githubusercontent.com/Vikas-DevOpsPractice/EasyShop/feature/kindcluster/K8s/16-grafana-deployment.yaml
kubectl apply -f grafana-deployment.yaml -n monitoring

βœ… Verify Grafana:

kubectl get pods -n monitoring

βœ… Sample Output:

NAME                         READY   STATUS    RESTARTS   AGE
grafana-78b6c9c76f-xyz12     1/1     Running   0          1m

7️⃣ Setting Up Dashboards in Grafana

πŸ”Ή Add Prometheus as a Data Source

πŸ“Œ Go to Grafana β†’ Configuration β†’ Add Data Source
πŸ”Ή Select Prometheus
πŸ”Ή Set URL to:

http://prometheus.monitoring.svc.cluster.local:9090

πŸ”Ή Click Save & Test

βœ… Integration Successful!


πŸ”Ή Import Prebuilt Kubernetes Dashboards

πŸ“Œ Go to Grafana Dashboard β†’ Click Dashboards β†’ Import
πŸ”Ή Use Dashboard ID: 11074 (Node Exporter)
πŸ”Ή Use Dashboard ID: 13689 (cAdvisor)
πŸ”Ή Select Prometheus as the data source β†’ Click Import

βœ… Sample Node Metrics Dashboard:

Node Metrics

βœ… Sample Container Metrics Dashboard:

Container Metrics


8️⃣ Setting Up Alerts in Grafana

πŸ“Œ Open Grafana β†’ Click Alerts β†’ Create Alert Rule
πŸ”Ή Condition: Alert when CPU Usage > 80% for 5 minutes
πŸ”Ή Notification: Email, Slack, PagerDuty
πŸ”Ή Click Save & Enable Alerting

βœ… Now, alerts will trigger on resource spikes!


9️⃣ Troubleshooting & Best Practices

πŸ”Ή Prometheus Not Collecting Metrics?

kubectl logs -l app=prometheus -n monitoring

πŸ“Œ Ensure the scrape configs in prometheus-config.yaml are correct


πŸ”Ή Node Exporter Not Running?

kubectl describe pod node-exporter-xyz12 -n monitoring

πŸ“Œ Ensure hostPort 9100 is available


πŸ”Ή cAdvisor Metrics Not Appearing?

kubectl logs -l app=cadvisor -n monitoring

πŸ“Œ Ensure cadvisor.monitoring.svc.cluster.local:8080 is correct in Prometheus config


🎯 Conclusion

πŸš€ Advanced Kubernetes Monitoring is now fully set up!
βœ… Prometheus collects Kubernetes & node metrics
βœ… Node Exporter tracks hardware performance
βœ… cAdvisor monitors per-container usage
βœ… Grafana visualizes & alerts on key metrics

πŸ“Œ Next Step: End-to-End CI/CD Automation for Kubernetes Using Jenkins, GitLab, AWS CodePipeline & Azure DevOps

0
Subscribe to my newsletter

Read articles from Vikas Surve directly inside your inbox. Subscribe to the newsletter, and don't miss out.

Written by

Vikas Surve
Vikas Surve