Advanced Reading #kubernetes #kubectl #devops

Reading Kubernetes Events & Logs

5 exercises on reading kubectl get events and kubectl describe output — understand ImagePullBackOff, CrashLoopBackOff, OOMKilled, and what each event means for pod health.

Key Kubernetes pod states to know

ImagePullBackOff — cannot pull the container image (wrong tag, 401 auth, registry down)
CrashLoopBackOff — container keeps crashing; Kubernetes retries with exponential back-off
OOMKilled / Exit 137 — container exceeded its memory limit; killed by the Linux kernel
Normal vs. Warning events — Warning = something unexpected; investigate
Restart Count — how many times the container has been restarted; high = ongoing problem

0 / 5 completed

1 / 5

⎈ kubectl get events output

$ kubectl get events -n production --sort-by='.lastTimestamp'

LAST SEEN   TYPE      REASON              OBJECT                          MESSAGE
2m          Normal    Scheduled           pod/api-server-7d9b4c6f8-xk2p9  Successfully assigned production/api-server-7d9b4c6f8-xk2p9 to node-3
2m          Normal    Pulling             pod/api-server-7d9b4c6f8-xk2p9  Pulling image "registry.example.com/api-server:v2.4.0"
90s         Warning   Failed              pod/api-server-7d9b4c6f8-xk2p9  Failed to pull image "registry.example.com/api-server:v2.4.0": rpc error: code = Unknown desc = failed to pull and unpack image: failed to resolve reference "registry.example.com/api-server:v2.4.0": unexpected status code 401 Unauthorized
90s         Warning   Failed              pod/api-server-7d9b4c6f8-xk2p9  Error: ErrImagePull
45s         Warning   BackOff             pod/api-server-7d9b4c6f8-xk2p9  Back-off pulling image "registry.example.com/api-server:v2.4.0"
45s         Warning   Failed              pod/api-server-7d9b4c6f8-xk2p9  Error: ImagePullBackOff

Read the kubectl get events output. What is the root cause of the ImagePullBackOff status on the pod?

2 / 5

⎈ kubectl describe pod output

$ kubectl describe pod worker-6b8c9d7f5-mnp12 -n production

Name:         worker-6b8c9d7f5-mnp12
Namespace:    production
Node:         node-2/10.0.0.12
Status:       Running

Containers:
  worker:
    State:          Waiting
      Reason:       CrashLoopBackOff
    Last State:     Terminated
      Reason:       OOMKilled
      Exit Code:    137
      Started:      Wed, 10 Apr 2024 10:22:14 +0000
      Finished:     Wed, 10 Apr 2024 10:22:41 +0000
    Ready:          False
    Restart Count:  8
    Limits:
      memory:   256Mi
    Requests:
      memory:   128Mi

Events:
  Warning  BackOff   2m    kubelet  Back-off restarting failed container worker in pod worker-6b8c9d7f5-mnp12

The pod's Last State shows Reason: OOMKilled and Exit Code: 137. What does this tell a developer?

3 / 5

⎈ kubectl describe pod output

$ kubectl describe pod worker-6b8c9d7f5-mnp12 -n production

Name:         worker-6b8c9d7f5-mnp12
Namespace:    production
Node:         node-2/10.0.0.12
Status:       Running

Containers:
  worker:
    State:          Waiting
      Reason:       CrashLoopBackOff
    Last State:     Terminated
      Reason:       OOMKilled
      Exit Code:    137
      Started:      Wed, 10 Apr 2024 10:22:14 +0000
      Finished:     Wed, 10 Apr 2024 10:22:41 +0000
    Ready:          False
    Restart Count:  8
    Limits:
      memory:   256Mi
    Requests:
      memory:   128Mi

Events:
  Warning  BackOff   2m    kubelet  Back-off restarting failed container worker in pod worker-6b8c9d7f5-mnp12

The pod's Restart Count is 8 and its current state is CrashLoopBackOff. What does CrashLoopBackOff mean in Kubernetes?

4 / 5

⎈ kubectl get events output

$ kubectl get events -n production --sort-by='.lastTimestamp'

LAST SEEN   TYPE      REASON              OBJECT                          MESSAGE
2m          Normal    Scheduled           pod/api-server-7d9b4c6f8-xk2p9  Successfully assigned production/api-server-7d9b4c6f8-xk2p9 to node-3
2m          Normal    Pulling             pod/api-server-7d9b4c6f8-xk2p9  Pulling image "registry.example.com/api-server:v2.4.0"
90s         Warning   Failed              pod/api-server-7d9b4c6f8-xk2p9  Failed to pull image "registry.example.com/api-server:v2.4.0": rpc error: code = Unknown desc = failed to pull and unpack image: failed to resolve reference "registry.example.com/api-server:v2.4.0": unexpected status code 401 Unauthorized
90s         Warning   Failed              pod/api-server-7d9b4c6f8-xk2p9  Error: ErrImagePull
45s         Warning   BackOff             pod/api-server-7d9b4c6f8-xk2p9  Back-off pulling image "registry.example.com/api-server:v2.4.0"
45s         Warning   Failed              pod/api-server-7d9b4c6f8-xk2p9  Error: ImagePullBackOff

The events show both TYPE: Warning and TYPE: Normal entries. What does an event TYPE: Warning indicate in Kubernetes?

5 / 5

⎈ kubectl describe pod output

$ kubectl describe pod worker-6b8c9d7f5-mnp12 -n production

Name:         worker-6b8c9d7f5-mnp12
Namespace:    production
Node:         node-2/10.0.0.12
Status:       Running

Containers:
  worker:
    State:          Waiting
      Reason:       CrashLoopBackOff
    Last State:     Terminated
      Reason:       OOMKilled
      Exit Code:    137
      Started:      Wed, 10 Apr 2024 10:22:14 +0000
      Finished:     Wed, 10 Apr 2024 10:22:41 +0000
    Ready:          False
    Restart Count:  8
    Limits:
      memory:   256Mi
    Requests:
      memory:   128Mi

Events:
  Warning  BackOff   2m    kubelet  Back-off restarting failed container worker in pod worker-6b8c9d7f5-mnp12

The container ran from 10:22:14 to 10:22:41 before being OOMKilled. What does this short runtime duration suggest about the nature of the memory problem?