Practice CKA Troubleshooting questions with full explanations on every answer.
Start practicing
Troubleshooting — choose a session length
Free · No account required
Click any question to see the full explanation and answer options, or start a focused practice session above.
A pod named 'web-frontend' is in CrashLoopBackOff. You run 'kubectl logs web-frontend' and see: 'Error: listen tcp :8080: bind: address already in use'. What is the most likely cause and how should you fix it?
2A user reports that their application cannot resolve DNS names for services in the cluster. The application runs in a pod with dnsPolicy: ClusterFirst. What is the most likely cause?
3Which TWO of the following are valid methods to troubleshoot a pod that is stuck in 'Pending' state?
4Based on the exhibit, the pod is in CrashLoopBackOff. Which command should you run NEXT to identify the root cause?
5You are a CKA managing a production cluster with 5 worker nodes. A developer reports that a new deployment 'payment-service' is not accessible from other pods via its Service 'payment-svc' in the 'default' namespace. The Service is of type ClusterIP with selector 'app: payment'. The deployment has 3 replicas, all showing 'Running' status. From a test pod, you run 'curl http://payment-svc:8080' and get 'Connection refused'. You verify that the pods are listening on port 8080 and the container's readiness probe passes. 'kubectl get endpoints payment-svc' shows no endpoints. 'kubectl describe svc payment-svc' shows the selector 'app=payment'. What is the most likely cause?
6A developer reports that a newly deployed Deployment named 'web-app' is not serving traffic. The Deployment has 3 replicas, a Service of type ClusterIP, and an Ingress. Which TWO commands should you run first to diagnose the issue?
7Based on the exhibit, what is the most likely cause of the pod not running?
8You are tasked with troubleshooting a production Kubernetes cluster. A user reports that they cannot access a web application running in the cluster. The application is deployed as a Deployment named 'frontend' with 2 replicas, exposed via a Service of type LoadBalancer. You have kubectl access to the cluster. You run 'kubectl get pods -l app=frontend' and see both pods are Running and Ready. You run 'kubectl get svc frontend' and see the Service has an external IP of 192.168.1.100. However, when you curl http://192.168.1.100 from a machine outside the cluster, you get a connection timeout. You are able to curl the pod IPs directly from within the cluster and get a response. Which of the following is the most likely cause of the issue?
9A pod is in CrashLoopBackOff state. Which command should you use to see the logs of the previous instance?
10You run kubectl get nodes and see one node is NotReady. The kubelet is running on the node. What is the most likely cause?
11A pod is stuck in Pending state. You describe the pod and see: '0/4 nodes are available: 4 node(s) had taint {node.kubernetes.io/not-ready: }, that the pod didn't tolerate.' What is the most likely reason?
12You need to investigate why a service is not reachable from within the cluster. Which of the following is the first step?
13A pod has status 'Init:Error'. What does this indicate?
14Which of the following are valid methods to debug a failing CoreDNS pod? (Select TWO)
15A pod is in CrashLoopBackOff state. Which command shows the last termination reason?
16A Node is in NotReady state. Which action should be taken first to diagnose the issue?
17A pod is stuck in ContainerCreating. Which condition is most likely if `kubectl describe pod` shows 'Failed to create pod sandbox'?
18A pod is in ImagePullBackOff. Which command would help determine the exact reason?
19A Service of type ClusterIP is not reachable from within the cluster. Pods backing the Service are running and healthy. What is the most likely cause?
20A worker node is marked NotReady. Which two checks are most relevant to diagnose the node's kubelet health? (Choose two.)
21A Deployment's pods are failing with 'CrashLoopBackOff'. The container exits with code 1. Which two approaches will help identify the issue? (Choose two.)
22A user reports that a pod is stuck in 'ContainerCreating' state. Which command would you run first to diagnose the issue?
23A Node is reporting DiskPressure condition. Which action is most appropriate to resolve this without losing data?
24You have a pod that is CrashLoopBackOff. The logs show 'error: dial tcp: lookup service.default.svc.cluster.local: no such host'. What is the most likely cause?
25A pod is running but cannot be accessed via its ClusterIP service from another pod in the same namespace. The service endpoints list shows the pod's IP. What is the most likely cause?
26You run 'kubectl get pods' and see some pods in 'ImagePullBackOff' state. Which command would best help identify the root cause?
27Which of the following are valid methods to troubleshoot a Node that is 'NotReady'? (Select all that apply)
28You have a Deployment that is not scaling up beyond 1 replica despite setting replicas: 3. Which of the following could be the cause? (Select all that apply)
29A node in your cluster is reporting a DiskPressure condition. Which kubectl command would you use to get details about the node's condition?
30A Pod is stuck in CrashLoopBackOff. You run `kubectl logs <pod-name>` but see no output. What is the most likely cause?
31A Service of type LoadBalancer is created but the EXTERNAL-IP remains <pending>. The cluster is running on-premises without a cloud load balancer integration. Which of the following is the most likely reason?
32A Pod is in ImagePullBackOff state. Which of the following are valid troubleshooting steps? (Select two.)
33A ClusterIP Service is not reachable from within the cluster. You verify that the Service has endpoints. Which of the following could be the cause? (Select three.)
34Drag and drop the steps to back up and restore etcd data for a Kubernetes cluster into the correct order.
35Drag and drop the steps to set up a PersistentVolumeClaim for a pod into the correct order.
36Match each scheduling concept to its description.
37Match each logging/monitoring component to its role.
38You run 'kubectl get pods' and see a pod in 'ImagePullBackOff' state. What is the most likely cause?
39A pod is stuck in 'Pending' state. Which command would you run FIRST to diagnose the issue?
40After deploying a new Deployment, you run 'kubectl get events' and see 'FailedScheduling' events. What is a possible cause?
41You have a multi-node cluster. One node shows 'NotReady'. You run 'journalctl -u kubelet' on that node and see 'network plugin is not ready'. What is the most likely cause?
42You want to check the logs of a container that previously crashed. Which command should you use?
43You are troubleshooting a DNS issue. From within a pod, you run 'nslookup kubernetes.default.svc.cluster.local' and get 'connection timed out; no servers could be reached'. What is the most likely cause?
44Which command shows CPU and memory usage of nodes in the cluster?
45You run 'kubectl get pods' and see a pod in 'CrashLoopBackOff'. What command would you run to see the reason for the crash?
46You want to test network connectivity from pod A to pod B in the same namespace. Which command would you run from within pod A?
47You see a pod in 'Pending' state. 'kubectl describe pod' shows '0/4 nodes are available: 1 node(s) had taint(s) that the pod didn't tolerate, 3 Insufficient cpu'. What should you do?
48Which command would you use to check the status of the kube-apiserver on a control plane node managed by systemd?
49You deploy a pod with resource limits but no requests. The pod gets OOMKilled. What is the most likely reason?
50Which TWO of the following are valid methods to view the logs of a container that has terminated?
51Which THREE of the following are common causes for a pod to remain in 'Pending' state?
52Which TWO of the following commands are useful for debugging network connectivity between pods?
53A pod is in CrashLoopBackOff. You run 'kubectl logs mypod --previous' and see 'Error: unable to connect to database'. What is the MOST likely cause?
54Which command retrieves logs from a container that has crashed and restarted?
55You run 'kubectl get nodes' and one node shows 'NotReady'. You SSH into the node and run 'systemctl status kubelet'. The output shows 'active (running)'. What should you check NEXT?
56A Deployment's pod is stuck in Pending state. 'kubectl describe pod' shows Events: '0/4 nodes are available: 1 node(s) had taint {node-role.kubernetes.io/control-plane: }, that the pod didn't tolerate, 3 Insufficient memory'. What is the likely fix?
57A pod is in ImagePullBackOff state. Which command is MOST useful to diagnose the issue?
58You want to check the memory usage of nodes in your cluster. Which command should you use?
59A pod is unable to resolve DNS names. You exec into the pod and run 'nslookup kubernetes.default.svc.cluster.local'. The command hangs. What is the MOST likely cause?
60You run 'kubectl get pods' and see a pod in 'CrashLoopBackOff'. 'kubectl logs pod' shows no output, but 'kubectl logs --previous pod' shows an error. Why might the current logs be empty?
61A pod is stuck in Pending state. 'kubectl describe pod' shows '0/2 nodes are available: 1 node(s) had taint that the pod didn't tolerate, 1 node(s) didn't match pod anti-affinity rules'. What should you check?
62Which command shows all events in the cluster sorted by timestamp?
63You suspect that the kube-scheduler is not running. Which command checks the scheduler's health?
64A pod runs but you cannot connect to its container port from another pod in the same namespace. 'kubectl exec' into the pod and 'curl localhost:8080' works. What is the MOST likely cause?
65Which TWO of the following are valid reasons for a pod to be in the Pending state? (Choose two)
66Which THREE of the following are valid steps to troubleshoot a Node in NotReady state? (Choose three)
67Which TWO of the following are common causes of CrashLoopBackOff? (Choose two)
68A node in the cluster is reporting NotReady. You run 'journalctl -u kubelet' and see repeated errors: 'failed to get system container stats for "/system.slice/kubelet.service": failed to get cgroup stats'. What is the most likely cause?
69You need to check the logs of a container that previously ran but has crashed. Which command would you use?
70A pod is stuck in Pending state. 'kubectl describe pod' shows the event: '0/3 nodes are available: 3 node(s) didn't match pod anti-affinity rules'. What is the most likely cause?
71A pod is in ImagePullBackOff state. Which command can you run to get more details about the underlying error?
72You want to view the resource usage of all pods in the cluster. What command should you run?
73A pod named 'web-app' is crashing repeatedly. You run 'kubectl describe pod web-app' and see that the container exited with code 137. What does this indicate?
74You are troubleshooting a connectivity issue between two pods in the same cluster. Pod A cannot reach Pod B's IP. Which command would you run from Pod A to test connectivity?
75A pod has been in Pending state for a long time. 'kubectl describe pod' shows the event: '0/3 nodes are available: 1 node(s) had taint {node.kubernetes.io/not-ready: }, that the pod didn't tolerate, 2 node(s) had taint {node.kubernetes.io/unreachable: }, that the pod didn't tolerate.' What is the most likely cause?
76You need to see the startup logs of the kubelet service. Which command should you use?
77You run 'kubectl get pods' and see a pod in CrashLoopBackOff. The pod has a single container. You want to see the last termination message from the container. Which command will provide that information?
78A pod is failing to start with the error 'CrashLoopBackOff'. You check the logs with 'kubectl logs pod' and see nothing. What is the most likely reason?
79Which command shows all events in the cluster sorted by timestamp?
80Which TWO of the following are common causes for a pod to be stuck in Pending state?
81Which THREE of the following are valid steps to troubleshoot a DNS issue within a Kubernetes cluster?
82Which TWO of the following are correct methods to check the health of the kube-apiserver?
83A pod in the 'production' namespace is in a CrashLoopBackOff state. The pod has been running successfully for several days. You run 'kubectl describe pod app-pod -n production' and see the message: 'OOMKilled'. What is the MOST appropriate action to resolve this issue?
84You run 'kubectl get nodes' and see that one node is in the 'NotReady' state. Which command would you use FIRST to investigate the kubelet status on that node?
85A pod is stuck in 'Pending' state. You run 'kubectl describe pod my-pod' and see the event: '0/4 nodes are available: 4 node(s) had taint {node.kubernetes.io/unreachable: }, that the pod didn't tolerate'. What is the likely cause?
86You need to check the logs of a container that previously crashed. The pod is currently running, but the previous instance of the container exited with an error. Which command will show you the logs from the crashed container?
87You run 'kubectl get pods' and see a pod in 'ImagePullBackOff' state. Which of the following is NOT a common cause?
88You need to verify if the kube-apiserver is running on the control plane node. Which command should you use?
89A pod is in 'CrashLoopBackOff' state. You run 'kubectl logs my-pod' and see 'exec format error: exec: "/app": stat /app: no such file or directory'. What is the most likely cause?
90You want to check the current resource usage (CPU and memory) of pods in the 'default' namespace. Which kubectl command should you use?
91You are debugging a DNS issue from within a pod. The pod is running 'busybox'. Which command would you use to test DNS resolution for 'kubernetes.default.svc.cluster.local'?
92You see events like 'Failed to pull image' when running 'kubectl get events'. Which command can you use to get more detailed information about a specific pod's events?
93A pod is not able to communicate with another pod in the same namespace. Both pods are running and have IP addresses. Which command can you use to test connectivity from the first pod to the second pod's IP?
94You run 'kubectl get nodes' and see that a node is 'NotReady'. You SSH into the node and run 'systemctl status kubelet'. The output shows 'Active: inactive (dead)'. What is the most likely cause?
95A pod is stuck in 'Pending' state. Which TWO of the following are common causes?
96Which THREE of the following are valid methods to troubleshoot a CrashLoopBackOff pod?
97Which TWO of the following are valid commands to check the status of control plane components?
98A pod in the 'production' namespace is in a CrashLoopBackOff state. The pod has been running successfully for several days. You run 'kubectl describe pod app-pod -n production' and see the message: 'OOMKilled'. What is the MOST appropriate action to resolve this issue?
99Which kubectl command is used to view the logs of a container that has previously crashed in a pod?
100You have a pod that is in 'Pending' state. Which command would you run to get more information about why the pod cannot be scheduled?
101A node in your cluster shows status 'NotReady'. You have SSH access to the node. What is the first command you should run to check the kubelet status?
102You create a Deployment with the following YAML: apiVersion: apps/v1 kind: Deployment metadata: name: myapp spec: replicas: 3 selector: matchLabels: app: myapp template: metadata: labels: app: myapp spec: containers: - name: myapp image: myapp:latest resources: limits: cpu: "500m" memory: "256Mi" requests: cpu: "200m" memory: "128Mi" After applying it, the pods are in 'CrashLoopBackOff'. You check logs and see 'Error: container process exited with error'. What is the MOST likely cause?
103You are troubleshooting a pod that cannot start. Running 'kubectl describe pod' shows the event: 'Failed to pull image "myregistry.io/myapp:1.0": rpc error: code = Unknown desc = Error response from daemon: manifest for myregistry.io/myapp:1.0 not found'. What is the MOST likely cause?
104You need to check the current resource usage of nodes in your cluster. Which command should you use?
105A pod is in 'ImagePullBackOff' state. You run 'kubectl describe pod' and see the event: 'Failed to pull image "nginx:latest": rpc error: code = Unknown desc = Error response from daemon: Get https://registry-1.docker.io/v2/: dial tcp: lookup registry-1.docker.io on 8.8.8.8:53: no such host'. What is the MOST likely cause?
106You have a multi-node cluster. One node is healthy but pods scheduled on another node are failing to resolve DNS names. You exec into a pod and run 'nslookup kubernetes.default.svc.cluster.local' and get 'server can't find kubernetes.default.svc.cluster.local: NXDOMAIN'. Running the same command on a pod on the healthy node works. What is the MOST likely cause?
107You run 'kubectl get pods' and see a pod in 'CrashLoopBackOff'. You want to see the logs of the last crashed instance. Which command should you run?
108Which command can you run to see the events related to a specific pod?
109You have a pod that is stuck in 'Pending' state. Running 'kubectl describe pod' shows the event: '0/3 nodes are available: 1 node(s) had taint {node-role.kubernetes.io/master: }, that the pod didn't tolerate, 2 node(s) didn't match pod anti-affinity rules.' What is the MOST likely solution?
110Which TWO of the following are valid steps to troubleshoot a pod that is in 'CrashLoopBackOff'?
111Which TWO of the following are possible reasons for a node being in 'NotReady' state?
112Which THREE of the following are possible causes for a pod to remain in 'Pending' state?
113A node in your cluster is reporting 'NotReady' status. You log into the node and run 'systemctl status kubelet'. The kubelet service is not running. Which command should you use to start the kubelet and enable it to start on boot?
114You run 'kubectl get pods' and see a pod in 'ImagePullBackOff' state. Which command would help you determine the exact reason for the image pull failure?
115A Pod is stuck in Pending state. 'kubectl describe pod' shows the event: '0/4 nodes are available: 1 node had taint {node-role.kubernetes.io/control-plane: }, that the pod didn't tolerate, 3 Insufficient cpu.' Which of the following is the most likely combination of issues?
116A developer reports that a Pod named 'web-pod' in namespace 'frontend' is crashing repeatedly. You run 'kubectl logs web-pod -n frontend' but see no output. Which command should you run next to see the logs from the previous, crashed container instance?
117You have a Deployment with one replica. The Pod enters CrashLoopBackOff. 'kubectl describe pod' shows the container exits with code 137. What is the most likely cause?
118You need to check the CPU and memory usage of all pods in the 'production' namespace. Which command should you use?
119A pod is in 'Pending' state. After running 'kubectl describe pod', you see the event: '0/3 nodes are available: 3 PersistentVolumeClaim is not bound'. What is the most likely cause?
120You have a Service named 'my-svc' in namespace 'app'. A Pod in the same namespace cannot resolve the hostname 'my-svc' to an IP address. Which command should you run inside the Pod to test DNS name resolution?
121A Pod is in 'CrashLoopBackOff' state. You run 'kubectl logs <pod> --previous' and see an error about a missing environment variable. The Pod spec defines the environment variable in a ConfigMap. What is the best next step to diagnose the issue?
122A Pod that was running fine is now showing 'OOMKilled' in its status. Which kubectl command would show you the memory limit and usage for the container?
123You have a Deployment with 3 replicas. One of the pods is in 'Pending' state. 'kubectl describe pod' shows: 'Warning FailedScheduling 0/4 nodes are available: 1 node(s) had taint {key1: value1}, that the pod didn't tolerate, 3 node(s) didn't match pod anti-affinity rules.' Which two issues are preventing the pod from being scheduled?
124You run 'kubectl get pods' and see a pod with status 'Init:CrashLoopBackOff'. What does this indicate?
125Which TWO of the following are valid methods to diagnose why a node is in 'NotReady' state?
126A pod is in 'CrashLoopBackOff' state. Which THREE of the following are possible causes?
127Which TWO of the following are valid reasons a pod might be stuck in 'Pending' state?
128A pod in the 'production' namespace is in a CrashLoopBackOff state. The pod has been running successfully for several days. You run 'kubectl describe pod app-pod -n production' and see the message: 'OOMKilled'. What is the MOST appropriate action to resolve this issue?
129You have a Pod that is stuck in Pending state. Which command should you use to get detailed information about why the Pod is not running?
130A node in the cluster is showing NotReady status. Which steps should you take to diagnose the issue? (Select the BEST initial step.)
131After deploying a new Deployment, you notice that the pods are stuck in ImagePullBackOff. What is the most common cause?
132You have a Pod that is stuck in Pending state. Running 'kubectl describe pod' shows events: '0/4 nodes are available: 1 node(s) had taint {node-role.kubernetes.io/control-plane: }, 3 node(s) had taint {key: value}, that the pod didn't tolerate.' How can you resolve this issue?
133Which command should you use to view the logs of a container that has previously crashed in a Pod?
134A Pod is in CrashLoopBackOff. You run 'kubectl describe pod' and see that the container fails with 'Error: container command not found'. What is the most likely cause?
135You have a Deployment with the following resource limits for containers: memory: 256Mi. The pod is repeatedly killed with OOMKilled. You need to change the limit to 512Mi. Which field should you modify in the Deployment YAML?
136Which of the following is a valid command to check the status of the kubelet service on a node running systemd?
137You have two Pods in different namespaces. Pod A (namespace: frontend) cannot reach Pod B (namespace: backend) via the ClusterIP service 'backend-svc' in the backend namespace. Which command should you use from within Pod A to test connectivity?
138You are debugging a DNS issue in the cluster. Which of the following tools is commonly used to test DNS resolution from within a Pod?
139A Pod is running but cannot connect to a Service. You have verified that the Service endpoints are correct. Which of the following is the most likely cause if the Pod is using hostNetwork: true?
140Which TWO of the following are valid commands to view cluster events sorted by timestamp?
141Which THREE of the following are common causes for a Pod to remain in Pending state? (Select THREE.)
142You have a Pod that is in CrashLoopBackOff. Which TWO of the following commands would be most helpful in diagnosing the issue?
143A node in your cluster is in the 'NotReady' state. You SSH into the node and run 'systemctl status kubelet' which shows the kubelet is active but not functioning correctly. Which command should you use to get detailed logs to troubleshoot the kubelet?
144You have a pod named 'web-pod' that is in a CrashLoopBackOff state. To examine the logs from the previous instance of the container, which command should you use?
145A pod is in the 'Pending' state for a long time. You run 'kubectl describe pod pending-pod' and see the event: '0/4 nodes are available: 1 node(s) had taint {node.kubernetes.io/not-ready: }, 3 node(s) had taint {node-role.kubernetes.io/control-plane: } that the pod didn't tolerate.' What is the MOST likely solution?
146You run 'kubectl get pods' and see a pod with status 'ImagePullBackOff'. Which of the following is a possible cause?
147You need to check the resource usage of nodes in your cluster. Which command should you run?
148A pod named 'app' is not starting. You run 'kubectl describe pod app' and see the event: 'MountVolume.SetUp failed for volume "pvc-volume" : rpc error: code = NotFound desc = volume not found'. What is the most likely issue?
149You suspect a DNS issue inside a pod. Which command can you run to test DNS resolution from within a pod?
150You are troubleshooting a network connectivity issue between two pods in different namespaces. The pods have the following labels: pod-a in namespace 'foo' with labels {app: web}, pod-b in namespace 'bar' with labels {app: db}. You verify that both pods have IP addresses and can ping the Kubernetes service IP. However, pod-a cannot connect to pod-b on port 5432. What should you check first?
151Which command shows all events in the cluster, sorted by timestamp?
152You run 'kubectl get pods' and see a pod with status 'CrashLoopBackOff'. You check the logs with 'kubectl logs <pod> --previous' and see: 'Error: unable to connect to database at db-svc:5432 (connection refused)'. What is the most likely cause?
153You are debugging a CrashLoopBackOff. The pod YAML includes the following container spec: ```yaml containers: - name: app image: myapp:1.0 livenessProbe: httpGet: path: /healthz port: 8080 initialDelaySeconds: 3 periodSeconds: 3 ``` The application starts successfully but crashes after about 10 seconds. What is the most likely cause?
154You need to view the logs of the previous instance of a container in a pod. Which command is correct?
155Which TWO of the following are common causes for a pod to be in the 'Pending' state?
156Which THREE of the following are valid steps to troubleshoot a node that is in 'NotReady' state?
157Which TWO of the following are valid commands to check resource usage of pods?
158You suspect the kubelet on a worker node is not functioning correctly. Which command should you use to check the kubelet service status?
159A pod is stuck in Pending state. You run 'kubectl describe pod my-pod' and see the event: '0/4 nodes are available: 1 node(s) had taint {node.kubernetes.io/not-ready: }, 3 node(s) had taint {node-role.kubernetes.io/control-plane: }, that the pod didn't tolerate'. What is the most likely cause?
160You run 'kubectl logs my-pod -c my-container' and get no output, but you know the container produces logs. What should you do next to see previous container logs?
161You have a Deployment with 3 replicas. Two pods are Running, one is CrashLoopBackOff. You run 'kubectl logs pod-crash -c app' and see: 'Error: failed to connect to database at 10.0.0.5:5432'. The database is a separate Pod with a ClusterIP service. How should you verify network connectivity from the crashing pod to the database service?
162A pod is in ImagePullBackOff state. Which command would give you the most information about why the image pull failed?
163You need to check the memory usage of all pods in the 'production' namespace. Which command fulfills this requirement?
164A node is NotReady. You ssh into the node and run 'systemctl status kubelet'. It shows 'Active: inactive (dead)'. What is the most appropriate next step?
165You are troubleshooting DNS resolution from a pod. You exec into the pod and run 'nslookup kubernetes.default.svc.cluster.local'. It fails with 'server can't find kubernetes.default.svc.cluster.local: NXDOMAIN'. What is the most likely cause?
166You want to see events sorted by timestamp in the 'kube-system' namespace. Which command should you use?
167A pod is in CrashLoopBackOff. You check the logs with 'kubectl logs my-pod --previous' and see 'Error: cannot connect to database at 10.0.0.1:3306'. The database service is named 'mysql' and runs on port 3306. What is the most likely cause?
168You run 'kubectl top nodes' and get an error: 'error: metrics not available yet'. What is the most likely cause?
169A pod is in Pending state. You run 'kubectl describe pod pending-pod' and see an event: '0/3 nodes are available: 3 Insufficient memory'. However, you believe there is enough memory across the cluster. What could be the issue?
170Which two commands can be used to view the logs of a container that has crashed? (Choose two.)
171Which three are possible reasons for a pod being in Pending state? (Choose three.)
172You have a pod that is in CrashLoopBackOff. Which two troubleshooting steps should you take first? (Choose two.)
173A pod in the 'production' namespace is in a CrashLoopBackOff state. The pod has been running successfully for several days. You run 'kubectl describe pod app-pod -n production' and see the message: 'OOMKilled'. What is the MOST appropriate action to resolve this issue?
174You are troubleshooting a node that is in 'NotReady' state. Which command should you use to check the kubelet logs for errors?
175A pod is stuck in 'Pending' state. You run 'kubectl describe pod mypod' and see the event: '0/3 nodes are available: 1 node(s) had taint {node.kubernetes.io/unreachable: }, that the pod didn't tolerate, 2 Insufficient memory.' Which issue is causing the pod to be pending?
176You have a Deployment with 3 replicas. After updating the container image, the new pods are in 'ImagePullBackOff' state. You run 'kubectl describe pod <pod-name>' and see the event: 'Failed to pull image "myregistry/myapp:latest": rpc error: code = Unknown desc = Error response from daemon: manifest for myregistry/myapp:latest not found: manifest unknown: manifest unknown'. What is the MOST likely cause?
177You are trying to debug a network connectivity issue between two pods. Pod A can reach the internet but cannot reach Pod B's IP address. Which command should you use to test connectivity from within Pod A to Pod B's service?
178You are troubleshooting a pod that is in 'CrashLoopBackOff' state. You run 'kubectl logs mypod' and get no output. You then run 'kubectl logs mypod --previous' and see an error: 'Error: failed to start container: context deadline exceeded'. What is the MOST likely cause?
179Which command can be used to view resource usage of nodes in a cluster?
180You run 'kubectl get events --sort-by=.lastTimestamp' and see the following events for a pod: 'Warning FailedScheduling 0/3 nodes are available: 3 Insufficient cpu'. What is the most likely solution?
181A Pod is stuck in 'Pending' state. Upon inspection, you find that the PVC it references is also 'Pending'. Which of the following is NOT a common cause for a PVC to remain in Pending state?
182You are troubleshooting DNS resolution from within a pod. You exec into the pod and run 'nslookup kubernetes.default.svc.cluster.local'. The command fails with 'connection timed out; no servers could be reached'. However, 'kubectl get svc -n kube-system' shows the kube-dns service with a ClusterIP. What is the MOST likely cause?
183You want to check the status of the kube-apiserver on a control plane node. Which commands should you use? (Select the best option)
184You deploy a pod with the following YAML: apiVersion: v1 kind: Pod metadata: name: test-pod spec: containers: - name: test image: nginx resources: requests: memory: "64Mi" cpu: "250m" limits: memory: "128Mi" cpu: "500m" The pod starts, but after a few minutes it is killed with OOMKilled. What is the MOST likely reason?
185Which TWO of the following are valid reasons for a pod to be in 'CrashLoopBackOff' state? (Choose two.)
186You are troubleshooting a node that is 'NotReady'. Which THREE of the following are possible causes? (Choose three.)
187Which TWO of the following kubectl commands can be used to view the logs of a container in a pod? (Choose two.)
188A pod in the 'production' namespace is in CrashLoopBackOff state. Running 'kubectl describe pod web-app -n production' shows the event 'OOMKilled'. What is the most appropriate action to resolve this issue?
189You run 'kubectl get pods -n default' and see a pod named 'backend' in ImagePullBackOff state. What is the most likely cause?
190Which command shows the logs of a pod that has crashed and restarted?
191A pod remains in Pending state. 'kubectl describe pod' shows '0/1 nodes are available: 1 node(s) had taint {node-role.kubernetes.io/master: }, that the pod didn't tolerate'. What is the solution?
192You run 'kubectl get nodes' and one node shows 'NotReady'. You SSH into the node and run 'systemctl status kubelet'. Kubelet is active but 'journalctl -u kubelet -n 50' shows 'network plugin is not ready: cni config uninitialized'. What is the most likely cause?
193You run 'kubectl top nodes' but get 'error: metrics not available yet'. What is the most likely cause?
194A newly created Service of type ClusterIP is not accessible from within the cluster. You exec into a pod and run 'curl http://service-name:80' but get 'Connection refused'. What is the most likely cause?
195Which command shows events sorted by timestamp for troubleshooting recent issues?
196You run 'kubectl get pods' and see a pod named 'db' in CrashLoopBackOff. 'kubectl logs db' shows nothing. 'kubectl logs db --previous' shows 'Error: database connection failed'. What is the most likely cause?
197A pod is in CrashLoopBackOff. You want to see the current container's logs. Which command do you use?
198You have a Deployment with 3 replicas. One pod is in CrashLoopBackOff. The other two are Running. You run 'kubectl get events' and see 'Liveness probe failed: HTTP probe failed with statuscode: 503'. What should you do?
199A pod is stuck in Pending state. 'kubectl describe pod' shows '0/1 nodes are available: 1 node(s) had taint {key=value: NoSchedule}, 1 node(s) had taint {node.kubernetes.io/unreachable: }. What does this indicate?
200Which THREE are valid methods to debug DNS resolution inside a pod? (Select 3)
201A pod is in Pending state. Which TWO are possible causes? (Select 2)
202Which THREE are valid steps to troubleshoot a node that is in 'NotReady' state? (Select 3)
203A node in the cluster is showing status 'NotReady'. You run 'kubectl describe node worker1' and see that the kubelet has not posted node status for more than 1 minute. Which command should you run on the node to check the kubelet logs?
204You deploy a pod with the following YAML and it remains in 'Pending' state. What is the most likely cause? ```yaml apiVersion: v1 kind: Pod metadata: name: myapp spec: containers: - name: myapp image: nginx resources: requests: memory: "64Gi" ```
205A pod is in CrashLoopBackOff. 'kubectl logs my-pod --previous' shows: 'Error: failed to start: exec: "/app/start.sh": stat /app/start.sh: no such file or directory'. What is the most likely cause?
206You run 'kubectl get pods' and see that a pod named 'web' is in 'ImagePullBackOff' state. Which command would help you see the reason for the image pull failure?
207You have a Deployment that is failing to update. You run 'kubectl rollout status deployment/myapp' and it hangs. Which command should you run to see the status of individual ReplicaSets and pods?
208A user reports that they cannot access a service running in the cluster from within another pod. They run 'kubectl exec -it pod-a -- curl http://service-b:8080' and get a connection timeout. What is the first thing you should check?
209You run 'kubectl get events --sort-by='.lastTimestamp'' and see repeated events: 'Failed to pull image "myimage:v2": rpc error: code = Unknown desc = Error response from daemon: manifest for myimage:v2 not found'. What is the issue?
210A pod is stuck in 'Pending' state. You run 'kubectl describe pod mypod' and see: '0/4 nodes are available: 4 node(s) didn't match pod anti-affinity constraints'. What does this mean?
211You deploy a pod with a readiness probe that checks an HTTP endpoint. The probe fails and the pod is marked as 'NotReady'. Which command would you use to see the exact HTTP response from the probe?
212You run 'kubectl top nodes' and it returns an error: 'error: metrics not available yet'. What does this indicate?
213You have a Deployment with three replicas. After a rolling update, all pods are in 'CrashLoopBackOff'. You want to revert to the previous revision. Which command accomplishes this?
214You try to run 'kubectl logs mypod' and get the error: 'Error from server (BadRequest): container "myapp" in pod "mypod" is waiting to start: PodInitializing'. What does this mean?
215A pod is in 'Pending' state. 'kubectl describe pod' shows: '0/3 nodes are available: 1 Insufficient memory, 2 node(s) had taint {node-role.kubernetes.io/control-plane: }, that the pod didn't tolerate.' Which TWO actions would resolve the issue? (Choose two)
216You have a pod that is 'CrashLoopBackOff'. 'kubectl logs --previous mypod' shows: 'Liveness probe failed: HTTP probe failed with statuscode: 503'. Which THREE actions could potentially fix the issue? (Choose three)
217You run 'kubectl get pods' and see that a pod named 'db' is in 'CrashLoopBackOff'. Which TWO commands are most useful for diagnosing the issue? (Choose two)
218A pod is stuck in Pending state. You run 'kubectl describe pod my-pod' and see the event '0/1 nodes are available: 1 Insufficient cpu'. What is the most likely cause?
219You run 'kubectl get nodes' and one node shows 'NotReady'. Which command should you run first to check the kubelet status on that node?
220A pod has been restarted multiple times. You want to see the logs from the previous (terminated) container instance. Which command should you use?
221A pod is in CrashLoopBackOff. The YAML for the initContainer is: apiVersion: v1 kind: Pod metadata: name: myapp spec: initContainers: - name: init image: busybox command: ['sh', '-c', 'sleep 5 && exit 1'] containers: - name: app image: nginx What is the most likely reason for the CrashLoopBackOff?
222You run 'kubectl get pods' and one pod shows 'ImagePullBackOff'. Which command would help you diagnose the issue?
223A pod is in Pending state. You see the event: '0/2 nodes are available: 2 node(s) had taint {node-role.kubernetes.io/control-plane: }, that the pod didn't tolerate'. What should you do to schedule the pod on one of the control-plane nodes?
224You need to test DNS resolution from within a pod. Which command should you run?
225A Deployment is configured with replicas: 3, but only 2 pods are running. 'kubectl describe deployment' shows 'Replicas: 3 desired | 2 current | 2 available'. What could be the reason for the missing pod?
226Which command shows resource usage (CPU/memory) of all pods in the default namespace?
227You suspect the kube-apiserver is down. Which command should you run on the control-plane node to check its status?
228A pod is in Running state but cannot connect to a service by its DNS name. 'kubectl exec <pod> -- curl http://my-service' fails. Which is the most likely cause?
229Which command shows all recent events in the cluster, sorted by timestamp?
230Which TWO actions can help troubleshoot a service that is not reachable from within a pod?
231A node is NotReady. Which THREE conditions could cause this?
232Which TWO commands can be used to view logs for a pod?
233A pod in the 'production' namespace is in a CrashLoopBackOff state. The pod has been running successfully for several days. You run 'kubectl describe pod app-pod -n production' and see the message: 'OOMKilled'. What is the MOST appropriate action to resolve this issue?
234You have a pod that is in 'Pending' state. Which command would you use to view detailed information about the pod's status, including events that may indicate why it is not running?
235A node in your cluster is reporting 'NotReady'. You SSH into the node and run 'systemctl status kubelet'. The output shows 'Active: inactive (dead)'. Which command should you run FIRST to attempt to resolve this?
236A pod is stuck in 'Pending' state. 'kubectl describe pod' shows the following event: '0/4 nodes are available: 1 node(s) had taint {node-role.kubernetes.io/master: }, that the pod didn't tolerate, 3 node(s) had taint {node.kubernetes.io/disk-pressure: }, that the pod didn't tolerate.' Which action would resolve the issue?
237You have a pod that is in 'CrashLoopBackOff' state. Which command should you use to view the logs from the previous instance of the container?
238You attempt to schedule a pod but it remains 'Pending'. 'kubectl describe pod' shows the event: '0/3 nodes are available: 3 node(s) didn't match node selector.' What is the MOST likely cause?
239You run 'kubectl top nodes' and get an error: 'error: metrics not available yet'. Which of the following is the MOST likely cause?
240A pod is in 'ImagePullBackOff' state. Which of the following is NOT a common cause?
241You need to check the logs of a kubelet on a node. Which command should you run on the node?
242You run 'kubectl get pods' and see that a pod is in 'CrashLoopBackOff'. You want to examine the container's previous exit code. Which command provides this information?
243You are troubleshooting a DNS issue in the cluster. You exec into a pod and run 'nslookup kubernetes.default.svc.cluster.local'. The command returns 'server can't find kubernetes.default.svc.cluster.local: NXDOMAIN'. What is the MOST likely cause?
244You have a pod that is in 'Pending' state because it requires a PersistentVolumeClaim that is not bound. Which event would you see in 'kubectl describe pod'?
245A pod is in 'Pending' state. Which TWO of the following are possible causes? (Select 2)
246You are troubleshooting a scenario where a pod cannot communicate with another pod in the same namespace via service name. Which THREE steps would you take to diagnose the issue? (Select 3)
247A node is in 'NotReady' state. Which TWO of the following are common causes? (Select 2)
248A pod is stuck in Pending state. Running 'kubectl describe pod mypod' reveals the event '0/4 nodes are available: 3 Insufficient memory, 1 node(s) had taints that the pod didn't tolerate'. What is the most likely cause?
249You suspect the kubelet on a worker node has stopped. Which two commands should you run to confirm the kubelet status and check its logs?
250To view the logs of a specific container in a multi-container pod named 'web-pod', which command is correct?
251A Deployment has 3 replicas. One pod is in CrashLoopBackOff. You run 'kubectl logs <pod> --previous' and see only a single line: 'Error: failed to start container'. What is the most likely cause?
252Which command shows resource usage for pods and nodes in the cluster?
253A pod is in ImagePullBackOff state. You run 'kubectl describe pod mypod' and see 'Failed to pull image "myapp:latest": rpc error: code = Unknown desc = Error response from daemon: manifest for myapp:latest not found: manifest unknown'. What is the most likely cause?
254You have a Deployment with livenessProbe configured. The pod restarts every few minutes. 'kubectl describe pod' shows the liveness probe is failing with 'HTTP probe failed with statuscode: 503'. The application's /healthz endpoint returns 200 from within the pod using 'kubectl exec'. What could be the issue?
255You are unable to resolve a Service DNS name from within a pod. To test DNS resolution, which command should you run inside the pod?
256Which command lists all events in the cluster sorted by timestamp?
257A node shows status NotReady. You SSH into the node and run 'systemctl status kubelet' which shows the kubelet is active (running). What is the next most likely step to diagnose the issue?
258A pod is stuck in Pending with event '0/4 nodes are available: 1 node(s) had taint "node.kubernetes.io/disk-pressure", and 3 node(s) had taint "node.kubernetes.io/memory-pressure", that the pod didn't tolerate'. What is the best approach to schedule the pod?
259You want to see the last 50 lines of logs from a pod named 'api-pod' for the container 'api-container'. Which command accomplishes this?
260Which TWO of the following are valid causes for a pod to be in CrashLoopBackOff?
261Which THREE of the following are valid steps to troubleshoot DNS issues in a Kubernetes cluster?
262Which TWO of the following could cause a Node to be in NotReady state?
263A node in your cluster is marked as NotReady. You SSH into the node and run 'systemctl status kubelet'. The output shows the kubelet is inactive (dead). What should you do FIRST to restore the node?
264Which command can you use to view the logs of a container that has crashed and been restarted?
265A Deployment named 'web' is failing to schedule pods. You run 'kubectl describe pod web-xyz' and see the event: '0/3 nodes are available: 3 Insufficient cpu.' What is the most likely cause?
266A pod remains in Pending state. You run 'kubectl describe pod mypod' and see the following event: '0/3 nodes are available: 2 node(s) had taint {node-role.kubernetes.io/control-plane: }, that the pod didn't tolerate, 1 node(s) didn't match pod anti-affinity rules.' What is the best action to schedule the pod?
267You deploy a pod with image 'nginx:1.21'. It stays in ImagePullBackOff. You run 'kubectl describe pod nginx-pod' and see the event: 'Failed to pull image "nginx:1.21": rpc error: code = Unknown desc = Error response from daemon: manifest for nginx:1.21 not found'. What is the most likely fix?
268Which command shows resource usage (CPU and memory) for nodes in a cluster?
269You cannot reach a service by its DNS name from within a pod. Running 'kubectl exec -it dnsutils -- nslookup myservice' returns 'server can't find myservice: NXDOMAIN'. Which of the following is the MOST probable cause?
270The controller-manager logs show repeated errors: 'Failed to list *v1.Pod: connection refused'. What is the most likely cause?
271A pod is in CrashLoopBackOff. You run 'kubectl logs pod-name' and see nothing. You suspect the app is failing due to a missing environment variable. Which command can you use to verify environment variables inside the container?
272Which command would you use to view all events in the cluster sorted by timestamp?
273A pod is in ImagePullBackOff. Which TWO of the following are possible causes? (Select 2)
274A pod is stuck in Pending state. Which TWO of the following could be causes? (Select 2)
275Which THREE of the following are valid commands to troubleshoot network connectivity between pods? (Select 3)
276You run 'kubectl get pods' and see that a pod is in CrashLoopBackOff. Which THREE of the following are valid next steps? (Select 3)
277Which TWO of the following are valid ways to view the status of the kube-apiserver? (Select 2)
278You are troubleshooting a node that shows 'NotReady' status. Which TWO commands can help you investigate the kubelet state?
279A pod is stuck in 'Pending' state. Which THREE are common causes for this?
280You run 'kubectl logs pod-name' and get no output. Which TWO steps should you take to troubleshoot further?
281You suspect a DNS issue within the cluster. Which TWO commands can you run from within a pod to test DNS resolution?
282A pod is in 'ImagePullBackOff' state. Which TWO are valid first troubleshooting steps?
283You run 'kubectl get pods' and see a pod with status 'CrashLoopBackOff'. Which TWO commands can help you investigate the cause?
284A node is 'NotReady'. Which THREE steps should you take to troubleshoot?
285You want to check resource usage of pods and nodes. Which TWO commands should you use?
286You need to test network connectivity between two pods in different namespaces. Which TWO approaches are valid?
287Which TWO commands show cluster events that can help in troubleshooting?
288You are troubleshooting a pod that is in 'Pending' state. 'kubectl describe pod' shows '0/1 nodes are available: 1 Insufficient memory, 1 node(s) had taint {node-role.kubernetes.io/control-plane: }, that the pod didn't tolerate'. Which TWO actions can resolve the issue?
289You need to check the status of control plane components. Which TWO commands are appropriate?
290A pod is repeatedly killed with 'OOMKilled'. The container's memory request is 256Mi and limit is 512Mi. Which THREE steps could help resolve the issue?
The Troubleshooting domain covers the key concepts tested in this area of the CKA exam blueprint published by CNCF. Courseiva provides free domain-focused practice, mock exams, missed-question review, and readiness tracking across all CKA domains — no account required.
The Courseiva CKA question bank contains 290 questions in the Troubleshooting domain. Click any question to see the full explanation and answer breakdown.
Start with a 10-question focused session to identify your baseline accuracy in this domain. Read every explanation — even for questions you answer correctly — to understand the reasoning. Once you score consistently above 80%, move to a 20–30 question session to confirm depth before moving to the next domain.
Yes — the session launcher on this page draws questions exclusively from the Troubleshooting domain. Choose 10, 20, 30, or 50 questions for a focused session, or click individual questions to review them one by one.
Save your results, see per-domain analytics, and get readiness scores — free, for every certification.
Sign Up FreeFree forever · Every certification included