What happens when a master fails? What happens when a worker fails?
What happens when a master fails? What happens when a worker fails?
When a Kubernetes master node fails, the impact on the cluster depends on the number of master nodes and the specific components that fail. Here are the key points:
Single Master Node Failure:
Multi-Master Node Setup:
Recovery:
When a worker node fails, the impact on the cluster is generally less severe than a master node failure, but it still affects the availability of applications running on that node. Here are the key points:
Pod Rescheduling:
Persistent Volumes:
Service Availability:
Node Health Checks:
In summary, the failure of a master node in a single-master setup can severely impact the cluster's ability to manage itself, while a multi-master setup provides redundancy and resilience. Worker node failures primarily affect the availability of the applications running on them, but Kubernetes' self-healing capabilities help mitigate this by rescheduling pods to healthy nodes.
Citations:
[1] https://github.com/kubernetes/kubeadm/issues/2138
[2] https://stackoverflow.com/questions/39172131/what-happens-when-the-kubernetes-master-fails
[3] https://discuss.kubernetes.io/t/unable-to-join-worker-node-to-the-master-node/24460
[4] https://discuss.kubernetes.io/t/what-would-happen-in-the-event-of-master-node-failure/19645
[5] https://kubernetes.io/docs/tasks/debug/debug-cluster/
[6] https://www.gremlin.com/blog/how-to-ensure-your-kubernetes-cluster-can-tolerate-lost-nodes
[7] https://discuss.kubernetes.io/t/solved-kubeadm-upgrade-fails-on-worker-node/27425
[8] https://www.reddit.com/r/kubernetes/comments/uhgz1i/how_are_services_managed_when_theres_a_node/
[9] https://discuss.kubernetes.io/t/dns-fail-in-worker-node-but-fine-in-master-node/24895
[10] https://serverfault.com/questions/1020224/how-to-recover-from-master-failure-in-kubernetes
[11] https://www.reddit.com/r/kubernetes/comments/wj383k/help_me_understand_the_concept_of_a_failed_worker/
[12] https://kubernetes.io/docs/concepts/architecture/nodes/
[13] https://discuss.kubernetes.io/t/error-while-setting-up-a-clucter-unable-to-join-the-worker-node-please-someone-help-me-im-an-intern-help-me-to-do-this/27096
[14] https://discuss.kubernetes.io/t/kubernetes-multpiple-control-plane-nodes-cluster-not-working-when-one-control-plane-node-fails/25318
[15] https://stackoverflow.com/questions/71838539/kubernetes-worker-node-went-down-what-will-happen-to-the-pod
[16] https://blog.mosuke.tech/en/entry/2022/07/22/kubernetes-node-down/
[17] https://www.techtarget.com/searchitoperations/tip/Ensure-Kubernetes-high-availability-with-master-node-planning
[18] https://discuss.kubernetes.io/t/how-to-restore-master-failure-in-kubernetes/11352
[19] https://kubernetes.io/docs/concepts/scheduling-eviction/topology-spread-constraints/
[20] https://discuss.kubernetes.io/t/how-to-test-a-node-for-failure-force-a-node-to-fail/14386
middle
Gợi ý câu hỏi phỏng vấn
Chưa có bình luận nào