New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
kubelet: add graceful shutdown events #101081
kubelet: add graceful shutdown events #101081
Conversation
/kind bug |
92fa3b7
to
89f7a02
Compare
Generally LGTM! /cc @wzshiming who's also contributed a lot to this area. |
/retest |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
dbus-send --system /org/freedesktop/login1 org.freedesktop.login1.Manager.PrepareForShutdown boolean:true
<invalid> Normal Shutdown node/zsm-test-1 Shutdown manager detected new shutdown event
<invalid> Normal Shutdown node/zsm-test-1 Shutting down 2 pods
<invalid> Normal Killing pod/test-84cfc7bdf7-kn95k Stopping container test
<invalid> Normal Killing pod/test-84cfc7bdf7-rj8gl Stopping container test
0s Normal SuccessfulCreate replicaset/test-84cfc7bdf7 Created pod: test-84cfc7bdf7-xfb7f
<invalid> Normal NodeNotReady node/zsm-test-1 Node zsm-test-1 status is now: NodeNotReady
0s Normal Scheduled pod/test-84cfc7bdf7-xfb7f Successfully assigned default/test-84cfc7bdf7-xfb7f to zsm-test-1
<invalid> Warning Shutdown pod/test-84cfc7bdf7-xfb7f Node is in progress of shutting down, not admitting any new pods
0s Normal SuccessfulCreate replicaset/test-84cfc7bdf7 Created pod: test-84cfc7bdf7-wpzqv
0s Normal Scheduled pod/test-84cfc7bdf7-wpzqv Successfully assigned default/test-84cfc7bdf7-wpzqv to zsm-test-1
<invalid> Warning Shutdown pod/test-84cfc7bdf7-wpzqv Node is in progress of shutting down, not admitting any new pods
0s Normal SuccessfulCreate replicaset/test-84cfc7bdf7 Created pod: test-84cfc7bdf7-5vdbf
dbus-send --system /org/freedesktop/login1 org.freedesktop.login1.Manager.PrepareForShutdown boolean:false
<invalid> Normal Shutdown node/zsm-test-1 Shutdown manager detected new shutdown event
<invalid> Normal NodeReady node/zsm-test-1 Node zsm-test-1 status is now: NodeReady
0s Normal TaintManagerEviction pod/test-84cfc7bdf7-wpzqv Cancelling deletion of Pod default/test-84cfc7bdf7-wpzqv
0s Normal TaintManagerEviction pod/test-84cfc7bdf7-kn95k Cancelling deletion of Pod default/test-84cfc7bdf7-kn95k
0s Normal TaintManagerEviction pod/test-84cfc7bdf7-rj8gl Cancelling deletion of Pod default/test-84cfc7bdf7-rj8gl
0s Normal TaintManagerEviction pod/test-84cfc7bdf7-xfb7f Cancelling deletion of Pod default/test-84cfc7bdf7-xfb7f
Seeing that the time information in the event is invalid, is it possible to include time information, which is still very useful when troubleshooting the cause of the shutdown
nit: |
89f7a02
to
2a3b5ae
Compare
@wzshiming updated to incorporate your comments... I should have got them all. Thanks! |
2a3b5ae
to
d597cb3
Compare
/test pull-kubernetes-node-e2e |
/assign @bobbypage |
d597cb3
to
4a97632
Compare
@bobbypage Thank you for the review. I fixed the messages. |
/test pull-kubernetes-integration |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
/lgtm
looked at previous review feedback and I think this is good to go assuming tests all pass
/lgtm Thanks @rphillips for all your work on adding these events to graceful node shutdown! |
/assign @mrunalp |
4a97632
to
d9be5ab
Compare
re- |
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: mrunalp, rphillips The full list of commands accepted by this bot can be found here. The pull request process is described here
Needs approval from an approver in each of these files:
Approvers can indicate their approval by writing |
The Kubernetes project has merge-blocking tests that are currently too flaky to consistently pass. This bot retests PRs for certain kubernetes repos according to the following rules:
You can:
/retest |
2 similar comments
The Kubernetes project has merge-blocking tests that are currently too flaky to consistently pass. This bot retests PRs for certain kubernetes repos according to the following rules:
You can:
/retest |
The Kubernetes project has merge-blocking tests that are currently too flaky to consistently pass. This bot retests PRs for certain kubernetes repos according to the following rules:
You can:
/retest |
What type of PR is this?
/bug
/sig node
/triage accepted
/priority important-soon
What this PR does / why we need it:
#96129 added Graceful Node Shutdown to the Kubelet. #99735 moved the feature to Beta. This PR adds events to the shutdown logic so a user will know the status of the shutdown.
Which issue(s) this PR fixes:
Fixes #
Special notes for your reviewer:
cc @mrunalp @ehashman @harche @bobbypage
Does this PR introduce a user-facing change?
Additional documentation e.g., KEPs (Kubernetes Enhancement Proposals), usage docs, etc.: