-
Notifications
You must be signed in to change notification settings - Fork 1.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Issue with controller manager when deploying Velero on a fresh 1.26/1.27 EKS cluster #6350
Comments
@ArchiFleKs This question may not be easy to pinpoint. Have you tried only installing Calico to do the test? |
Yes to be honest I managed to reproduced the issue with only Calico. It is
really hard to know what is causing this issue.
…On Wed, Jun 7, 2023, 12:09 qiuming ***@***.***> wrote:
@ArchiFleKs <https://github.com/ArchiFleKs> This question may not be easy
to pinpoint. Have you tried only installing Calico to do the test?
And What‘s the original version of velero and the upgraded version?
—
Reply to this email directly, view it on GitHub
<#6350 (comment)>,
or unsubscribe
<https://github.com/notifications/unsubscribe-auth/AAJXRKKNW722TAYQH6JPF6TXKBHPJANCNFSM6AAAAAAYZFSE5Y>
.
You are receiving this because you were mentioned.Message ID:
***@***.***>
|
I have the same issue in a cluster deployed with just calico also. Is possible that the GC is prevented to run properly due to a problem with calico installation?
|
Maybe we should continue this issue in Calico. Regarding the bgpfilters it is an issue with operators rbac not having the bgpfilters in roles. But it does not fix the issue when editing RBAC manually. |
I found this to work (except the calico APIServer will clobber the changes and break it again). kubectl get clusterroles/calico-crds -o json | jq '.rules[] |= ( if ( (.apiGroups | index("crd.projectcalico.org")) and (.resources | index("bgpfilters") | not) ) then .resources += [ "bgpfilters" ] else . end )' | kubectl apply -f - |
per latest comment, the root cause is the roles installed by calico, closing this issue. |
What steps did you take and what happened:
I deployed an EKS cluster with Terraform. Then I installed a lot of middleware, especially:
When Velero is running the job to upgrade CRDs I go into the state describe in these issue:
On EKS controller manager logs I get :
Which render the cluster completely unusable as I can't restart the controller manager on EKS.
I managed to remove Velero and Calico as well as removing all the Velero CRDs and Calico CRDs then the controller manager GC started working again.
This lead me to believe that the issue is with Velero at the CRDs upgrade process I'm not 100% sure.
Use the "reaction smiley face" up to the right of this comment to vote.
Edit:
I still get perpetual errors in controller manager now:
The text was updated successfully, but these errors were encountered: