Keycloak operator created cluster not working with mutliple keycloak pods

apeschel · June 7, 2021, 4:34pm

Somehow we have keycloak cluster managed with the keycloak-operator which ended up in a state where setting the instance count > 1 causes the login flows to break. The login flow seems to get into an infinite loop of failed logins with first this error message:

21:07:36,645 WARN [org.keycloak.events] (default task-1) type=CODE_TO_TOKEN_ERROR, realmId=master, clientId=security-admin-console, userId=null, ipAddress=10.240.0.5, error=invalid_code, grant_type=authorization_code, code_id=23594b9a-f98f-4ad6-b8b2-437838708843, client_auth_method=client-secret

And then on a second attempted login, an infinite loop resulting in this error over and over:

21:08:17,217 WARN [org.keycloak.events] (default task-11) type=LOGIN_ERROR, realmId=master, clientId=null, userId=null, ipAddress=10.240.0.5, error=expired_code, restart_after_timeout=true, authSessionParentId=f8d19861-6fb7-4425-88df-c183caaa2b11, authSessionTabId=Z-9DszaYy4A

With a single instance, everything works fine.

With multiple instances, going through the ingress does not work, but going through a port-forward to the service end point works fine.

I suspect the service port-forward may work because the ingress load balances requests, but the service does not.

Setting session affinity with the nginx ingress fixes the issue, but I suspect that’s just a band-aid on some broken functionality. I suspect the AuthenticationSessions replication is not working correctly, but I don’t see any indication that it’s failing, and I’m not sure where to look or how to confirm it’s the root issue.

Here are the nginx affinity settings I used on the ingress:

nginx.ingress.kubernetes.io/affinity: cookie
nginx.ingress.kubernetes.io/affinity-mode: persistent

We have another keycloak cluster that is configured almost completely identically, and it’s working fine.

I’m at a complete loss as to what’s going on here though - this cluster was previously working fine, and it only broke when we attempted to update a theme we’re using with the cluster. We’ve tried restarting all the keycloak pods, but the problem still persists.

If anyone could provide some guidance on how to figure out what is going on, it would be much appreciated, thank you!

Topic		Replies	Views
Keycloak 9 - Login fails with code to token exchange error unless login using incognito mode Getting advice authentication , oidc	1	824	April 6, 2021
Keycloak oidc authentication issue on K8s having replica of application server Getting advice authentication , oidc	1	207	November 20, 2023
Login with keycloak	0	399	November 4, 2020
0.5% of logins redirect to error page with "OAuth error" without error in Keycloak log Getting advice	0	447	July 16, 2021
Getting Invalid parameter: redirect_uri when installed with keycloak-operator Configuring the server	3	17735	January 14, 2020

Keycloak operator created cluster not working with mutliple keycloak pods

Related Topics