
Support node CIDR mask config #488

Closed
g-gaston opened this issue Oct 25, 2021 · 7 comments
Labels: area/cni Kubernetes CNIs for EKS-A, kind/enhancement New feature or request, team/cli

@g-gaston (Member)

Right now, the kube-controller-manager is using the default --node-cidr-mask-size (/24 for IPv4 and /64 for IPv6).
Add the ability to configure this through the EKS-A cluster config CRD.
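
For reference, a minimal sketch of what this could look like in the cluster spec. The nodes.cidrMaskSize field name is an assumption for illustration only, not an existing API field; the idea is that it would be passed through to the kube-controller-manager's --node-cidr-mask-size flag:

apiVersion: anywhere.eks.amazonaws.com/v1alpha1
kind: Cluster
metadata:
  name: my-cluster
spec:
  clusterNetwork:
    pods:
      cidrBlocks:
        - 192.168.0.0/16
    services:
      cidrBlocks:
        - 10.96.0.0/12
    nodes:
      # hypothetical field: size of the per-node pod CIDR allocated by the controller manager
      cidrMaskSize: 26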

@g-gaston changed the title from "Support node CIDR mask" to "Support node CIDR mask config" on Oct 25, 2021
@jaxesn added this to the next milestone on Nov 8, 2021
@jaxesn modified the milestones: next, next+1 on Jan 10, 2022
@jaxesn modified the milestones: next+1, backlog on Jan 26, 2022
@g-gaston added the kind/enhancement, area/cni, and team/cli labels on Apr 25, 2022
@CharudathGopal commented May 13, 2022

We are trying to deploy an EKS Anywhere cluster with more than 500 nodes and are hitting this issue too.

We would appreciate any workarounds or suggestions to move ahead.
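
One way to confirm a cluster is hitting this limit (assuming the controller manager has simply run out of /24 ranges inside the pod CIDR) is to look for nodes left without a spec.podCIDR and for CIDRNotAvailable events from the node IPAM controller; a rough sketch:

kubectl get events -A --field-selector reason=CIDRNotAvailable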

@jaxesn modified the milestones: backlog, next on May 17, 2022
@jaxesn (Member) commented May 17, 2022

We are going to go ahead and look into this one and see if we can get something done quickly. We'll plan on having it in our late June release (0.10.0), but we could produce a dev build if you are interested in testing it earlier once we have something in place.

@CharudathGopal commented May 18, 2022

@jaxesn Please let us know when you have the fix, happy to give it a shot!

FYI, here is the command we used to set the CIDR annotation on the nodes, after which the Cilium pods came up as expected. Before this workaround, only 254 Cilium pods came up because the /24 mask was used.

kubectl annotate node --all --overwrite io.cilium.network.ipv4-pod-cidr=192.168.0.0/16

@jaxesn (Member) commented May 18, 2022

When you initially created the cluster with 500 nodes, what were some of the values of the Cilium annotation before you changed it? This information may still exist as spec.podCIDR(s) on the node objects.
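
Something along these lines should list what the controller manager originally handed out (just a sketch; adjust the output format as needed):

kubectl get nodes -o custom-columns=NAME:.metadata.name,PODCIDR:.spec.podCIDR,PODCIDRS:.spec.podCIDRs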

@jaxesn (Member) commented May 18, 2022

@CharudathGopal would you mind giving me a few more details on your network setup and what CIDR range and masks you would like to set?

@CharudathGopal

Here is a snapshot of the Cilium config.

auto-direct-node-routes                        false
bpf-lb-map-max                                 65536
bpf-map-dynamic-size-ratio                     0.0025
bpf-policy-map-max                             16384
cgroup-root                                    /run/cilium/cgroupv2
cilium-endpoint-gc-interval                    5m0s
cluster-id
cluster-name                                   poc-22
cni-chaining-mode                              portmap
custom-cni-conf                                false
debug                                          false
disable-cnp-status-updates                     true
enable-auto-protect-node-port-range            true
enable-bandwidth-manager                       false
enable-bpf-clock-probe                         true
enable-bpf-masquerade                          true
enable-endpoint-health-checking                true
enable-health-check-nodeport                   true
enable-health-checking                         true
enable-hubble                                  true
enable-ipv4                                    true
enable-ipv6                                    false
enable-l7-proxy                                true
enable-local-redirect-policy                   false
enable-metrics                                 true
enable-policy                                  default
enable-remote-node-identity                    true
enable-session-affinity                        true
enable-well-known-identities                   false
enable-xt-socket-fallback                      true
hubble-disable-tls                             false
hubble-listen-address                          :4244
hubble-socket-path                             /var/run/cilium/hubble.sock
hubble-tls-cert-file                           /var/lib/cilium/tls/hubble/server.crt
hubble-tls-client-ca-files                     /var/lib/cilium/tls/hubble/client-ca.crt
hubble-tls-key-file                            /var/lib/cilium/tls/hubble/server.key
identity-allocation-mode                       crd
install-iptables-rules                         true
ipam                                           kubernetes
kube-proxy-replacement                         probe
kube-proxy-replacement-healthz-bind-address
masquerade                                     true
monitor-aggregation                            medium
monitor-aggregation-flags                      all
monitor-aggregation-interval                   5s
node-port-bind-protection                      true
operator-api-serve-addr                        127.0.0.1:9234
operator-prometheus-serve-addr                 :6942
preallocate-bpf-maps                           false
prometheus-serve-addr                          :9090
proxy-prometheus-port                          9095
sidecar-istio-proxy-image                      cilium/istio_proxy
tunnel                                         geneve
wait-bpf-mount                                 false

With this configuration, Cilium pods were failing to come up after reaching 255 nodes, so I added a few more params:

cluster-pool-ipv4-cidr                         192.168.0.0/16
cluster-pool-ipv4-mask-size                    16
ipv4-pod-cidr                                  192.168.0.0/16
ipv4-range                                     192.168.0.0/16
allocate-node-cidrs                            true

This did not make much of a difference. Finally, after setting the annotation on the nodes using this command, the Cilium pods came up:

kubectl annotate node --all --overwrite io.cilium.network.ipv4-pod-cidr=192.168.0.0/16
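
A likely explanation for the cluster-pool-* settings having no visible effect is that they only apply when ipam is set to cluster-pool; with ipam set to kubernetes, as in the config above, Cilium takes each node's pod range from the node's spec.podCIDR, which the kube-controller-manager allocates according to --node-cidr-mask-size. To compare what each node was assigned with the annotation set above, something like this sketch can be used:

kubectl get nodes -o jsonpath='{range .items[*]}{.metadata.name}{"\t"}{.spec.podCIDR}{"\t"}{.metadata.annotations.io\.cilium\.network\.ipv4-pod-cidr}{"\n"}{end}'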

@jaxesn (Member) commented May 18, 2022

After that annotation change, are pods running on all nodes? Could you send the results of "kubectl get pods -A -o wide"? Setting the pod CIDR range to the entire /16 block on all nodes seems like it shouldn't work, since all the nodes could potentially be trying to assign pods the same IPs as other nodes.

I think exposing the node CIDR mask makes a lot of sense, and @mitalipaygude is actively looking at what it will take to do that, but I want to make sure it would actually solve the problem in your environment. Are you thinking of leaving the pod CIDR the same, 192.168.0.0/16, and then changing the node CIDR mask to something like /28 to increase the number of available node ranges but limit the number of pods on each node? Or were you thinking of opening up your CIDR range to something like 10.0.0.0/8 to have more total IPs?
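
Rough arithmetic for reference: with the pod CIDR left at 192.168.0.0/16, the default /24 node mask yields 2^(24-16) = 256 per-node ranges of 256 addresses each, which is why the cluster tops out at around 255 nodes. A /28 mask would yield 2^(28-16) = 4096 per-node ranges, but each has only 16 addresses (roughly 14 usable pod IPs, depending on the CNI's reservations). Widening the pod CIDR to something like 10.0.0.0/8 while keeping the /24 mask would instead give 2^(24-8) = 65536 ranges of 256 addresses each.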
