Minimal and explicit resource requests for image-puller pods #1764
Conversation
If a k8s cluster has a LimitRange resource, it may end up allocating, for example, 100m CPU for containers, which is a waste of resources. It would be a waste because the running container only does `sh -c "Done!"` anyhow. While image pulling does happen, it is done before the actual container starts, so it's worth noting we don't need any resources specified for that.

Another reason for specifying resources, and making them configurable, is that sometimes there is a ResourceQuota resource that requires resources to be explicitly specified on pods.

Related:
- Default CPU requests - https://kubernetes.io/docs/tasks/administer-cluster/manage-resources/cpu-default-namespace/
- LimitRange - https://kubernetes.io/docs/concepts/policy/limit-range/
- ResourceQuota - https://kubernetes.io/docs/concepts/policy/resource-quotas/
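For illustration, here is a minimal sketch of the kind of LimitRange that causes this defaulting; the name, namespace, and values are hypothetical and vary per cluster:

apiVersion: v1
kind: LimitRange
metadata:
  name: cpu-defaults        # hypothetical name
  namespace: jhub           # hypothetical namespace
spec:
  limits:
    - type: Container
      defaultRequest:
        cpu: 100m           # applied to any container that omits its own request
      default:
        cpu: 500m           # default limit applied when none is set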
Force-pushed from f90c5ea to 1ddafd3.
Yeah, the LimitRange has bitten me more than once.

Does setting the request to '0' work well? I remember seeing that it would be counted as unset, making request == limit, but I'm not sure.
Small suggestion to reduce the limit even more!
    cpu: 0
    memory: 0
  limits:
    cpu: 10m
Suggested change:
-    cpu: 10m
+    cpu: 1m
    memory: 0
  limits:
    cpu: 10m
    memory: 10Mi
Suggested change:
-    memory: 10Mi
+    memory: 1Mi
About 0 requests

These were the resources reported back from kubectl get pod -o yaml after trying to set the resources as described. I suspect KubeSpawner may have logic that checks for a truthy value of the CPU request and leaves it out if not set, or something similar, which could be an issue though.
resources:
  limits:
    cpu: 10m
    memory: 10Mi
  requests:
    cpu: "0"
    memory: "0"
Relevant to this is that I've now learned that LimitRange resources can specify a maxLimitRequestRatio, which would require the influenced resource type to not have any field set to zero. Do we want to avoid this by having a request of 1?
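As a hedged sketch of that constraint (the name and ratio below are hypothetical), a LimitRange like this rejects containers whose request is zero, because the limit-to-request ratio can't be computed:

apiVersion: v1
kind: LimitRange
metadata:
  name: ratio-constraint    # hypothetical name
spec:
  limits:
    - type: Container
      maxLimitRequestRatio:
        cpu: "10"           # limit may be at most 10x the request, so the
                            # request must be non-zero for the ratio to exist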
Reducing limits further
We can do this, but are you sure that the containers will stay below 1Mi of memory usage? My main concern is that I'll worry about this when debugging some other issue and spend time wondering whether this narrow limit could have caused the problem.
I decided to try using an extremely narrow limit and found that in my k8s cluster with Calico etc. the pod with 1m / 1Mi limits got stuck in ContainerCreating, while the 10m / 10Mi limited pod started successfully.

These were the events on the 1m / 1Mi pod:
Events:
Type Reason Age From Message
---- ------ ---- ---- -------
Normal Scheduled 3m18s default-scheduler Successfully assigned default/tmp-6f699cc94f-62sm6 to gke-nh-2020-core-standard-2-b80a0890-kfvt
Warning FailedCreatePodSandBox 104s (x3 over 2m47s) kubelet, gke-nh-2020-core-standard-2-b80a0890-kfvt Failed create pod sandbox: rpc error: code = Unknown desc = failed to start sandbox container for pod "tmp-6f699cc94f-62sm6": Error response from daemon: OCI runtime create failed: container_linux.go:349: starting container process caused "process_linux.go:319: getting the final child's pid from pipe caused \"read init-p: connection reset by peer\"": unknown
Warning FailedCreatePodSandBox 33s (x2 over 68s) kubelet, gke-nh-2020-core-standard-2-b80a0890-kfvt Failed create pod sandbox: rpc error: code = Unknown desc = failed to start sandbox container for pod "tmp-6f699cc94f-62sm6": Error response from daemon: OCI runtime create failed: container_linux.go:349: starting container process caused "process_linux.go:449: container init caused \"read init-p: connection reset by peer\"": unknown
Normal SandboxChanged 32s (x5 over 2m46s) kubelet, gke-nh-2020-core-standard-2-b80a0890-kfvt Pod sandbox changed, it will be killed and re-created.
I think the newly introduced Pod Overhead could mitigate this and allow us to have extremely low limits, but since that is a lot of extra infrastructure, I suggest we stick with the current 10m / 10Mi to be safe.
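For reference, a minimal sketch of how Pod Overhead is declared; the RuntimeClass name, handler, and numbers are hypothetical, and the API group was beta at the time of writing:

apiVersion: node.k8s.io/v1beta1   # node.k8s.io/v1 in newer clusters
kind: RuntimeClass
metadata:
  name: overhead-aware            # hypothetical name
handler: runc
overhead:
  podFixed:
    cpu: 250m      # accounted for the sandbox/runtime, on top of container requests
    memory: 120Mi

Pods opting in via runtimeClassName: overhead-aware get this overhead added during scheduling and cgroup sizing, which is what would let the container limits themselves stay tiny.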
@yuvipanda what do you think, merge as it is?
I think we should actually remove the limits. On my cluster (AKS), setting this to 10Mi causes the pods to constantly crash as well. Why don't we just remove the limits?
@yuvipanda ah yes, deal. Are we unlimited by leaving them out?
<3 for checking through this, @consideRatio. I always appreciate how thorough you are.
This memory limit seems to include pod overhead, which can vary based on cluster config. In my cluster (AKS) with Calico enabled, the memory limit was too low and kept crashing these pods for very strange reasons - the usual reasons aren't reported since it's not *our* process that is causing the OOM kills, but the cluster's overhead.

By removing the default resource limits, we prevent this from being a problem. Theoretically this would let the pods have unbounded memory - but since we control the contents, that won't be the case in practice. We leave the explicit default requests as they are, for the reasons outlined in jupyterhub#1764. The capacity for admins to specify their own limits is still in place, in case you want that for your cluster.
Closes #1761
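To make the end state concrete, here is a hedged sketch of the resulting container resources block under this change, assuming no cluster LimitRange re-applies defaults (key placement is illustrative, not the chart's exact schema):

resources:
  requests:
    cpu: 0         # explicit, to satisfy ResourceQuota and sidestep LimitRange defaults
    memory: 0
  # no limits set: the container may burst in theory, but it only runs `sh -c "Done!"`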