[Feature] Introduce HTTP endpoint to restart embedded etcd #24

unmarshall · 2024-02-02T05:34:52Z

Feature (What you would like to be added):
Introduce a HTTP endpoint to allow external agents to restart the embedded etcd.

Motivation (Why is this needed?):
Use case:
To update advertise-peer-urls it is mandated by etcd to restart the member post making the member update call.
Refer: https://etcd.io/docs/v3.3/op-guide/runtime-configuration/#update-advertise-peer-urls

Today etcd-druid works around this missing feature by doing the following (Refer code):

etcd-druid updates the StatefulSet to ensure that any pending secret volume(s) are mounted and the config map changes are seen by the etcd-backup-restore container.
etcd-backup-restore makes the member update call as part of the starting the server. Refer code.
To ensure that the update to peer URL is reflected in the embedded etcd etcd-druid also triggers a deletion of all existing etcd pods forcing a restart.

The current implementation in etcd-druid is synchronous with waits embedded between steps. It is not crash friendly. If etcd-druid crashed in the middle of handling the peer URL TLS changes then it could result in a non-functioning etcd cluster. In addition etcd-backup-restore currently reports the status of peer URL TLS enablement by only looking at the mounted etcd configuration. This does not accurately indicate what the embedded etcd sees.

Therefore we need to follow the recommendations and ensure that the update is completed by first making the member update call immediately followed by restart of the member. The endpoint that is proposed to be exposed out of etcd-wrapper will be invoked by the etcd-backup-restore container just after the member-update call.

Approach/Hint to the implement solution (optional):

The text was updated successfully, but these errors were encountered:

shreyas-s-rao · 2024-06-24T09:33:40Z

/close since it is no longer required for scale-up of etcds in gardener/etcd-druid#777

unmarshall added the kind/enhancement Enhancement, improvement, extension label Feb 2, 2024

unmarshall mentioned this issue Feb 2, 2024

[Enhancement] Update member URL should also restart the embedded etcd run by etcd-wrapper via restart endpoint gardener/etcd-backup-restore#712

Open

shreyas-s-rao mentioned this issue Feb 2, 2024

☂ Druid Refactor to Address Multiple Controller Conflicts gardener/etcd-druid#728

Closed

shreyas-s-rao self-assigned this Feb 2, 2024

shreyas-s-rao added this to the v0.2.0 milestone Feb 2, 2024

unmarshall mentioned this issue Feb 23, 2024

Changed go version to 1.22 and added /stop endpoint #25

Closed

unmarshall self-assigned this Feb 23, 2024

shreyas-s-rao removed this from the v0.2.0 milestone May 21, 2024

gardener-robot closed this as completed Jun 24, 2024

gardener-robot added the status/closed Issue is closed (either delivered or triaged) label Jun 24, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature] Introduce HTTP endpoint to restart embedded etcd #24

[Feature] Introduce HTTP endpoint to restart embedded etcd #24

unmarshall commented Feb 2, 2024

shreyas-s-rao commented Jun 24, 2024

[Feature] Introduce HTTP endpoint to restart embedded etcd #24

[Feature] Introduce HTTP endpoint to restart embedded etcd #24

Comments

unmarshall commented Feb 2, 2024

shreyas-s-rao commented Jun 24, 2024