Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Stream coordinator: fixes to automatic membership changes. #10331

Merged
merged 13 commits into from
Jan 26, 2024

Conversation

kjnilsson
Copy link
Contributor

@kjnilsson kjnilsson commented Jan 12, 2024

Various bug fixes to make stream coordinator membership changes more reliable. Previously various errors could happen as well as partially successful attempts where the membership change command may fail but it leaves the new server running.

Also ensure that stream coordinator members are removed as part of the forget_cluster_node command.

Also added rabbitmq-streams coordinator_status command so that we can get a view of replication and other information in a nice tabular format, same as rabbitmq-queues quorum_status

This PR also contains a change to allow rabbit_nodes:list_member/0 to return [] (rather than [node()]) when the query to fetch the current members fails. This allows us to detect this scenario in the code and avoid, e.g. shrinking the coordinator cluster incorrectly.

Also adding back a missing clause in the quorum queue tick handler that was accidentally removed in #10364

@mergify mergify bot added the bazel label Jan 12, 2024
Various bug fixes to make stream coordinator membership changes
more reliable. Previously various errors could happen as well as
partially successful attempts where the membership change command
may fail but it leaves the new server running.

Also ensure that stream coordinator members are removed as part of
the forget_cluster_node command.

Add stream coordinator status command.

To show the raft status of the stream coordinator just like is done
for quorum queues.
Don't do clustering with mixed versions anyway. Please.
Also refactor to avoid double read of queue record.
And fix a race condition flake in amqp10_client_SUITE.
@kjnilsson kjnilsson marked this pull request as ready for review January 25, 2024 10:42
@kjnilsson kjnilsson added this to the 3.13.0 milestone Jan 25, 2024
@kjnilsson kjnilsson merged commit b1d7037 into main Jan 26, 2024
19 checks passed
@kjnilsson kjnilsson deleted the stream-coordinator-fixes branch January 26, 2024 15:39
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants