TimeoutError waiting for topic_storage_purged in TopicDeleteCloudStorageTest.topic_delete_unavailable_test #8071

Closed
dlex opened this issue Jan 6, 2023 · 2 comments
Assignees: jcsp
Labels: area/cloud-storage (Shadow indexing subsystem), ci-disabled-test, ci-failure, kind/bug (Something isn't working), sev/low (Bugs which are non-functional paper cuts, e.g. typos, issues in log messages)

Comments

dlex (Contributor) commented Jan 6, 2023

Module: rptest.tests.topic_delete_test
Class: TopicDeleteCloudStorageTest
Method: topic_delete_unavailable_test

on (amd64, container) https://buildkite.com/redpanda/redpanda/builds/20621#01858070-ed5d-4031-a40a-b86891ceab01
on (arm64, VM) https://buildkite.com/redpanda/vtools/builds/5007#01857e64-9b47-44f1-ade7-92fcfff06d73

    TimeoutError('')
Traceback (most recent call last):
  File "/usr/local/lib/python3.10/dist-packages/ducktape/tests/runner_client.py", line 135, in run
    data = self.run_test()
  File "/usr/local/lib/python3.10/dist-packages/ducktape/tests/runner_client.py", line 227, in run_test
    return self.test_context.function(self.test)
  File "/root/tests/rptest/services/cluster.py", line 35, in wrapped
    r = f(self, *args, **kwargs)
  File "/root/tests/rptest/tests/topic_delete_test.py", line 238, in topic_delete_unavailable_test
    wait_until(lambda: topic_storage_purged(self.redpanda, self.topic),
  File "/usr/local/lib/python3.10/dist-packages/ducktape/utils/util.py", line 57, in wait_until
    raise TimeoutError(err_msg() if callable(err_msg) else err_msg) from last_exception
ducktape.errors.TimeoutError

Related to #7460
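
The empty message in `TimeoutError('')` is consistent with the `wait_until` call at line 238 not supplying an `err_msg`: the `raise` line in the traceback passes `err_msg` (or its return value, if callable) straight into the exception. The following is a minimal sketch of that polling pattern, not ducktape's actual implementation; `wait_until_sketch`, `WaitTimeoutError`, and `describe_failure` are hypothetical names introduced here for illustration.

```python
import time


class WaitTimeoutError(Exception):
    """Hypothetical stand-in for ducktape.errors.TimeoutError."""


def wait_until_sketch(condition, timeout_sec, backoff_sec=0.1, err_msg=""):
    """Poll condition() until it returns truthy or timeout_sec elapses."""
    deadline = time.monotonic() + timeout_sec
    while True:
        if condition():
            return
        if time.monotonic() >= deadline:
            # err_msg may be a plain string or a zero-argument callable,
            # mirroring the raise line shown in the traceback.
            raise WaitTimeoutError(err_msg() if callable(err_msg) else err_msg)
        time.sleep(backoff_sec)


def describe_failure(condition, **kwargs):
    """Run the poll and return repr() of the timeout error, or 'ok'."""
    try:
        wait_until_sketch(condition, **kwargs)
    except WaitTimeoutError as e:
        return repr(e)
    return "ok"
```

Under these assumptions, a condition that never becomes true and no `err_msg` yields an exception with an empty message, which gives no hint about what was being waited for. Passing a descriptive `err_msg` would make a failure like this one easier to triage.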

dlex added the kind/bug (Something isn't working) and ci-failure labels Jan 6, 2023
dlex changed the title from "TimeoutError witing for topic_storage_purged in TopicDeleteCloudStorageTest.topic_delete_unavailable_test" to "TimeoutError waiting for topic_storage_purged in TopicDeleteCloudStorageTest.topic_delete_unavailable_test" Jan 6, 2023
jcsp added a commit to jcsp/redpanda that referenced this issue Jan 6, 2023
jcsp self-assigned this Jan 6, 2023
jcsp added the area/cloud-storage (Shadow indexing subsystem) and sev/low (Bugs which are non-functional paper cuts, e.g. typos, issues in log messages) labels Jan 6, 2023
jcsp (Contributor) commented Jan 6, 2023

A leadership transfer happens very shortly before the topic deletion. This could be a real bug where our deletion is racing with the transfer somehow.

dlex (Contributor, Author) commented Jan 6, 2023

This may not be the same failure, but it is also a timeout in the same test:

    TimeoutError('Segments were not removed')
Traceback (most recent call last):
  File "/usr/local/lib/python3.10/dist-packages/ducktape/tests/runner_client.py", line 135, in run
    data = self.run_test()
  File "/usr/local/lib/python3.10/dist-packages/ducktape/tests/runner_client.py", line 227, in run_test
    return self.test_context.function(self.test)
  File "/root/tests/rptest/services/cluster.py", line 35, in wrapped
    r = f(self, *args, **kwargs)
  File "/root/tests/rptest/tests/topic_delete_test.py", line 260, in topic_delete_unavailable_test
    self._populate_topic(next_topic)
  File "/root/tests/rptest/tests/topic_delete_test.py", line 207, in _populate_topic
    wait_for_segments_removal(self.redpanda, topic_name, i, 5)
  File "/root/tests/rptest/util.py", line 256, in wait_for_segments_removal
    wait_until(done,
  File "/usr/local/lib/python3.10/dist-packages/ducktape/utils/util.py", line 57, in wait_until
    raise TimeoutError(err_msg() if callable(err_msg) else err_msg) from last_exception
ducktape.errors.TimeoutError: Segments were not removed

on (amd64, container) https://buildkite.com/redpanda/redpanda/builds/20705#01858463-ffa4-495f-925f-9a6fdf1723f2

jcsp added a commit to jcsp/redpanda that referenced this issue Jan 9, 2023
This enables cluster::partition::stop to proceed promptly,
rather than being blocked if ntp_archiver has an S3 request
in flight.

Fixes: redpanda-data#8071
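
The commit message describes letting a stop call return promptly instead of waiting on an in-flight remote request. The actual change is in Redpanda's C++ code and is not shown in this issue; the following is only a Python `asyncio` analogy of the general idea. `ArchiverSketch`, `time_stop`, and the `abort` flag are hypothetical names, and the slow request is simulated with a sleep.

```python
import asyncio


class ArchiverSketch:
    """Hypothetical background uploader; not Redpanda code."""

    def __init__(self, request_seconds):
        self._request_seconds = request_seconds
        self._stopping = False
        self._task = None

    async def _upload_loop(self):
        while not self._stopping:
            # Simulates a slow remote request, e.g. an object-store call.
            await asyncio.sleep(self._request_seconds)

    def start(self):
        self._task = asyncio.create_task(self._upload_loop())

    async def stop(self, abort):
        self._stopping = True
        if abort:
            # Abort the in-flight request instead of waiting for it.
            self._task.cancel()
        try:
            await self._task
        except asyncio.CancelledError:
            pass


async def time_stop(abort, request_seconds=0.5):
    """Return how long stop() takes while a request is in flight."""
    archiver = ArchiverSketch(request_seconds)
    archiver.start()
    await asyncio.sleep(0.05)  # let the simulated request get in flight
    loop = asyncio.get_running_loop()
    begin = loop.time()
    await archiver.stop(abort)
    return loop.time() - begin
```

In this sketch, `asyncio.run(time_stop(abort=False))` has to wait for the remainder of the simulated request, while `asyncio.run(time_stop(abort=True))` returns almost immediately. A stop path that waits on a slow or unavailable remote could plausibly delay deletion long enough for a test-side poll to time out.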
jcsp added further commits to jcsp/redpanda that referenced this issue, Jan 9 to Jan 17, 2023 (where a commit message is shown, it is identical to the one above)
jcsp closed this as completed in 5f96ac7 on Jan 18, 2023