
Limit number of remote segment readers allocated (bad_allocs in ShadowIndexingWhileBusyTest.test_create_or_delete_topics_while_busy with 10 readers, 1 writer, 1GB ram per core) #6111

Closed
jcsp opened this issue Aug 19, 2022 · 0 comments · Fixed by #7042
Labels: area/cloud-storage (Shadow indexing subsystem), kind/bug (Something isn't working)

Comments

jcsp (Contributor) commented Aug 19, 2022

This issue was seen while updating kgo-verifier. The new version was a bit more efficient in how it looped readers, which probably explains why it was hitting redpanda slightly harder: this destabilized the ShadowIndexingWhileBusyTest.test_create_or_delete_topics_while_busy test, which was writing 24GB of data via a single producer while concurrently reading it via 10 random-access readers.

This was hitting bad_allocs in docker, where redpanda runs with 2 threads and 2GB RAM. It's a low-resource environment, but 1GB of RAM really should be enough to service 10 readers.

The allocator dump shows 850MB of memory in 128KiB extents.

I think it may be caused by the lack of any bound on the number of readers held by materialized segments.
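
For scale: 850MB split into 128KiB extents is roughly 6,800 live buffers, far more than 10 concurrent readers should ever need, which points at readers being retained rather than a one-off spike. Below is a minimal C++ sketch of the suspected failure mode; the types (`materialized_segment`, `remote_segment_reader`) and the offset-matching reuse logic are hypothetical stand-ins, not the actual redpanda code:

```cpp
// Hypothetical sketch of the suspected failure mode (not the redpanda sources):
// a per-segment reader cache that only reuses a stashed reader when the
// requested offset matches, and never evicts, accumulates readers without
// bound under random-access reads of already-hydrated segments.
#include <cstdint>
#include <list>
#include <memory>
#include <vector>

struct remote_segment_reader {
    explicit remote_segment_reader(int64_t o)
      : offset(o) {}
    int64_t offset;
    // Stand-in for the reader's read-ahead buffers (~128KiB apiece).
    std::vector<char> buffer = std::vector<char>(128 * 1024);
};

struct materialized_segment {
    // Returned readers are stashed here for reuse; nothing ever trims it.
    std::list<std::unique_ptr<remote_segment_reader>> readers;

    std::unique_ptr<remote_segment_reader> borrow_reader(int64_t offset) {
        // Reuse a stashed reader only if its offset happens to match.
        for (auto it = readers.begin(); it != readers.end(); ++it) {
            if ((*it)->offset == offset) {
                auto r = std::move(*it);
                readers.erase(it);
                return r;
            }
        }
        // Random-access reads rarely match, so this path dominates and
        // every stale reader stashed above stays resident.
        return std::make_unique<remote_segment_reader>(offset);
    }

    void return_reader(std::unique_ptr<remote_segment_reader> r) {
        readers.push_back(std::move(r)); // unbounded growth: no eviction
    }
};
```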

@jcsp jcsp added the kind/bug (Something isn't working) and area/cloud-storage (Shadow indexing subsystem) labels Aug 19, 2022
jcsp added a commit to jcsp/redpanda that referenced this issue Aug 19, 2022
Previously, we only evicted stale segments, not readers.

So if the segments remained materialized, they could accumulate
ever-larger numbers of readers, resulting in out-of-memory
conditions.

After this change, materialized segments are only allowed
to have one reader in their `readers` list after a call
into borrow_reader(); the net result is that a segment
can have up to two readers cached.

Fixes redpanda-data#6111
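
A minimal sketch of that change, reusing the hypothetical `materialized_segment` type from the sketch above (so, again, not the actual redpanda code): after handing a reader out, the cached `readers` list is eagerly trimmed to at most one entry, capping the per-segment footprint at one cached reader plus the one currently borrowed.

```cpp
// Sketch of the bounded borrow path, built on the hypothetical
// materialized_segment / remote_segment_reader types from the earlier sketch.
#include <cstdint>
#include <memory>

std::unique_ptr<remote_segment_reader>
borrow_reader_bounded(materialized_segment& seg, int64_t offset) {
    auto r = seg.borrow_reader(offset); // reuse-or-create, as before
    // Eagerly evict stale cached readers so at most one survives the call.
    while (seg.readers.size() > 1) {
        seg.readers.pop_front();
    }
    return r;
}
```

Trimming inside the borrow path keeps the bound enforced on the hot path, without relying on a separate sweep that only runs when another segment is hydrated.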
jcsp added a commit to jcsp/redpanda that referenced this issue Aug 19, 2022
This test will bad_alloc sometimes in docker if using
the original parallelism.  This is a redpanda bug, as
the parallelism wasn't terribly high.  It will be fixed
separately, but this commit stabilizes the test in the
meantime.

Related: redpanda-data#6111
pvsune pushed a commit that referenced this issue Aug 24, 2022
This test will bad_alloc sometimes in docker if using
the original parallelism.  This is a redpanda bug, as
the parallelism wasn't terribly high.  It will be fixed
separately, but this commit stabilizes the test in the
meantime.

Related: #6111
jcsp added a commit to jcsp/redpanda that referenced this issue Sep 13, 2022
Previously, when instantiating many readers on many materialized
segments, we were vulnerable to an unbounded number of readers
accumulating:
- excess readers on materialized segments were only GC'd
  when we hydrated another segment.  If readers were hitting
  already-hydrated segments then we would never trim
  the per-segment cache of readers
- in-use readers (i.e. those not stashed in a segment's `readers`
  list) were not tracked anywhere, and there was no limit on how
  many might be created.

This change does not apply any backpressure, but it triggers
proactive dropping of readers when a partition's reader count
exceeds the capacity of a semaphore.

Fixes redpanda-data#6111
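
A minimal sketch of this approach, again using the hypothetical types from the first sketch rather than the actual implementation (the commit describes the bound in terms of a semaphore's capacity; a plain counter models the same budget here): a per-partition budget counts live readers, and once the count reaches the budget, cached readers are dropped partition-wide before a new reader is created, but creation itself is never blocked.

```cpp
// Sketch of a per-partition reader budget with proactive trimming and no
// backpressure (hypothetical types, not the redpanda implementation).
#include <cstddef>
#include <cstdint>
#include <memory>
#include <vector>

struct partition_reader_budget {
    std::size_t capacity = 10;     // plays the role of the semaphore's unit count
    std::size_t live_readers = 0;  // readers alive right now, in use or cached
};

// Drop cached readers partition-wide until we are back under budget, or
// until there is nothing left to drop.
void trim_readers(partition_reader_budget& budget,
                  std::vector<materialized_segment*>& segments) {
    for (auto* seg : segments) {
        while (budget.live_readers >= budget.capacity
               && !seg->readers.empty()) {
            seg->readers.pop_front();
            --budget.live_readers;
        }
    }
}

// No backpressure: if every live reader is genuinely in use we still create
// one more; we only make sure stale cached readers are not hoarding memory.
std::unique_ptr<remote_segment_reader>
make_reader(partition_reader_budget& budget,
            std::vector<materialized_segment*>& segments,
            int64_t offset) {
    if (budget.live_readers >= budget.capacity) {
        trim_readers(budget, segments);
    }
    ++budget.live_readers; // cache reuse elided for brevity
    return std::make_unique<remote_segment_reader>(offset);
}
```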
mmaslankaprv pushed a commit to mmaslankaprv/redpanda that referenced this issue Sep 19, 2022
This test will bad_alloc sometimes in docker if using
the original parallelism.  This is a redpanda bug, as
the parallelism wasn't terribly high.  It will be fixed
separately, but this commit stabilizes the test in the
meantime.

Related: redpanda-data#6111
(cherry picked from commit 5a1273c)
jcsp added a commit to jcsp/redpanda that referenced this issue Oct 18, 2022
test_write_with_node_failures was disabled for a ticket that
was fixed already.

test_write_with_node_failures was disabled unnecessarily, because
the test body had already been tweaked to work around redpanda-data#6111 by using
a smaller reader count until we fix the code to limit concurrent
readers.

Related: redpanda-data#6111
@jcsp jcsp changed the title bad_allocs in ShadowIndexingWhileBusyTest.test_create_or_delete_topics_while_busy with 10 readers, 1 writer, 1GB ram per core Limit number of remote segment readers allocated (bad_allocs in ShadowIndexingWhileBusyTest.test_create_or_delete_topics_while_busy with 10 readers, 1 writer, 1GB ram per core) Oct 31, 2022
@jcsp jcsp closed this as completed in #7042 Nov 4, 2022