
Compactors require high memory as traces combine and grow in the backend #976

Closed
annanay25 opened this issue Sep 22, 2021 · 1 comment · Fixed by #1317
@annanay25 (Contributor)

Describe the bug

Compactors require high memory as traces combine and grow in the backend. This opens the possibility of crafting a long-running trace with a low spans-per-second rate that keeps growing in the backend and eventually OOMs compactors.

To Reproduce
Steps to reproduce the behavior:

  1. Start Tempo (SHA or version): all versions up to e5f7ded
  2. Perform Operations (Read/Write/Others): Carefully craft super-long-running traces with a few spans every second; the compactors will eventually combine them into a MEGA trace (the largest we are seeing so far is 1.3GB)

Expected behavior

Compactors do not keep OOMing.

Environment:

  • Infrastructure: [e.g., Kubernetes, bare-metal, laptop]
  • Deployment tool: [e.g., helm, jsonnet]

Additional Context

Some possibilities considered (a rough size-cap sketch follows this list):

  • Write multiple splits of a trace into the same block (might be harder than it sounds)
  • Do not compact blocks that contain very large traces
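
A minimal sketch of how a size cap could be applied while combining trace segments. All names (combineWithLimit, maxBytesPerTrace) and the 50 MiB value are assumptions made for illustration, not Tempo's actual code or configuration:

```go
package main

import "fmt"

// maxBytesPerTrace is an assumed, illustrative cap on the combined trace size.
const maxBytesPerTrace = 50 * 1024 * 1024 // 50 MiB

// combineWithLimit appends trace segments in order until adding the next one
// would exceed the cap, then returns what was combined plus the leftover
// segments so they could be written to the block as separate splits.
func combineWithLimit(segments [][]byte, limit int) (combined []byte, leftover [][]byte) {
	for i, seg := range segments {
		if len(combined)+len(seg) > limit {
			return combined, segments[i:]
		}
		combined = append(combined, seg...)
	}
	return combined, nil
}

func main() {
	// Three segments of one trace: 30 MiB + 30 MiB + 10 MiB.
	segs := [][]byte{make([]byte, 30<<20), make([]byte, 30<<20), make([]byte, 10<<20)}
	combined, leftover := combineWithLimit(segs, maxBytesPerTrace)
	fmt.Printf("combined %d bytes, %d segment(s) left as separate splits\n", len(combined), len(leftover))
}
```

The leftover segments would then stay as separate splits, which is essentially the first bullet above combined with a cap so the compactor never has to hold the full MEGA trace in memory.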
@mdisibio (Contributor)

> a MEGA trace (the largest we are seeing so far is 1.3GB)

For both possibilities listed, a trace of this size would still be trouble for the queriers: even if the compactor writes multiple splits of the trace (or skips the block entirely), the querier is still expected to recombine all of the segments. A possible solution on the querier side is to limit the amount of data returned in a single call and add a new paged API to retrieve all the splits (a rough sketch of that idea is at the end of this comment). As a quick starting point for the per-call limit, 100MiB? Even a 100MiB trace is quite large and hard to work with.

Also, I propose to call any trace over 1GB a GIGA trace :)
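
A minimal sketch of the paged-API idea, assuming a hypothetical TracePage/fetchTracePage shape and the 100MiB per-call limit mentioned above. None of these names exist in Tempo; each call returns at most the byte cap worth of spans plus a continuation token for the next page:

```go
package main

import "fmt"

// pageSizeLimit is the 100 MiB per-call cap floated above; the value and all
// names here (TracePage, fetchTracePage) are illustrative, not a Tempo API.
const pageSizeLimit = 100 << 20

// TracePage is one size-bounded slice of a very large trace.
type TracePage struct {
	Spans     [][]byte // serialized spans returned in this call
	NextToken int      // index of the first span of the next page; -1 when done
}

// fetchTracePage returns at most pageSizeLimit bytes of spans starting at token,
// always returning at least one span so callers make progress.
func fetchTracePage(allSpans [][]byte, token int) TracePage {
	page := TracePage{NextToken: -1}
	size := 0
	for i := token; i < len(allSpans); i++ {
		if size+len(allSpans[i]) > pageSizeLimit && len(page.Spans) > 0 {
			page.NextToken = i
			break
		}
		page.Spans = append(page.Spans, allSpans[i])
		size += len(allSpans[i])
	}
	return page
}

func main() {
	// A 180 MiB trace split into three 60 MiB chunks of spans.
	spans := [][]byte{make([]byte, 60<<20), make([]byte, 60<<20), make([]byte, 60<<20)}
	for token := 0; token != -1; {
		page := fetchTracePage(spans, token)
		fmt.Printf("page with %d span(s), next token %d\n", len(page.Spans), page.NextToken)
		token = page.NextToken
	}
}
```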
