Fix minor bugs in LZ4H5 #127

nhz2 · 2025-04-09T14:06:15Z

This PR fixes a few minor bugs in lz4h5. I was unable to compile this package locally, so the changes are untested, but I think they will fix the last compatibility issues I'm seeing in JuliaIO/ChunkCodecs.jl#27.

These are the bugs this PR tries to fix:

Encoding empty input currently produces 16 bytes, but the output should be 12 bytes: eleven 0x00 bytes followed by a single 0x01.

This PR removes the special case

        if srcsize == 0:
            write_i4be(<uint8_t*> &dst[dstpos], <uint32_t> 0)
            dstpos += 4

which appends these 4 null bytes, and adds a check inside the while loop that enough space remains for the block header:

            if dstsize - dstpos < 5:
                raise Lz4h5Error('output too small')
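
For reference, the expected 12-byte output for empty input can be reproduced with a small sketch. It assumes the header is an 8-byte big-endian total size followed by a 4-byte big-endian block size, and that the block size is 1 for empty input (inferred from the byte pattern above). `empty_lz4h5_header` is a hypothetical helper, not part of imagecodecs.

```python
import struct

def empty_lz4h5_header(blocksize: int = 1) -> bytes:
    """Header for empty input under the assumed layout (hypothetical helper).

    Assumption: 8-byte big-endian total size, then 4-byte big-endian block size.
    """
    return struct.pack('>qi', 0, blocksize)

# Eleven 0x00 bytes followed by one 0x01, as described above.
expected = bytes(11) + b'\x01'
print(empty_lz4h5_header() == expected)
print(len(empty_lz4h5_header()))
```

With no blocks following the header, nothing else is written, which is why the extra 4 null bytes are dropped.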

Encoding more than 2 GB causes an integer overflow.

    >>> import imagecodecs
    >>> imagecodecs.lz4h5_encode(bytearray(2147483647))
    Traceback (most recent call last):
      File "<stdin>", line 1, in <module>
      File "imagecodecs/_lz4.pyx", line 305, in imagecodecs._lz4.lz4h5_encode
    imagecodecs.Lz4h5Error: LZ4_compress_fast returned 0

The added min(dstsize - dstpos, 2147483647) should fix this problem by clamping the destination capacity passed to LZ4_compress_fast, which takes its sizes as C ints.
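
A minimal sketch of why the clamp matters, assuming the failure comes from the remaining output space being narrowed to a 32-bit C int. `clamped_capacity` is a hypothetical name used only for illustration, and `ctypes.c_int32` stands in for the narrowing cast.

```python
import ctypes

INT_MAX = 2**31 - 1  # largest value of a 32-bit C int

def clamped_capacity(dstsize: int, dstpos: int) -> int:
    """Remaining output space, limited to what fits in a 32-bit C int."""
    return min(dstsize - dstpos, INT_MAX)

# Hypothetical remaining space slightly above INT_MAX.
remaining = 2**31 + 100

# ctypes does no overflow checking, so this mimics a narrowing cast to int.
wrapped = ctypes.c_int32(remaining).value
print(wrapped < 0)  # the unclamped value wraps to a negative capacity

# The clamped value stays a valid, positive int.
print(clamped_capacity(remaining, 0) == INT_MAX)
```

Clamping only limits the capacity reported for a single block, so it does not reduce how much output the encoder can produce overall.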

When encoding more than one block, if the first block fills the output buffer, the encoder silently produces incorrect output instead of raising an error.

    >>> import imagecodecs
    >>> src = bytearray(17)
    >>> dst = bytearray(26)
    >>> out = imagecodecs.lz4h5_encode(src, out=dst, blocksize=16)
    >>> imagecodecs.lz4h5_decode(out) == src
    False

This should be fixed by removing the "and dstpos < dstsize" clause from the while loop condition and instead raising an error if the remaining dst space is too small.
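
The intended control flow can be sketched with a toy pure-Python model. This is not the imagecodecs implementation: it assumes the header layout of an 8-byte total size plus a 4-byte block size, stores every block uncompressed, and checks for the whole raw block rather than the 5-byte minimum used in the PR. `OutputTooSmall` and `encode_raw_blocks` are hypothetical names.

```python
import struct

class OutputTooSmall(ValueError):
    """Hypothetical stand-in for imagecodecs.Lz4h5Error."""

def encode_raw_blocks(src: bytes, dstsize: int, blocksize: int) -> bytes:
    """Toy model of the block loop; every block is stored uncompressed."""
    dst = bytearray(struct.pack('>qi', len(src), blocksize))
    if len(dst) > dstsize:
        raise OutputTooSmall('output too small')
    srcpos = 0
    # Loop only on remaining input, never on remaining output space.
    while srcpos < len(src):
        block = src[srcpos:srcpos + blocksize]
        # Fail loudly instead of silently dropping the block.
        if dstsize - len(dst) < 4 + len(block):
            raise OutputTooSmall('output too small')
        dst += struct.pack('>i', len(block)) + block
        srcpos += len(block)
    return bytes(dst)

try:
    encode_raw_blocks(bytes(17), dstsize=26, blocksize=16)
except OutputTooSmall as exc:
    print('raised:', exc)
```

Because the loop condition depends only on the input position, a full output buffer can no longer end the loop early and hide the missing second block.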

read_i4be on line 358 can read out of bounds if only 1 to 3 bytes are left after the first block.

This PR adds a check to fix this:

            if srcsize - srcpos < 4:
                raise Lz4h5Error('LZ4H5 data too short')
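
The same guard can be sketched in plain Python. `read_i4be_checked` is a hypothetical helper that mirrors the check above; it is not the Cython read_i4be used by the package.

```python
import struct

def read_i4be_checked(buf: bytes, pos: int) -> int:
    """Read a 4-byte big-endian signed int, refusing to read past the end."""
    if len(buf) - pos < 4:
        raise ValueError('LZ4H5 data too short')
    return struct.unpack_from('>i', buf, pos)[0]

print(read_i4be_checked(b'\x00\x00\x00\x10', 0))

try:
    # Only 3 bytes remain, so the read is rejected.
    read_i4be_checked(b'\x00\x00\x10', 0)
except ValueError as exc:
    print('raised:', exc)
```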

fix minor bugs in LZ4H5

e8aef01

cgohlke added the bug Something isn't working label Apr 11, 2025

This was referenced May 25, 2025

[LibLz4] Add LZ4 HDF5 JuliaIO/ChunkCodecs.jl#27

Merged

Add imagecodec LZ4HDF5 compat tests JuliaIO/ChunkCodecs.jl#48

Open
