Reading PNG format (deflate compression)

Lehm2000 · 2014-10-27T09:46:42

I'm writing my own image file loader as a personal exercise. I managed bitmap and targa just fine. I've moved onto png. I'm having a bit of trouble with the deflate compression. I've read through the spec( http://www.ietf.org/rfc/rfc1951.txt ). I understand the concepts involved but having a bit of trouble implementing them. I'll go through the steps I'm using to decode a test image and if someone could point out where I'm going wrong that would be great. I created a test image (4x4px, solid (255,192,64). The compressed result in the IDAT chunk is ( xÚbüßàÀ..L.H.7....ÿÿ..Y..Çr.è3 ) Starting the parsing... Bytes 0 and 1 are xÚ ( 0x78 and 0xDA ) which is the zlib info and exactly what they should be. We move onto the actual image data. Bytes 2 and 3 are bü ( 0x62 and 0xFC ) Deflate encodes using bits so the binary for those bytes are 0110 0010 1111 1100 The first bit ( 8th character) is 0 so that means this is not the final block. The next two bits ( 6th and 7th characters) are 01. That means that it uses fixed huffman code tree. Read the first code. Start with 7bits as that is the smallest code in the fixed tree. 0001100 . Huffman codes are stored in reverse order so that becomes 0011000. That code is not in the fixed tree so we add one more bit. 00110001. That one is in the tree. The lit value is 1. My understanding is that anything less than 256 is the actual value. So the first byte of data for my image should be 1. Except there is no 1 value in the test image (should be 255). So either I'm getting off track somewhere or I misinterupting the output. Any help is appreciated.

Nypyren

12,313

October 21, 2014 07:35 PM

Yes, like others say, "inside" the ZLIB stream is a PNG-filtered representation. The PNG filtered representation is an optimization step that PNG performs on the color data in order for DEFLATE to compress better than it would on the raw color data.


IDAT chunk { ZLIB { PNG Filters (each scanline starts with the filter ID that the scanline is using) { Color data } } }

2. Filter the image data according to the filtering method specified by the IHDR chunk. (Note that with filter method 0, the only one currently defined, this implies prepending a filter-type byte to each scanline.)

And the logic for encoding/decoding with the filters is here:

http://www.libpng.org/pub/png/spec/1.2/PNG-Filters.html

Filtering algorithms are applied to bytes, not to pixels, regardless of the bit depth or color type of the image. The filtering algorithms work on the byte sequence formed by a scanline that has been represented as described in Image layout. If the image includes an alpha channel, the alpha data is filtered in the same way as the image data."

For all filters, the bytes 'to the left of' the first pixel in a scanline must be treated as being zero. For filters that refer to the prior scanline, the entire prior scanline must be treated as being zeroes for the first scanline of an image (or of a pass of an interlaced image).

Unsigned arithmetic modulo 256 is used, so that both the inputs and outputs fit into bytes.

In other words, if you expect your final image data to start with the byte '255', and you have a scanline filter of 1 (sub), then your next byte should be a 1. (0 minus 1 -> modulo 256 -> 255) // NOTE: this is wrong and has been corrected (below)

popsoftheyear

2,195

October 21, 2014 08:01 PM

[deleted]

Sorry - hold on. I made the wrong conclusion from some old code that didn't match with the spec (the result was correct but my explanation in this post was not). Fixed it in the below post (it was supposed to be another edit but I messed that up too).

popsoftheyear

2,195

October 21, 2014 08:07 PM

In other words, if you expect your final image data to start with the byte '255', and you have a scanline filter of 1 (sub), then your next byte should be a 1. (0 minus 1 -> modulo 256 -> 255)

Not quite.

For the first byte, you can just write the raw value. The equation is (curr - prev), and x < 0 is defined as 0. So filtering with the PNG subtraction filter would give 255 - 0, mod 256, which is still 255. So byte 1 would be "1" for sub filter, and byte 2 would be "255".

Nypyren

12,313

October 22, 2014 12:52 AM

In other words, if you expect your final image data to start with the byte '255', and you have a scanline filter of 1 (sub), then your next byte should be a 1. (0 minus 1 -> modulo 256 -> 255)

Not quite.

For the first byte, you can just write the raw value. The equation is (curr - prev), and x < 0 is defined as 0. So filtering with the PNG subtraction filter would give 255 - 0, mod 256, which is still 255. So byte 1 would be "1" for sub filter, and byte 2 would be "255".

Oops, you're right. I read the encoding and decoding parts backwards. Decoding *adds* the previously decoded byte (zero if there is no previous value) with the current byte, so the first filtered byte using the Sub filter is effectively the same before/after encoding/decoding.

SyncViews

844

October 22, 2014 03:01 PM

So I assume you are not wanting to use zlib?

Id suggest you tackle this the same way I believe libpng does. Get a working DEFLATE first (i.e. implement your own version of the zlib decompression stuff, which you can also test easily against zlib on simple binary strings for correctness), then use that to implement the PNG decoding stuff.

Lehm2000

Author

124

October 27, 2014 09:46 AM

In other words, if you expect your final image data to start with the byte '255', and you have a scanline filter of 1 (sub), then your next byte should be a 1. (0 minus 1 -> modulo 256 -> 255)

Not quite.

For the first byte, you can just write the raw value. The equation is (curr - prev), and x < 0 is defined as 0. So filtering with the PNG subtraction filter would give 255 - 0, mod 256, which is still 255. So byte 1 would be "1" for sub filter, and byte 2 would be "255".

Spot on. Thanks for the help. I totally looked over the filter info. There was also an error in the code that returned 79 for the second byte. Once I fixed that I got 255 for the second byte.

Reading PNG format (deflate compression)

This topic is closed to new replies.

Popular Topics

Recommended Tutorials

Reading PNG format (deflate compression)

This topic is closed to new replies.

Popular Topics

Recommended Tutorials

Reticulating splines