Deflating a File

Chase Preuninger · Apr 17, 2008

One thing I just noticed was that when I deflated a file full or
random bytes it actually increased its size. Just thought that was
kind of neat.

Joshua Cranmer · Apr 17, 2008

Chase said:
One thing I just noticed was that when I deflated a file full or
random bytes it actually increased its size. Just thought that was
kind of neat.

Compression works by collapsing redundant sequences of information into
shorter bit strings, typically (though not always) through some sort of
dictionary system or run-length encoding. True random data cannot be
compressed most of the time; sufficiently pseudorandom data is also not
likely to be compressed well either.

Random data being compressed would be surprising.

Patricia Shanahan · Apr 17, 2008

Chase said:
One thing I just noticed was that when I deflated a file full or
random bytes it actually increased its size. Just thought that was
kind of neat.

The job of a compression algorithm is to map some class of files to
shorter files. Since there is a fixed number of possible files of each
length, to achieve that it also has to map other files to longer files.

Patricia

Roedy Green · Apr 17, 2008

One thing I just noticed was that when I deflated a file full or
random bytes it actually increased its size. Just thought that was
kind of neat.

That's to be expected. You add the overhead without getting ANYTHING
back. A random file by definition can't be compressed.

Zipping a mess of zips buys you a little not because the zip parts
compress, but because the filenames and other uncompressed parts
compress.

Owen Jacobson · Apr 18, 2008

Compression works by collapsing redundant sequences of information into
shorter bit strings, typically (though not always) through some sort of
dictionary system or run-length encoding. True random data cannot be
compressed most of the time; sufficiently pseudorandom data is also not
likely to be compressed well either.

Random data being compressed would be surprising.

*Pseudo*random data can, of course, be compressed to a description
(goedel number? source code? whatever) of the algorithm plus the
initial conditions used to generate the values. This suggests to me
that there might be a fixed (or almost fixed) amount of entropy in
PRNG output, regardless of how many digits of output you have.

Hmm.

-o

Andreas Leitgeb · Apr 18, 2008

Owen Jacobson said:
*Pseudo*random data can, of course, be compressed to a description
(goedel number? source code? whatever) of the algorithm plus the
initial conditions used to generate the values. This suggests to me
that there might be a fixed (or almost fixed) amount of entropy in
PRNG output, regardless of how many digits of output you have.

If it's say 1e1000000000000000000 digits from the PRNG-Output, it's
not all that "regardless", as you'll need some extra bytes to encode
the actual length of the sequence

Arne Vajhøj · Apr 19, 2008

Owen said:
*Pseudo*random data can, of course, be compressed to a description
(goedel number? source code? whatever) of the algorithm plus the
initial conditions used to generate the values. This suggests to me
that there might be a fixed (or almost fixed) amount of entropy in
PRNG output, regardless of how many digits of output you have.

True.

But current available compression algorithms does not analyze
data and detect the RNG algorithm. And I am pretty sure that
they will not in the future either.

So in practice the data does not compress.

Arne

How do I rename and copy a file on the server?	1	Nov 21, 2025
How do I fix Error 1028: Insufficient Memory in IBM Notes when opening a large NSF file?	0	Feb 19, 2026
What causes PST file corruption in Outlook?	0	Mar 30, 2026
Even basic math is at risk? Why is a simple math and logic solution being ignored?	2	Jul 3, 2025
AES-128 Clipboard Protector: Auto-Encrypt Ctrl+C, Smart-Decrypt Ctrl+V (C++ Windows Hook)	7	Mar 24, 2026
How to upload a compressed file (.gz) to the swift object storage using the Python swift client?	1	Jul 24, 2024
Why is my Outlook OST file growing so fast?	0	Apr 1, 2026
Executing a second python file with one of several options at a time	0	Nov 6, 2025

Deflating a File

Chase Preuninger

Joshua Cranmer

Patricia Shanahan

Roedy Green

Owen Jacobson

Andreas Leitgeb

Arne Vajhøj

Ask a Question

Similar Threads

Members online

Forum statistics

Latest Threads