Re: Bzip2 Compression

And, here's some more examples, where it is extremely data dependent...
It's also significantly slower.

ETA grib file   bzip2           gzip
12.208MB        6.97MB          8.78MB
                24.6seconds     6.7seconds

MMOUT D1 file   bzip2           gzip
266.7MB         187.4MB         180.3MB
                9.0 minutes     2.5 minutes (!)

Laurie Carson

On 2000.08.09, Phil Rasch wrote:
: 
: The compression gains must be extremely data dependent.
: 
: Here are the results for the compression of a dataset that consists of
: 32bit floats of a collection of many meteorological fields from the NCAR CCM3:
: 
: 35577060 bytes ENSANNAVG-0105.nc
: 26739972 bytes ENSANNAVG-0105.nc.bz2   
: 
: The compression is very small.
: 
: Phil
: 
: On Wed, Aug 09, 2000 at 08:03:50AM -0600, Roy Mendelssohn wrote:
: > There was some question as to how well Bzip2 would compress floating 
: > point data.  We just did a test on a large NetCDF file, containing a 
: > years worth of global, 1-degree six-hourly pressure fields, so that 
: > the data is floating point.  here are the sizes from a Unix ls 
: > command (i.e in 512 byte units).
: > 
: > raw.nc        380,551,616
: > 
: > raw.nc.Z      143,716,211
: > 
: > raw.nc.bz2     55,633,062
: > 
: > 
: > 
: > Bzip2 gave about the same 7:1 compression that Russ reported, and 
: > roughly 3 times better that Unix compress.
: > 
: 
: -- 
: ----------------------------------------
: Phil Rasch, Climate Modeling Section, National Center for Atmospheric Research
: P.O. Box 3000, Boulder CO 80307
: internet: pjr@xxxxxxxx, http://www.cgd.ucar.edu/cms/pjr
: Phone: 303-497-1368, FAX: 303-497-1324 

  • 2000 messages navigation, sorted by:
    1. Thread
    2. Subject
    3. Author
    4. Date
    5. ↑ Table Of Contents
  • Search the netcdfgroup archives: