[netcdfgroup] Confusing documentation on unlimited dimension and chunk-size

To: netcdf@xxxxxxxxxxxxxxxx
Subject: [netcdfgroup] Confusing documentation on unlimited dimension and chunk-size
From: Heiko Klein <Heiko.Klein@xxxxxx>
Date: Mon, 27 May 2013 15:08:06 +0200

Hi,

the netcdf user guide describes the new chunking possibilities inhttp://www.unidata.ucar.edu/software/netcdf/docs/netcdf.html#Default-Chunking


I think the paragraf:

'For unlimited dimensions, a chunk size of one is always used. Users areadvised to set chunk sizes for large data sets with one or moreunlimited dimensions, since a chunk size of one is quite inefficient.'

is very misleading, and opposed to 'In particular, the idea of using 1for the chunksize of an unlimited dimension works well if the data arebeing read a record at a time. Any other read access patterns willresult in slower performance.'

In netcdf-3, it was usual to use a record-base read access pattern whenan unlimited dimension was found. Advising users now to change the chunksize of the unlimited dimension to something different than 1 is in mostcases wrong and will give slower performance. I suggest a sentence like:

'For unlimited dimensions, a chunk size of one is always used. For largedatasets, where the size of limited dimensions is small compared to theunlimited dimensions, users are advised to avoid unlimited dimensions orto increase the chunk sizes of the unlimited dimensions. Be aware thatan unlimited dimension with chunksize != 1 will result in slowerperformance for record-oriented access patterns which where common innetcdf-3.'



Best regards,

Heiko

--
Dr. Heiko Klein                              Tel. + 47 22 96 32 58
Development Section / IT Department          Fax. + 47 22 69 63 55
Norwegian Meteorological Institute           http://www.met.no
P.O. Box 43 Blindern  0313 Oslo NORWAY

Follow-Ups:
- Re: [netcdfgroup] Confusing documentation on unlimited dimension and chunk-size
  - From: Russ Rew

2013 messages navigation, sorted by:
1. Thread
2. Subject
3. Author
4. Date
5. ↑ Table Of Contents
Search the netcdfgroup archives: