As part of the latest netcdf-java 2 library, I am working on handling
scale/offset and missing data attributes in a "standard" way. While the
netcdf manual has recommended standards, these are not always followed,
and I would like to know where the implementation rules below would fail
on existing datasets.
For example, in practice, valid_range seems to be in unpacked units
rather than packed. The manual is not that clear (to me) and I could
imagine it being used both ways.
---------------------------
public class VariableStandardized extends Variable
A "standardized" read-only Variable which implements:
1) packed data using scale_factor and add_offset
2) invalid data using valid_min, valid_max, valid_range, missing_value,
or _FillValue
if those "standard attributes" are present. If they are not present, it
acts just like the original Variable.
Implementation rules for scale/offset:
1) If scale_factor and/or add_offset variable attributes are present,
then this is a "packed" Variable.
2) the Variable element type is converted to double, unless the
scale_factor and add_offset attributes are both of type float, in
which case it is converted to float.
3) packed data is converted to unpacked data transparently during the
read() call (see the sketch after this list).
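
To make the arithmetic concrete, here is a small standalone sketch of
rules 2 and 3. The class and method names are my own for illustration,
not the actual library API:

    // A standalone sketch of the unpacking rules above, not the real
    // netcdf-java code.
    public class UnpackSketch {

      // rule 2: the result type is float only when both scale_factor
      // and add_offset are floats; otherwise it is double
      static boolean resultIsFloat(Number scaleFactor, Number addOffset) {
        return (scaleFactor instanceof Float) && (addOffset instanceof Float);
      }

      // rule 3: unpacked = packed * scale_factor + add_offset; a missing
      // attribute defaults to the identity (scale 1, offset 0)
      static double unpack(double packed, Number scaleFactor, Number addOffset) {
        double scale = (scaleFactor == null) ? 1.0 : scaleFactor.doubleValue();
        double offset = (addOffset == null) ? 0.0 : addOffset.doubleValue();
        return packed * scale + offset;
      }

      public static void main(String[] args) {
        // e.g. temperature packed as short, scale_factor=0.01, add_offset=273.15
        short packed = 1234;
        System.out.println(unpack(packed, 0.01, 273.15)); // prints ~285.49
      }
    }
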
Implementation rules for missing data:
1) if valid_range is present, the valid_min and valid_max attributes
are ignored. Otherwise, valid_min and/or valid_max are used to
construct a valid range.
2) a missing_value attribute may also specify a scalar or vector of
missing values.
3) if there is no missing_value attribute, the _FillValue attribute
can be used to specify a scalar missing value (these three rules are
sketched below).
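
Taken together, those three rules might combine like this; again the
names are hypothetical, not the library's:

    // A sketch of how the three missing-data rules combine; not the
    // actual netcdf-java implementation.
    public class MissingDataSketch {
      Double validMin, validMax;   // from valid_range, or valid_min/valid_max
      double[] missingValues = {}; // from missing_value (scalar or vector)
      Double fillValue;            // from _FillValue

      boolean isMissing(double val) {
        // rule 1: valid_range wins over valid_min/valid_max; either way
        // the result is an optional [validMin, validMax] interval
        if (validMin != null && val < validMin) return true;
        if (validMax != null && val > validMax) return true;
        // rule 2: missing_value may be a scalar or a vector of sentinels
        for (double mv : missingValues)
          if (val == mv) return true;
        // rule 3: _FillValue is consulted only when missing_value is absent
        return missingValues.length == 0 && fillValue != null && val == fillValue;
      }
    }
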
Implementation rules for missing data with scale/offset:
1) valid_range is always in the units of the converted (unpacked) data.
2) _FillValue and missing_value values are always in the units of the
raw (packed) data.
If hasMissingData() is true, then isMissingData(double val) is called
to determine whether a data value is missing. Note that the data is
converted and compared as a double (see the sketch below).
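
One consistent reading of these unit rules is that the packed sentinel
values get run through the same scale/offset conversion, so that
everything is ultimately compared in unpacked units. A sketch of that
reading, simplified to a scalar _FillValue and with names of my own
invention:

    // Sketch of the isMissingData() check under the unit rules above;
    // one possible reading, not the actual implementation.
    public class PackedMissingSketch {
      double scale = 1.0, offset = 0.0; // scale_factor, add_offset
      Double validMin, validMax;        // unpacked units (rule 1)
      Double packedFillValue;           // packed units (rule 2)

      double unpack(double packed) { return packed * scale + offset; }

      boolean isMissingData(double unpackedVal) {
        // valid_range is already in unpacked units, compare directly
        if (validMin != null && unpackedVal < validMin) return true;
        if (validMax != null && unpackedVal > validMax) return true;
        // _FillValue is in packed units: convert it with the same
        // scale/offset so both sides of the comparison are unpacked
        return packedFillValue != null
            && unpackedVal == unpack(packedFillValue);
      }
    }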