Re: [netcdfgroup] simultaneous NetCDF writes to same file

To: Ted Mansell <ted.mansell@xxxxxxxx>
Subject: Re: [netcdfgroup] simultaneous NetCDF writes to same file
From: Kristopher Bedka <kristopher.m.bedka@xxxxxxxx>
Date: Tue, 5 Jun 2012 12:27:23 -0400

When I say processor, I actually mean different machines. How wouldthe different machines know when another is operating on the file andhence, wait to write the output?


On Jun 5, 2012, at 12:06 PM, Ted Mansell wrote:

If you are only using 15 processors, I would suggest using a 'roundrobin' approach with non-parallel, chunked and compressed output.Essentially, each processor writes in succession to the file (open,write, close, next processor). This works really well for me forsmallish numbers of processors (less than 60, say). If the chunkingis set up such that each processor writes just its own chunks, thismethod works well.
good luck!

-- Ted


On Jun 5, 2012, at 10:23 AM, Kristopher Bedka wrote:
I'm not quite sure what you mean by the "application layer"? Mygoal was to have 15 different processors process 15 segments of asatellite orbit, where each processor would write to the sameNetCDF file in the most disk space efficient manner possiblewithout any problems with simultaneous NetCDF writes. I hadpreviously done the compression with the "nf_def_var_chunking"function call in non-parallel NetCDF. As this function does notseem to be available in parallel NetCDF, I'd be interested inalternative suggestions to accomplish my goal. Sorry I am more ofthe scientist type and am not a software engineer, so I may absorbsome of these concepts a little slower than others.
Thanks for the help,
Kris

On Jun 5, 2012, at 11:13 AM, Rob Latham wrote:
On Tue, May 29, 2012 at 02:29:12PM -0600, Russ Rew wrote:
Hi Kristopher,
I am processing a large volume of satellite data where multiple
processes could be simultaneously writing data to the same netcdf
file. This has not been supported in previous NetCDF versionsandI've gotten fatal errors when two simultaneous writesconflicted. I
now understand that recent NetCDF versions do support this
functionality. Could someone tell me or provide an example ofwhat I
need to do (i.e. new
function calls, options in netcdf open, etc...) to make thiswork forme? I've tried the pnetcdf package does not support chunkingwhich
I need to internally compress these files.
No, sorry, it's not supported in current netCDF versions either.
NetCDF-4 uses HDF5 as its storage layer, and HDF5 does not support
compression with parallel access, as explained here:
Is there any chance you can compress at the application layer?  Each
processor takes it's local hunk of data, compresses it, thenwrites to
the file.

I admit, you will quickly find out why parallel writes with
compression is not already implemented in these parallel I/O
libraries!

However, it's possible that at your application level, there may be
ways to simplify the parallel, compressed writes problem that a
general purpose library cannot use.

==rob

--
Rob Latham
Mathematics and Computer Science Division
Argonne National Lab, IL USA
=========================================================
Kristopher Bedka
Science Systems & Applications, Inc. @ NASA Langley Research Center
Climate Science Branch
1 Enterprise Parkway, Suite 200
Hampton, VA 23666
Phone:  (757) 951-1920
Fax: (757) 951-1902
Kristopher.m.bedka@xxxxxxxx
=========================================================






_______________________________________________
netcdfgroup mailing list
netcdfgroup@xxxxxxxxxxxxxxxx
For list information or to unsubscribe,  visit: 
http://www.unidata.ucar.edu/mailing_lists/


=========================================================
Kristopher Bedka
Science Systems & Applications, Inc. @ NASA Langley Research Center
Climate Science Branch
1 Enterprise Parkway, Suite 200
Hampton, VA 23666
Phone:  (757) 951-1920
Fax: (757) 951-1902
Kristopher.m.bedka@xxxxxxxx
=========================================================

Follow-Ups:
- Re: [netcdfgroup] simultaneous NetCDF writes to same file
  - From: Ted Mansell

References:
- [netcdfgroup] simultaneous NetCDF writes to same file
  - From: Kristopher Bedka
- Re: [netcdfgroup] simultaneous NetCDF writes to same file
  - From: Russ Rew
- Re: [netcdfgroup] simultaneous NetCDF writes to same file
  - From: Rob Latham
- Re: [netcdfgroup] simultaneous NetCDF writes to same file
  - From: Kristopher Bedka
- Re: [netcdfgroup] simultaneous NetCDF writes to same file
  - From: Ted Mansell

2012 messages navigation, sorted by:
1. Thread
2. Subject
3. Author
4. Date
5. ↑ Table Of Contents
Search the netcdfgroup archives: