NetCDF4 C++ API
Parallel I/O requires this software stack:
Build HDF5 with the following configure options:
Build netCDF with the following configure options:
Build the NetCDF4 C++ API library with the following macros defined:
Build your code with -DUSE_PARALLEL, add -DHAS_NETCDF_PAR_H if the header file "netcdf_par.h" is available, and link with the NetCDF4.parallel library (a sketch follows this list).
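The parallel entry points live in the netCDF C interface, which the C++ API sits on top of. The following is a minimal sketch of how the USE_PARALLEL and HAS_NETCDF_PAR_H macros typically gate the parallel code path; the file name, flags, and error handling are illustrative, and the project's own wrapper classes are bypassed here.

    // Minimal sketch: guard the parallel code path with the build macros and
    // fall back to serial I/O when they are not defined.
    #include <mpi.h>
    #include <netcdf.h>
    #ifdef HAS_NETCDF_PAR_H
    #include <netcdf_par.h>      // declares nc_create_par, NC_COLLECTIVE, ...
    #endif
    #include <cstdio>

    int main(int argc, char** argv) {
        MPI_Init(&argc, &argv);
        int ncid = -1;
        int status;

    #if defined(USE_PARALLEL) && defined(HAS_NETCDF_PAR_H)
        // Every rank opens the same file through MPI-IO.
        status = nc_create_par("example.nc", NC_NETCDF4 | NC_CLOBBER,
                               MPI_COMM_WORLD, MPI_INFO_NULL, &ncid);
    #else
        // Serial fallback; in a real application only rank 0 would do this.
        status = nc_create("example.nc", NC_NETCDF4 | NC_CLOBBER, &ncid);
    #endif
        if (status != NC_NOERR)
            std::fprintf(stderr, "netCDF error: %s\n", nc_strerror(status));
        else
            nc_close(ncid);

        MPI_Finalize();
        return 0;
    }

Compile and link with the MPI compiler wrappers (for example mpicxx) so that mpi.h and the MPI-enabled HDF5 and netCDF libraries are found.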
NetCDF version 4 uses the HDF5 library to handle the I/O and the underlying file structure: the netCDF file structure you define is implemented in terms of HDF5. For parallel I/O, HDF5 uses MPI-IO, which is defined in the MPI-2 standard; both Open MPI and MPICH2 provide this capability.
There are two modes of writing in parallel, independent and collective. These are described in the HDF5 documentation:
Independent IO means that each process can do IO independently. It should not depend on or be affected by other processes. Collective IO is a way of doing IO defined in the MPI-IO standard; contrary to independent IO, all processes must participate in doing IO.
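In the C interface underlying the C++ API, the mode is chosen per variable with nc_var_par_access. A brief sketch, assuming ncid and varid were obtained from nc_create_par and nc_def_var:

    // Sketch: select independent or collective access for one variable.
    #include <netcdf.h>
    #include <netcdf_par.h>   // nc_var_par_access, NC_COLLECTIVE, NC_INDEPENDENT

    int select_access_mode(int ncid, int varid, bool collective) {
        // NC_INDEPENDENT: each rank reads/writes on its own.
        // NC_COLLECTIVE:  every rank must take part in each access.
        return nc_var_par_access(ncid, varid,
                                 collective ? NC_COLLECTIVE : NC_INDEPENDENT);
    }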
When using parallel I/O, if a variable uses an UNLIMITED dimension, appending records must be done in collective mode. If the write is attempted in independent mode, the operation fails with a message such as "HDF Error" or "Unknown error".
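For example, each rank can write its own slice of one new record along the unlimited dimension, provided the variable has been switched to collective access first. A hedged sketch, assuming a 2-D record variable (the names, layout, and offsets are illustrative):

    // Sketch: append one record to a 2-D variable whose first dimension is
    // UNLIMITED. Growing the unlimited dimension requires collective access.
    #include <netcdf.h>
    #include <netcdf_par.h>

    int append_record(int ncid, int varid, size_t record,
                      size_t rank_offset, size_t local_count,
                      const double* local_data) {
        // Switch to collective mode before extending the record dimension;
        // in independent mode this write fails with "HDF Error"/"Unknown error".
        nc_var_par_access(ncid, varid, NC_COLLECTIVE);

        // One new record, with each rank writing its own slice of it.
        size_t start[2] = { record, rank_offset };
        size_t count[2] = { 1,      local_count };
        return nc_put_vara_double(ncid, varid, start, count, local_data);
    }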
Another drawback of collective I/O is that the elapsed time tends to increase, since all processes have to synchronize their write operations.
The solutions for this issue include:
Compression should not be used for variables that will be written in parallel; this is an HDF5 restriction, which the HDF5 documentation explains:
Compression uses chunking. Since chunks are preallocated in the file before writing, chunks have to be of the same size. However, the size of the compressed chunk is not known in advance.
Compression and chunking are discussed further at http://www.unidata.ucar.edu/software/netcdf/workshops/2010/nc4chunking/index.html and http://www.hdfgroup.org/HDF5/Tutor/compress.html
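For reference, this is how chunking and deflate compression are typically requested through the C interface; given the restriction above, it applies only to files written serially (the chunk sizes and compression level are illustrative):

    // Sketch: deflate compression is layered on top of chunked storage.
    // Call in define mode (before nc_enddef); serial files only, given the
    // parallel-write restriction described above.
    #include <netcdf.h>

    int enable_deflate(int ncid, int varid) {
        // Chunk sizes are illustrative; tune them to the access pattern.
        size_t chunks[2] = { 1, 1024 };
        int status = nc_def_var_chunking(ncid, varid, NC_CHUNKED, chunks);
        if (status != NC_NOERR) return status;

        // shuffle filter on, deflate on, compression level 4 (levels run 0-9).
        return nc_def_var_deflate(ncid, varid, /*shuffle=*/1,
                                  /*deflate=*/1, /*deflate_level=*/4);
    }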
The HDF5 documentation has a list of potential issues with parallel I/O performance: