From fda79322f90962d5384ac57a001be4612d79f005 Mon Sep 17 00:00:00 2001 From: Frank Baker Date: Wed, 11 Jul 2001 16:59:44 -0500 Subject: [svn-r4191] Purpose: New section -- "Freespace Management" Description: Added "Freespace Management" section. Minor formatting. Platforms tested: IE 5 --- doc/html/Performance.html | 102 ++++++++++++++++++++++++++++++++++++++++++---- 1 file changed, 95 insertions(+), 7 deletions(-) diff --git a/doc/html/Performance.html b/doc/html/Performance.html index f3dc0a7..079d991 100644 --- a/doc/html/Performance.html +++ b/doc/html/Performance.html @@ -58,12 +58,100 @@

2. Dataset Chunking

- Appropriate dataset chunking can make a siginificant difference - in HDF5 performance. This topic is discussed in - Dataset Chunking Issues elsewhere - in this User's Guide. - -

3. Use of the Pablo Instrumentation of HDF5

+ Appropriate dataset chunking can make a siginificant difference + in HDF5 performance. This topic is discussed in + Dataset Chunking Issues elsewhere + in this User's Guide. + +

3. Freespace Management

+ +

HDF5 does not yet manage freespace as effectively as it might. + While a file is opened, the library actively tracks and re-uses + freespace, i.e., space that is freed (or released) + during the run. + But the library does not yet manage freespace across the + closing and reopening of a file; when a file is closed, + all knowledge of available freespace is lost. + What was freespace becomes an unusable hole in the file. + +

There are several circumstances that can result in freespace + in an HDF5 file: +

+ +

As stated above, freespace is not managed across the + closing and reopening of an HDF5 file; file space that was + known freespace while the file remained open becomes an + inaccessible hole when the file is closed. + Thus, if a file is often closed and reopened, datasets + frequently rewritten, or groups and/or datasets frequently + added and deleted, that file can develop large numbers of + holes and grow unnecessarily large. This can, in turn, + seriously impair application or library performance + as the file ages. + +

An h5pack utility would enable packing + a file to remove the holes, but writing such a utility to + universally pack the file correctly is a complex task and the + HDF5 development team has not to date had the resources to + complete the task. + +

For application developers or researchers who find themselves + working with files that become bloated in this manner, there + are, at this time, two remedies: +

+ + +

+ 1 + + This is a problem only with compressed chunks. + The compression ratio of data is highly dependent on the data + itself; regardless of whether the size of the data + changes, the size of the compressed data change substantially + as the data changes. Uncompressed chunks do not vary in size, + so this issue does not arise. + + +

4. Use of the Pablo Instrumentation of HDF5

Pablo HDF5 Trace software provides a means of measuring the performance of programs using HDF5. @@ -147,7 +235,7 @@ -Last modified: 14 October 1999 +Last modified: 11 July 2001
-- cgit v0.12