HDF5 Image and Palette Specification
(Major revision: designated Version 1.1 of the Image and Palette Specification)
The HDF5 specification defines the standard objects and storage for the
standard HDF5 objects. (For information about the HDF5 library, model and
specification, see the HDF documentation.) This document is an additional
specification do define a standard profile for how to store image data
in HDF5. Image data in HDF5 is stored as HDF5 datasets with standard attributes
to define the properties of the image.
This specification defines:
-
Standard attributes for an Image dataset
-
Standard storage and attributes for Palettes
-
Standard for associating Palettes with Images.
1. HDF5 Image Specification
1.1 Overview
Image data is stored as an HDF5 dataset with values of HDF5 class Integer
or Float. A common example would be a two dimensional dataset, with
elements of type Integer. However, this specification does not limit
the dimensions or number type that may be used for an Image.
The dataset for an image is distinguished from other datasets by giving
it an attribute "CLASS=IMAGE". In addition, the Image dataset has
an attribute "PALETTE" that points to one or more palettes.
A Palette is ha HDF5 dataset. A Pallet dataset has an attribute
"an" and other attributes indicating the type and size of the palette.
A Palette is an independent object, which can be shared among several Image
datasets.
1.2 Image Attributes
-
Attribute name="CLASS"
-
This attribute is of type H5T_C_S1, with size 6.
-
For all Images, the value of this attribute is "IMAGE".
This attribute identifies this data set as intended to be interpreted as
an image that conforms to the specifications on this page.
-
Attribute name="IMAGE_SUBCLASS"
-
If present, the value of this attribute indicates the type of Palette that
should be used with the Image. This attribute is of type H5T_C_S1,
with size according to the string. The values are:
-
"IMAGE_GRAYSCALE"
-
A grayscale image, values 0-255 indicating brightness.
-
"IMAGE_BITMAP"
-
A "bit map" image (???)
-
"IMAGE_TRUECOLOR"
-
A truecolor image
-
"IMAGE_INDEXED"
-
An indexed image
Attribute name="IMAGE_WHITE_IS_ZERO"
-
This attribute is of type H5T_NATIVE_UCHAR. 0 = false, 1 = true .
This is used for images with IMAGE_SUBCLASS="IMAGE_GRAYSCALE" or "IMAGE_BITMAP".
-
Attribute name="IMAGE_MINMAXRANGE"
-
If present, this attribute is an array of two numbers, of the same HDF5
datatype as the data. The first element is the minimum value of the
data, and the second is the maximum. This is used for images with
IMAGE_SUBCLASS="IMAGE_GRAYSCALE", "IMAGE_BITMAP" or "IMAGE_INDEXED".
Attribute name="IMAGE_BACKGROUNDINDEX"
-
If set, this attribute indicates the index value that should be interpreted
as the "background color". This attribute is HDF5 type H5T_NATIVE_UINT.
Attribute name="IMAGE_TRANSPARENCY"
-
If set, this attribute indicates the index value that should be interpreted
as the "transparent color". This attribute is HDF5 type H5T_NATIVE_UINT.
This attribute may not be used for IMAGE_SUBCLASS="IMAGE_TRUE_COLOR".
Attribute name="IMAGE_ASPECTRATIO"
-
If set, this attribute indicates the aspect ratio. This attribute
is HDF5 type H5T_NATIVE_UINT.
Attribute name="IMAGE_COLORMODEL"
-
If set, this attribute indicates the color model of Palette that should
be used with the Image. This attribute is of type H5T_C_S1, with
size 4,5, or 6. The value is one of the color models described in
the Palette specification below. This attribute may be used only
for IMAGE_SUBCLASS="IMAGE_TRUECOLOR" or "IMAGE_INDEXED".
Attribute name="IMAGE_GAMMACORRECTION"
-
If set, this attribute gives the Gamma correction. The type is H5T_NATIVE_FLOAT.
This attribute may be used only for IMAGE_SUBCLASS="IMAGE_TRUECOLOR" or
"IMAGE_INDEXED".
Attribute name="PALETTE"
-
A dataset within an HDF5 file may optionally specify an array of palettes
to be viewed with. The dataset will have an attribute field called "PALETTE"
which contains an array of object reference pointers which refer to palettes
in the file. The first palette in this array will be the default palette
that the data may be viewed with.
Attribute name="IMAGE_VERSION"
-
This attribute is of type H5T_C_S1, with size corresponding to the length
of the version string. This attribute identifies the version number
of this specification to which it conforms. The initial version number
is "1.0".
Attributes of an Image Dataset
Attribute Name |
|
Type |
Value |
CLASS |
R |
String |
"IMAGE" |
PALETTE |
O |
Array of Object References |
<references to Palette datasets>1 |
IMAGE_SUBCLASS |
O2 |
String |
-
"IMAGE_GRAYSCALE",
-
"IMAGE_BITMAP",
-
"IMAGE_TRUECOLOR",
-
"IMAGE_INDEXED"
|
PALETTE |
O |
Array of Object Reference |
<references to Palette datasets> |
IMAGE_WHITE_IS_ZERO |
O3,4 |
Unsigned Integer |
0 = false, 1 = true |
IMAGE_MINMAXRANGE |
O3,5 |
Unsigned Integer[2] |
The (<minimum>, <maximum>) value of the data. |
IMAGE_BACKGROUNDINDEX |
O3 |
Unsigned Integer |
The index of the background color. |
IMAGE_TRANSPARENCY |
O3,5 |
Unsigned Integer |
The index of the transparent color. |
IMAGE_ASPECTRATIO |
O3,4 |
Unsigned Integer |
The aspect ratio. |
IMAGE_COLORMODEL |
O3,6 |
String |
The color model, as defined below in the Palette specification for
attribute PAL_COLORMODEL. |
IMAGE_GAMMACORRECTION |
O3,6 |
Float |
The gamma correction. |
IMAGE_VERSION |
R |
String |
<specification version number> |
1. The first element of the array is the default
Palette.
2. This attribute is required for images
that are converted from one of the standard types listed.
3. This attribute is required if set for images
converted from the applicable standard type.
4. This applies to gray scale images: IMAGE_SUBCLASS="IMAGE_GRAYSCALE"
or "IMAGE_BITMAP".
5. This applies to gray scale images: IMAGE_SUBCLASS="IMAGE_GRAYSCALE",
"IMAGE_BITMAP", or "IMAGE_INDEXED".
5. This applies to gray scale images: IMAGE_SUBCLASS="IMAGE_TRUECOLOR",
or "IMAGE_INDEXED".
1.3 Storage Properties for Images
Image datasets may be stored with any chunking or compression properties
supported by HDF5.
2. HDF5 Palette Specification
2.1 Overview
A palette is the means by which color is applied to an image and is also
referred to as a color lookup table. It is a table in which every row contains
the numerical representation of a particular color. In the example of an
8-bit standard RGB color model palette, this numerical representation of
a color is presented as a triplet specifying the intensity of red, green,
and blue components that make up each color.
In this example, the color component numeric type is an 8-bit unsigned
integer. While this is most common and recommended for general use, other
component color numeric datatypes, such as a 16-bit unsigned integer ,
may be used. This type is specified as the type attribute of the palette
dataset. (see H5Tget_type(), H5Tset_type())
The minimum and maximum values of the component color numeric are specified
as attribute of the palette dataset. See below (attribute PAL_MINMAXNUMERIC).
If these attributes do not exist, it is assumed that the range of values
will fill the space of the color numeric type. i.e. with an 8-bit unsigned
integer, the valid range would be 0 to 255 for each color component.
The HDF5 palette specification additionally allows for color models
beyond RGB. YUV, HSV, CMY, CMYK, YCbCr color models are supported, and
may be specified as a color model attribute of the palette dataset. (see
"Palette Attributes" for details).
In HDF 4 and earlier, palettes were limited to 256 colors. The HDF5
palette specification allows for palettes of varying length. The length
is specified as the number of rows of the palette dataset.
In a standard palette, the color entries are indexed directly. HDF5
supports the notion of a range index table. Such a table defines an ascending
ordered of ranges that map dataset values to the palette. If a range index
table exists for the palette, the PAL_TYPE attribute will be set to "RANGEINDEX",
and the PAL_RANGEINDEX attribute will contain an object reference to a
range index table array. If not, the PAL_TYPE attribute either does not
exist, or will be set to "STANDARD".
The range index table array consists of a one dimensional array with
the same length as the palette dataset - 1. Ideally, the range index would
be of the same type as the dataset it refers to, however this is not a
requirement.
Example 2: A range index array of type floating point
The range index array attribute defines the "to" of the range.
Notice that the range index array attribute is one less entry in size than
the palette. The first entry of 0.1259, specifies that all values below
and up to 0.1259 inclusive, will map to the first palette entry. The second
entry signifies that all values greater than 0.1259 up to 0.3278 inclusive,
will map to the second palette entry, etc. All value greater than the last
range index array attribute (100000) map to the last entry in the palette.
2.2. Palette Attributes
A palette exists in an HDF file as an independent data set with accompanying
attributes.
These attributes are defined as follows:
-
Attribute name="CLASS"
-
This attribute is of type H5T_C_S1, with size 8.
-
For all palettes, the value of this attribute is "PALETTE". This attribute
identifies this palette data set as a palette that conforms to the specifications
on this page.
-
Attribute name="PAL_COLORMODEL"
-
This attribute is of type H5T_C_S1, with size 4, 5, or 6.
-
Possible values for this are "RGB", "YUV", "CMY", "CMYK", "YCbCr", "HSV".
-
This defines the color model that the entries in the palette data set represent.
-
"RGB"
-
Each color index contains a triplet where the the first value defines the
red component, second defines the green component, and the third the blue
component.
-
"CMY"
-
Each color index contains a triplet where the the first value defines the
cyan component, second defines the magenta component, and the third the
yellow component.
-
"CMYK"
-
Each color index contains a quadruplet where the the first value defines
the cyan component, second defines the magenta component, the third the
yellow component, and the forth the black component.
-
"YCbCr"
-
Class Y encoding model. Each color index contains a triplet where the the
first value defines the luminance, second defines the Cb Chromonance, and
the third the Cr Chromonance.
-
"YUV"
-
Composite encoding color model. Each color index contains a triplet where
the the first value defines the luminance component, second defines the
chromonance component, and the third the value component.
-
"HSV"
-
Each color index contains a triplet where the the first value defines the
hue component, second defines the saturation component, and the third the
value component. The hue component defines the hue spectrum with a low
value representing magenta/red progressing to a high value which would
represent blue/magenta, passing through yellow, green, cyan. A low value
for the saturation component means less color saturation than a high value.
A low value for value will be darker than a high value.
-
Attribute name="PAL_TYPE"
-
This attribute is of type H5T_C_S1, with size 10 or 11.
The current supported values for this attribute are : "STANDARD8" or
"RANGEINDEX"
A PAL_TYPE of "STANDARD8" defines a palette dataset such that the first
entry defines index 0, the second entry defines index 1, etc. up until
the length of the palette - 1. This assumes an image dataset with direct
indexes into the palette.
If the PAL_TYPE is set to "RANGEINDEX", there will be an additional
attribute with a name of "PAL_RANGEINDEX", (See example 2
for more details)
-
Attribute name="PAL_RANGEINDEX"
-
The PAL_RANGEINDEX attribute contains an HDF object reference pointer
which specifies a range index array in the file to be used for color lookups
for the palette. (Only for PAL_TYPE="RANGEINDEX"
-
Attribute name="PAL_MINMAXNUMERIC"
-
If present, this attribute is an array of two numbers, of the same HDF5
datatype as the palette elements or color numerics.
They specify the minimum and maximum values of the color numeric components.
For example, if the palette was an RGB of type Float, the color numeric
range for Red, Green, and Blue could be set to be between 0.0 and 1.0.
The intensity of the color guns would then be scaled accordingly to be
between this minimum and maximum attribute.
Attribute name="PAL_VERSION"
This attribute is of type H5T_C_S1, with size corresponding to the
length of the version string. This attribute identifies the version
number of this specification to which it conforms. The initial version
number is "1.0".
Attributes of a Palette Dataset
Attribute Name |
|
Type |
Value |
CLASS |
R |
String |
"PALETTE" |
PAL_COLORMODEL |
R |
String |
Color Model: "RGB", YUV", "CMY", "CMYK", "YCbCr", or "HSV" |
PAL_TYPE |
R |
String |
"STANDARD8" or "RANGEINDEX" |
RANGE_INDEX |
R1 |
Object Reference |
<Object Reference to Dataset of range index values> |
PAL_MINMAXNUMERIC |
O |
Array[2] of <same datatype as palette> |
The first value is the <Minimum value for color values>, the second
value is <Maximum value for color values>2 |
PAL_VERSION |
R |
String |
<specification version number> |
1. The RANGE_INDEX attribute is required if the
PAL_TYPE is "RANGEINDEX". Otherwise, the RANGE_INDEX attribute should
be omitted.
2. The minimum and maximum are optional.
If not set, the range is assumed to the maximum range of the number type.
If one of these attributes is set, then both should be set. The value
of the minumum must be less than or equal to the value of the maximum.
2.3. Storage Layout for Palettes
The values of the Palette are stored as a dataset. The datatype can
be any HDF 5 atomic numeric type. The dataset will have dimensions
(nentries by ncomponents), where 'nentries'
is the number of colors (usually 256) and 'ncomponents' is the
number of values per color (3 for RGB, 4 for CMYK, etc.)
3. Consistency and Correlation of Image and Palette Attributes
The objects in this specification are an extension to the base HDF5 specification
and library. They are accessible with the standard HDF5 library,
but the semantics of the objects are not enforced by the base library.
For example, it is perfectly possible to add an attribute called IMAGE
to any dataset, or to include an object reference to any
HDF5 dataset as a "palette" in a PALETTE attribute. this would
be a perfectly valid HDF5 file, but not conformant to this specification.
The rules defined in this specification must be implemented with appropriate
software, and applications must use conforming software to assure correctness.
The Image and Palette specifications include several mutually redundant
standard attributes, such as the IMAGE_COLORMODEL and the PAL_COLORMODEL.
These attributes are informative not normative, in that it is acceptable
to attach a Palette to an Image dataset even if their attributes do not
match. Software is not required to enforce consistency, and files
may contain mismatched associations of Images and Palettes. In all
cases, it is up to applications to determine what kinds of images and color
models can be supported.
For example, an Image that was created from a file with an "RGB" may
have a "YUV" Palette in its PALETTE attribute array. This
would be a legal HDF5 file and also a conforms to this specification, although
it may or may not be correct or useable for a given application.
HDF Help Desk
Last modified: 2 June 2000
Describes HDF5 Release 1.4 Beta, December 2000