Hyperspectral data¶

The Hypervision SDK is built for processing hyperspectral image data. Therefore, it is important to understand some properties of this data before looking at the SDK itself, in particular, what hyperspectral images are and how they differ from regular images.

See also

This guide covers how hyperspectral images are represented as data and how manipulating them relates to and differs from regular grayscale or RGB images. For a detailed introduction to hyperspectral imaging, refer to the detailed guide on our main documentation page.

A hyperspectral image is like a regular RGB image but with many more color channels, usually hundreds. It is therefore convenient and appropriate to understand hyperspectral images as three-dimensional arrays. While RGB images are also three-dimensional, we usually don’t think of them as such because they visually appear two-dimensional and we mostly operate on them from a spatial perspective. Hyperspectral images, on the other hand, operate on both spatial and spectral dimensions, depending on what kind of information we wish to extract from them. Due to their inherent three-dimensional structure, hyperspectral images are often referred to as data cubes.

A hyperspectral image \(H\) is an \(L\times S\times B\) array, where \(L\) is the spatial height (denoted lines), \(S\) is the spatial width (denoted samples), and \(B\) is the spectral depth (denoted bands).

Memory layout¶

With regular RGB images, we rarely think of the memory representation because the low number of channels results in relatively contiguous memory no matter what is operated on. This is not the case for hyperspectral images. As shown above, operating on either a spatial slice or individual spectra may require reading memory locations spaced far apart. Additionally, apart from snapshot hyperspectral cameras, the raw camera data is naturally formatted as planar slices of a full data cube. Depending on what type of camera is used, the natural memory format will be different.

In HSI, we operate with three memory orderings: BSQ (band sequential), BIL (band interleaved by line), and BIP (band interleaved by pixel). The ordering of a particular data cube is referred to as its interleave. BSQ is like a stack of grayscale images and BIP is similar to how RGB is stored in many file formats (for example PNG), with the channels of each pixel being contiguous. BIL is in between. When using the size notation for three-dimensional arrays, the format sizes can be described as:

\[\begin{split}BSQ = B\times L\times S\\ BIL = L\times B\times S\\ BIP = L\times S\times B\end{split}\]

Interleave conversion¶

Converting between the formats is simple but can get confusing. The diagram below illustrates how one transforms from each of the formats to any other.