mirror of
https://github.com/p2p-ld/docs.git
synced 2024-11-10 00:34:30 +00:00
15 lines
No EOL
793 B
Text
15 lines
No EOL
793 B
Text
# HDF5
|
|
|
|
We are starting with hdf5 because our initial test case is the [NWB](https://www.nwb.org/) format for neurophysiology data. This is a challenging initial test case because the data is heterogeneous, large, and the specification is written in an idiosyncratic format specification.
|
|
|
|
HDF has three primary types of objects:
|
|
- Groups
|
|
- Datasets - contain the raw values in the file
|
|
- Attributes - metadata about groups or datasets.
|
|
|
|
Datasets have additional properties:
|
|
- Datatypes: binary representation of the data
|
|
- Dataspaces: Layout of individual data elements
|
|
- Properties: Additional information about the representation of the dataset, eg. chunked or contiguous
|
|
|
|
These map naturally onto triplets, where each group or dataset is a subject, attributes are properties and objects. |