Package dev.hardwood.metadata
package dev.hardwood.metadata
Low-level Parquet file metadata types that mirror the Thrift definitions.
These types provide direct access to the metadata structures stored in
a Parquet file footer. For a higher-level schema representation with
computed definition and repetition levels, see
dev.hardwood.schema.
- See Also:
-
ClassDescriptionColumn chunk metadata.Column index for a column chunk, providing per-page min/max statistics for page-level filtering.Ordering of min/max values across pages.Metadata for a column chunk.Compression codecs supported by Parquet.Legacy converted types in Parquet schema (used by PyArrow for LIST/MAP annotation).Encoding types for Parquet data.Path from the root schema to a leaf column, represented as a list of field names.Top-level file metadata for a Parquet file.Logical types that provide semantic meaning to physical types.BSON document stored as a byte array.Calendar date (days since Unix epoch).Decimal with fixed scale and precision.Enum stored as a UTF-8 string.Interval stored as a 12-byte fixed-length byte array (months, days, millis).Integer type with a specific bit width and signedness.JSON document stored as a UTF-8 string.List (repeated element) logical type.Map (key-value pairs) logical type.UTF-8 encoded string.Timestamp with configurable precision and UTC adjustment.Time of day with configurable precision and UTC adjustment.Resolution of time and timestamp logical types.UUID stored as a 16-byte fixed-length byte array.Offset index for a column chunk, providing page locations for direct lookup.Location of a data page within a column chunk.Physical types supported by Parquet format.Field repetition types in Parquet schema.Row group metadata.Schema element in Parquet file metadata.Column chunk statistics for min/max values, null count, and distinct count.