Package dev.hardwood.reader
package dev.hardwood.reader
Parquet file readers with row-oriented and column-oriented APIs.
ParquetFileReader opens a single file and provides access to metadata and
schema. From there, create a RowReader for row-at-a-time access with typed
getters, or a ColumnReader for batch-oriented columnar access with
primitive arrays. FilterPredicate enables predicate pushdown at both the
row-group and page level.
For reading multiple files as a single dataset, use MultiFileParquetReader
via Hardwood.
-
ClassDescriptionBatch-oriented column reader for reading a single column across all row groups.A predicate for filtering row groups based on column statistics.Predicate for DATE columns.Predicate for DECIMAL columns.Predicate for TIMESTAMP columns.Predicate that matches rows where the column value is not null.Predicate that matches rows where the column value is null.Predicate for
FIXED_LEN_BYTE_ARRAYcolumns that require signed (two's complement) comparison, such as decimals.Predicate for TIME columns.Holds multipleColumnReaderinstances backed by a sharedFileManagerfor cross-file prefetching across multiple Parquet files.Entry point for reading multiple Parquet files with cross-file prefetching.A RowReader that reads across multiple Parquet files with automatic file prefetching.Reader for individual Parquet files.Provides row-oriented iteration over a Parquet file.