All Classes and Interfaces
Class
Description
Factory for creating
AvroRowReader instances from a
ParquetFileReader.Reads Parquet rows as Avro
GenericRecord instances.JFR event emitted when the consumer thread blocks waiting for a batch
to be assembled by the assembly thread.
Column chunk metadata.
Column index for a column chunk, providing per-page min/max statistics for page-level filtering.
Ordering of min/max values across pages.
Metadata for a column chunk.
Specifies which columns to read from a Parquet file.
Batch-oriented column reader for reading a single column across all row groups.
Represents a primitive column in a Parquet schema.
Compression codecs supported by Parquet.
Legacy converted types in Parquet schema (used by PyArrow for LIST/MAP annotation).
Encoding types for Parquet data.
Path from the root schema to a leaf column, represented as a list of field names.
JFR event emitted when Hardwood memory-maps a region of a Parquet file.
Top-level file metadata for a Parquet file.
JFR event emitted when a Parquet file is opened and its metadata is read.
Root schema container representing the complete Parquet schema.
A predicate for filtering row groups based on column statistics.
Predicate for DATE columns.
Predicate for DECIMAL columns.
Predicate for TIMESTAMP columns.
Predicate that matches rows where the column value is not null.
Predicate that matches rows where the column value is null.
Predicate for
FIXED_LEN_BYTE_ARRAY columns that require signed (two's complement)
comparison, such as decimals.Predicate for TIME columns.
Entry point for reading Parquet files with a shared thread pool.
Context object that manages shared resources for Parquet file reading.
Abstraction for reading Parquet file data.
Logical types that provide semantic meaning to physical types.
BSON document stored as a byte array.
Calendar date (days since Unix epoch).
Decimal with fixed scale and precision.
Enum stored as a UTF-8 string.
Interval stored as a 12-byte fixed-length byte array (months, days, millis).
Integer type with a specific bit width and signedness.
JSON document stored as a UTF-8 string.
List (repeated element) logical type.
Map (key-value pairs) logical type.
UTF-8 encoded string.
Timestamp with configurable precision and UTC adjustment.
Time of day with configurable precision and UTC adjustment.
Resolution of time and timestamp logical types.
UUID stored as a 16-byte fixed-length byte array.
Holds multiple
ColumnReader instances backed by a shared FileManager
for cross-file prefetching across multiple Parquet files.Entry point for reading multiple Parquet files with cross-file prefetching.
A RowReader that reads across multiple Parquet files with automatic file prefetching.
Offset index for a column chunk, providing page locations for direct lookup.
JFR event emitted when a single Parquet data page is decoded.
JFR event emitted when pages are filtered by Column Index predicate push-down.
Location of a data page within a column chunk.
Reader for individual Parquet files.
Physical types supported by Parquet format.
Specialized list interface for DOUBLE values.
Specialized list interface for INT32 values.
Type-safe list interface for reading Parquet list values.
Specialized list interface for INT64 values.
Type-safe interface for accessing Parquet MAP values.
A single key-value entry in a map.
Type-safe struct interface for reading nested Parquet data.
JFR event emitted when the prefetch queue cannot supply a decoded page in time.
Represents a projected view of a Parquet schema containing only selected columns.
JFR event emitted when records are filtered by record-level predicate evaluation.
Field repetition types in Parquet schema.
Row group metadata.
JFR event emitted when row groups are filtered by a predicate push-down filter.
JFR event emitted when all pages in a row group column chunk have been scanned.
Provides row-oriented iteration over a Parquet file.
Long-term or temporary AWS credentials.
Supplies
S3Credentials for signing S3 requests.InputFile backed by an object in Amazon S3 (or an S3-compatible service).A configured connection to an S3-compatible object store.
Builder for
S3Source.Schema element in Parquet file metadata.
Tree-based representation of Parquet schema for nested data support.
Group node representing a struct, list, or map.
Primitive leaf node representing an actual data column.
Bridges the AWS SDK credential chain to Hardwood's types.
Column chunk statistics for min/max values, null count, and distinct count.
Common interface for name-based field access, shared by both top-level row readers
and nested struct values.