Release Notes¶
See GitHub Releases for the full changelog.
1.0.0.Beta1¶
- S3 and remote object store support with coalesced reads (#31)
- CLI tool for inspecting and querying Parquet files (#38)
- Avro
GenericRecordsupport via thehardwood-avromodule (#131) - Row group filtering with predicate push-down (#59)
- Page-level column index filtering (#118)
InputFileabstraction for pluggable file sources (#98)FieldPathfor unambiguous column lookup (#59)- Page CRC verification (#76)
- Key/value metadata access (#135)
- S3 support and filtering in the parquet-java compatibility layer (#123)
- Project documentation site (#109)
1.0.0.Alpha1¶
- Zero-dependency Parquet file reader for Java
- Row-oriented and columnar read APIs
- Support for flat and nested schemas (lists, maps, structs)
- All standard encodings (RLE, DELTA_BINARY_PACKED, DELTA_BYTE_ARRAY, BYTE_STREAM_SPLIT, etc.)
- Compression: Snappy, ZSTD, LZ4, GZIP, Brotli
- Projection push-down
- Parallel page pre-fetching and eager batch assembly
- Memory-mapped file I/O
- JFR events for observability
- Multi-file reader
- BOM for dependency management
- parquet-java compatibility layer
- Optional Vector API acceleration on Java 22+