Class ParquetFileReader
java.lang.Object
dev.hardwood.reader.ParquetFileReader
- All Implemented Interfaces:
AutoCloseable
Reader for individual Parquet files.
For single-file usage:
try (ParquetFileReader reader = ParquetFileReader.open(path)) {
RowReader rows = reader.createRowReader();
// ...
}
For multi-file usage with shared thread pool, use Hardwood.
-
Method Summary
Modifier and TypeMethodDescriptionvoidclose()createColumnReader(int columnIndex) Create a ColumnReader for a column by index, spanning all row groups.createColumnReader(String columnName) Create a ColumnReader for a named column, spanning all row groups.Create a RowReader that iterates over all rows in all row groups.createRowReader(ColumnProjection projection) Create a RowReader that iterates over selected columns in all row groups.static ParquetFileReaderOpen a Parquet file with a dedicated context.static ParquetFileReaderopen(Path path, HardwoodContext context) Open a Parquet file with a shared context.
-
Method Details
-
open
Open a Parquet file with a dedicated context. The context is closed when this reader is closed.- Throws:
IOException
-
open
Open a Parquet file with a shared context. The context is NOT closed when this reader is closed.- Throws:
IOException
-
getFileMetaData
-
getFileSchema
-
createColumnReader
Create a ColumnReader for a named column, spanning all row groups. -
createColumnReader
Create a ColumnReader for a column by index, spanning all row groups. -
createRowReader
Create a RowReader that iterates over all rows in all row groups. -
createRowReader
Create a RowReader that iterates over selected columns in all row groups.- Parameters:
projection- specifies which columns to read- Returns:
- a RowReader for the selected columns
-
close
- Specified by:
closein interfaceAutoCloseable- Throws:
IOException
-