Skip to main content
Version: Next

read_parquet

Reads events from a Parquet byte stream.

read_parquet

Description

Reads events from a Parquet byte stream.

Apache Parquet is a columnar storage format that a variety of data tools support.

MMAP Parsing

When using theis with the load_file operator, we recommend passing the mmap=true option to load_file to give the parser full control over the reads, which leads to better performance and memory usage.

Limitation

Tenzir currently assumes that all Parquet files use metadata recognized by Tenzir. We plan to lift this restriction in the future.

Examples

Read a Parquet file:

load_file "/tmp/data.prq", mmap=true
read_parquet