Stream Parquet File Reader
Learn more about the Stream Parquet File Reader connector and how to use it in the Digibee Integration Platform.
The Stream Parquet File Reader connector allows you to read Parquet files triggering subpipelines to process each message individually. This connector should be used for large files.
Parquet is a columnar file format designed for efficient data storage and retrieval. For more information, see the official website.
Parameters
Take a look at the configuration parameters of the connector. Parameters supported by Double Braces expressions are marked with (DB)
.
General Tab
Parameter | Description | Default value | Data type |
---|---|---|---|
File Name | The file name of the Parquet file to be read. | {{ message.fileName }} | String |
Parallel Execution | Occurs in parallel with loop execution. | false | Boolean |
Fail On Error | If the option is active, the execution of the pipeline with an error will be interrupted. Otherwise, the pipeline execution proceeds, but the result will show a false value for the “success” property. | false | Boolean |
Documentation Tab
Parameter | Description | Default value | Data type |
---|---|---|---|
Documentation | Section for documenting any necessary information about the connector configuration and business rules. | N/A | String |
A compressed Parquet file generates JSON content larger than the file itself when it is read. It is important that you checj whether the pipeline has enough memory to handle the data, as it will be stored in the pipeline's memory.
Usage examples
Reading Parquet file
File Name: file.parquet
Parallel: deactivated
Output:
If the lines have been processed correctly, their respective subpipelines return { "success": true }
for each individual line.
Last updated