Stream Parquet File Reader

Learn more about the Stream Parquet File Reader connector and how to use it in the Digibee Integration Platform.

The Stream Parquet File Reader connector allows you to read Parquet files, triggering a subpipeline to process each message individually. This connector is recommended for large files.

Parquet is a columnar file format designed for efficient data storage and retrieval. For more information, see the official Apache Parquet website.

Parameters

Configure the connector using the parameters below. Some fields support Double Braces expressions.

| Parameter | Description | Type | Default |
| --- | --- | --- | --- |
| Alias | Name (alias) for this connector's output, allowing you to reference it later in the flow using Double Braces expressions. | String | stream-parquet-reader-1 |
| File Name | The name of the Parquet file to be read. | String | {{ message.fileName }} |
| Parallel Execution | If enabled, subpipelines are executed in parallel instead of sequentially, one per message. | Boolean | false |
| Convert Date Fields | If enabled, DATE/TIMESTAMP fields from the file are converted to string format (yyyy-MM-dd for DATE, ISO-8601 for TIMESTAMP). If disabled, dates remain as numeric values (days or milliseconds since the epoch). | Boolean | false |
| Date Field Paths (optional) | Manually indicates date fields when the schema does not declare the DATE logical type. | String | N/A |
| Decode Base64 Fields | If enabled, the connector recursively scans the output JSON nodes. Any string identified as a valid Base64 sequence is automatically decoded to UTF-8 and replaced in place. | Boolean | false |
| Fail On Error | If enabled, pipeline execution is interrupted when an error occurs. Otherwise, execution proceeds, but the "success" property is set to false. | Boolean | false |
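The Convert Date Fields behavior described in the table can be sketched as follows, assuming Parquet's standard physical encodings (DATE as days since 1970-01-01, TIMESTAMP as milliseconds since the epoch); the function names are illustrative, not part of the connector:

```python
from datetime import date, datetime, timedelta, timezone

# Sketch of the Convert Date Fields conversion: numeric Parquet
# DATE/TIMESTAMP values become human-readable strings.
def convert_date(days_since_epoch: int) -> str:
    # DATE: days since 1970-01-01 -> "yyyy-MM-dd"
    return (date(1970, 1, 1) + timedelta(days=days_since_epoch)).isoformat()

def convert_timestamp(millis_since_epoch: int) -> str:
    # TIMESTAMP: milliseconds since the epoch -> ISO-8601 string
    return datetime.fromtimestamp(
        millis_since_epoch / 1000, tz=timezone.utc
    ).isoformat()

print(convert_date(19723))    # 2024-01-01
print(convert_timestamp(0))   # 1970-01-01T00:00:00+00:00
```

With the option disabled, the raw integers (19723, 0) would appear in the output JSON instead.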

Note: a compressed Parquet file generates JSON content larger than the file itself when it is read. Check whether the pipeline has enough memory to handle the data, as it will be stored in the pipeline's memory.

Usage examples

Reading Parquet file

  • File Name: file.parquet

  • Parallel Execution: deactivated

Output:

If the lines are processed correctly, the respective subpipelines return { "success": true } for each individual line.
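The per-message contract above can be sketched as follows, with a hypothetical process_message function standing in for the subpipeline (the name and body are illustrative, not part of the Platform):

```python
# Each Parquet row becomes one message; the subpipeline processes it
# and produces a {"success": true/false} result per line.
def process_message(message: dict) -> dict:
    try:
        # ... business logic for one row would go here ...
        return {"success": True}
    except Exception:
        return {"success": False}

rows = [{"id": i} for i in range(3)]
results = [process_message(row) for row in rows]
print(results)  # one {'success': True} per processed line
```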
