AWS Athena

Discover more about the AWS Athena connector and how to use it on the Digibee Integration Platform.

Access the AWS Athena service using AWS SDK version 2.x.

Parameters

Take a look at the configuration options for the component. Parameters supported by Double Braces expressions are marked with (DB).

Required Tab

Parameter | Description | Default value | Data type

Dynamic Account

When activated, the connector will use the account dynamically. When deactivated, it’ll use the account statically.

false

boolean

Account Name DB

The name of the account that is generated dynamically via the Store Account connector.

N/A

Account

Account

Sets the account to be used by the connector.

N/A

string

Label DB

Logical name

N/A

string

Fail On Error

If true, the pipeline stops with an error. If false, the pipeline continues, but the output includes a success property set to false.

false

boolean
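
When Fail On Error is false, the pipeline keeps running, so a later step has to inspect the success property itself. The following is a minimal Python sketch of such a check, assuming the connector output has already been parsed into a dictionary. The function name require_success and the sample payloads are hypothetical and serve only as illustration.

```python
def require_success(output: dict) -> dict:
    """Raise if the connector output does not report "success": true."""
    # Only the boolean True counts; a missing key or "success": false is a failure.
    if output.get("success") is not True:
        raise RuntimeError(f"Athena step reported failure: {output!r}")
    return output


# Hypothetical payloads; real outputs carry additional fields.
ok = require_success({"success": True})

try:
    require_success({"success": False})
    failed = False
except RuntimeError:
    failed = True
```

A check like this gives a pipeline that continues after an error a single, explicit place where the failure is surfaced.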

Optional Tab

Parameter | Description | Default value | Data type

Database DB

The Athena database to use.

N/A

string

Delay

Milliseconds before the next poll for query execution status. See the section ‘Waiting for Query Completion and Retrying Failed Queries’ to learn more.

2000

number

Encryption Option

The encryption type to use when storing query results in S3. One of SSE_S3, SSE_KMS, or CSE_KMS.

string

Include Trace

Include useful trace information at the beginning of queries as an SQL comment (prefixed with --).

false

boolean

Initial Delay

Milliseconds before the first poll for query execution status. See the section ‘Waiting for Query Completion and Retrying Failed Queries’ to learn more.

1000

number

Kms Key DB

For SSE-KMS and CSE-KMS, this is the KMS key ARN or ID.

N/A

string

Max Attempts

Maximum number of times to attempt a query. Set to 1 to disable retries. See the section ‘Waiting for Query Completion and Retrying Failed Queries’ to learn more.

1

number

Max Results

Maximum number of results to return for the given operation (if supported by the Athena API endpoint). If not set, the Athena API default for the given operation is used.

N/A

number

Next Token DB

Pagination token to use in the case where the response from the previous request was truncated.

N/A

string

Operation

The Athena API function to call.

startQueryExecution

string

Output Location DB

The location in Amazon S3 where query results are stored, such as s3://path/to/query/bucket/. Ensure this value ends with a forward slash (‘/’).

N/A

string

Output Type

How query results should be returned. One of: StreamList (default; returns a GetQueryResultsIterable that can page through all results), SelectList (returns at most 1,000 rows at a time, plus a NextToken value as a header that can be used for manual pagination of results), or S3Pointer (returns an S3 path pointing to the results).

StreamList

string

Query Execution Id DB

The unique ID identifying the query execution.

N/A

string

Query String DB

The SQL query to run. Except for simple queries, prefer setting this as the body of the Exchange or as a header using Athena2Constants.QUERY_STRING to avoid having to deal with URL encoding issues.

N/A

string

Region DB

The region in which the Athena client operates. This parameter expects the lowercase region name (for example, ap-east-1), which is the value the AWS SDK returns from a call such as Region.EU_WEST_1.id().

N/A

string

Reset Wait Timeout On Retry

Reset the waitTimeout countdown in the event of a query retry. If set to true, potential max time spent waiting for queries is equal to waitTimeout x maxAttempts. See the section ‘Waiting for Query Completion and Retrying Failed Queries’ to learn more.

true

boolean

Retry

Optional comma-separated list of error types for which the query is retried. Use ‘retryable’ to retry all retryable failure conditions (e.g. generic errors and resources exhausted), ‘generic’ to retry ‘GENERIC_INTERNAL_ERROR’ failures, ‘exhausted’ to retry queries that have exhausted resource limits, ‘always’ to always retry regardless of failure condition, or ‘never’ or null to never retry (default). See the section ‘Waiting for Query Completion and Retrying Failed Queries’ to learn more.

never

string

Wait Timeout

Optional maximum time, in milliseconds, to wait for a successful query completion. See the section ‘Waiting for Query Completion and Retrying Failed Queries’ to learn more.

0

number

Work Group DB

The workgroup to use for running the query.

N/A

string

Client Request Token DB

A unique string to ensure issued queries are idempotent. It is unlikely you will need to set this.

N/A

string
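
The timing parameters above (Initial Delay, Delay, Wait Timeout, Max Attempts, and Reset Wait Timeout On Retry) interact, and a small calculation makes the effect easier to see. The Python sketch below is only a model. It assumes the first status poll happens Initial Delay milliseconds after the query starts and that later polls follow every Delay milliseconds until Wait Timeout is reached. The function names are hypothetical, and the connector's actual scheduling may differ.

```python
def poll_offsets_ms(initial_delay_ms: int, delay_ms: int, wait_timeout_ms: int) -> list[int]:
    """Offsets (ms since query start) at which status would be polled in one attempt."""
    if delay_ms <= 0:
        raise ValueError("delay_ms must be positive")
    offsets = []
    t = initial_delay_ms
    # Assumption: polling stops once the next poll would fall after the wait timeout.
    while t <= wait_timeout_ms:
        offsets.append(t)
        t += delay_ms
    return offsets


def worst_case_wait_ms(wait_timeout_ms: int, max_attempts: int, reset_on_retry: bool) -> int:
    """Upper bound on waiting time across all attempts."""
    # With the reset enabled, each attempt gets a fresh timeout: waitTimeout x maxAttempts.
    # Without it, all attempts share a single timeout budget.
    return wait_timeout_ms * max_attempts if reset_on_retry else wait_timeout_ms


offsets = poll_offsets_ms(initial_delay_ms=1000, delay_ms=2000, wait_timeout_ms=10000)
bound = worst_case_wait_ms(wait_timeout_ms=10000, max_attempts=3, reset_on_retry=True)
```

With an initial delay of 1000 ms, a delay of 2000 ms, and a wait timeout of 10000 ms, this model polls five times per attempt, and three attempts with the timeout reset could wait up to 30000 ms in total.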

Headers Tab

Parameter | Description | Default value | Data type

CamelAwsAthenaOperation

The operation to perform. Permitted values are getQueryExecution, getQueryResults, listQueryExecutions, startQueryExecution.

startQueryExecution

string

CamelAwsAthenaDatabase DB

The Athena database to use.

N/A

string

CamelAwsAthenaQueryExecutionId DB

The unique ID identifying the query execution.

N/A

string

CamelAwsAthenaWorkGroup DB

The workgroup to use for running the query.

N/A

string

CamelAwsAthenaNextToken DB

Pagination token to use in the case where the response from the previous request was truncated.

N/A

string

CamelAwsAthenaMaxResults

Maximum number of results to return for the given operation (if supported by the Athena API endpoint). If not set, the Athena API default for the given operation is used.

N/A

number

CamelAwsAthenaIncludeTrace

Include useful trace information at the beginning of queries as an SQL comment (prefixed with --).

false

boolean

CamelAwsAthenaOutputLocation DB

The location in Amazon S3 where query results are stored, such as s3://path/to/query/bucket/. Ensure this value ends with a forward slash (‘/’).

N/A

string

CamelAwsAthenaOutputType

How query results should be returned. One of: StreamList (default; returns a GetQueryResultsIterable that can page through all results), SelectList (returns at most 1,000 rows at a time, plus a NextToken value as a header that can be used for manual pagination of results), or S3Pointer (returns an S3 path pointing to the results).

string

CamelAwsAthenaQueryExecutionState

The state of the query execution.

string

CamelAwsAthenaClientRequestToken DB

A unique string to ensure issued queries are idempotent. It is unlikely you will need to set this.

N/A

string

CamelAwsAthenaQueryString DB

The SQL query to run. Except for simple queries, prefer setting this as the body of the Exchange or as this header to avoid having to deal with URL encoding issues.

N/A

string

CamelAwsAthenaEncryptionOption

The encryption type to use when storing query results in S3.

string

CamelAwsAthenaKmsKey DB

For SSE-KMS and CSE-KMS, this is the KMS key ARN or ID.

N/A

string

CamelAwsAthenaWaitTimeout

Optional maximum time, in milliseconds, to wait for a successful query completion. See the section ‘Waiting for Query Completion and Retrying Failed Queries’ to learn more.

N/A

number

CamelAwsAthenaInitialDelay

Milliseconds before the first poll for query execution status. See the section ‘Waiting for Query Completion and Retrying Failed Queries’ to learn more.

N/A

number

CamelAwsAthenaDelay

Milliseconds before the next poll for query execution status. See the section ‘Waiting for Query Completion and Retrying Failed Queries’ to learn more.

N/A

number

CamelAwsAthenaMaxAttempts

Maximum number of times to attempt a query. Set to 1 to disable retries. See the section ‘Waiting for Query Completion and Retrying Failed Queries’ to learn more.

N/A

number

CamelAwsAthenaRetry DB

Optional comma-separated list of error types for which the query is retried. Use ‘retryable’ to retry all retryable failure conditions (e.g. generic errors and resources exhausted), ‘generic’ to retry ‘GENERIC_INTERNAL_ERROR’ failures, ‘exhausted’ to retry queries that have exhausted resource limits, ‘always’ to always retry regardless of failure condition, or ‘never’ or null to never retry (default). See the section ‘Waiting for Query Completion and Retrying Failed Queries’ to learn more.

N/A

string

CamelAwsAthenaResetWaitTimeoutOnRetry

Reset the waitTimeout countdown in the event of a query retry. If set to true, potential max time spent waiting for queries is equal to waitTimeout x maxAttempts. See the section ‘Waiting for Query Completion and Retrying Failed Queries’ to learn more.

false

boolean

CamelAwsAthenaStartQueryExecutionAttempts

Total number of attempts made to run the query. Will be greater than 1 if the query is retried.

N/A

number

CamelAwsAthenaStartQueryExecutionElapsedMillis

Total time in millis taken in startQueryExecution (mostly relevant when waiting for query completion within startQueryExecution).

N/A

number
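
The SelectList output type returns results in pages and supplies a NextToken value for requesting the following page. The manual pagination loop this implies can be sketched in Python as follows. The page shape (a dictionary with rows and nextToken keys) and the fake_fetch_page stand-in are assumptions made for illustration; in a real pipeline the token would travel through the CamelAwsAthenaNextToken header or the Next Token parameter.

```python
from typing import Callable, Iterator, Optional


def iter_all_rows(fetch_page: Callable[[Optional[str]], dict]) -> Iterator[dict]:
    """Yield rows from every page, following the next token until none is returned."""
    token: Optional[str] = None
    while True:
        page = fetch_page(token)       # first call passes None: no token yet
        yield from page["rows"]
        token = page.get("nextToken")
        if not token:                  # no token means the last page was reached
            return


# Hypothetical stand-in for a getQueryResults call; keyed by the incoming token.
_PAGES = {
    None: {"rows": [{"id": 1}, {"id": 2}], "nextToken": "t1"},
    "t1": {"rows": [{"id": 3}], "nextToken": None},
}


def fake_fetch_page(token: Optional[str]) -> dict:
    return _PAGES[token]


rows = list(iter_all_rows(fake_fetch_page))
```

Passing the fetch function in as a parameter keeps the loop independent of how the connector is actually invoked.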

Successful executions should output at least a property "success": true
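
Several parameters in the tables above accept only specific values, and the Output Location must end with a forward slash. A pre-flight check along these lines can catch such mistakes before the connector runs. This is a minimal Python sketch: the dictionary keys (operation, outputType, encryptionOption, outputLocation) and the function name are hypothetical, while the allowed values are taken from the parameter descriptions.

```python
OPERATIONS = {"getQueryExecution", "getQueryResults", "listQueryExecutions", "startQueryExecution"}
OUTPUT_TYPES = {"StreamList", "SelectList", "S3Pointer"}
ENCRYPTION_OPTIONS = {"SSE_S3", "SSE_KMS", "CSE_KMS"}


def validate_params(params: dict) -> list[str]:
    """Return a list of human-readable problems; an empty list means no issue was found."""
    problems = []

    # Unset values fall back to the documented defaults.
    operation = params.get("operation", "startQueryExecution")
    if operation not in OPERATIONS:
        problems.append(f"unknown operation: {operation}")

    output_type = params.get("outputType", "StreamList")
    if output_type not in OUTPUT_TYPES:
        problems.append(f"unknown output type: {output_type}")

    # Encryption Option and Output Location are only checked when provided.
    encryption = params.get("encryptionOption")
    if encryption is not None and encryption not in ENCRYPTION_OPTIONS:
        problems.append(f"unknown encryption option: {encryption}")

    location = params.get("outputLocation")
    if location is not None and not location.endswith("/"):
        problems.append("output location must end with a forward slash")

    return problems


good = validate_params({"outputLocation": "s3://path/to/query/bucket/"})
bad = validate_params({"operation": "dropTable", "outputLocation": "s3://bucket/results"})
```

Returning a list instead of raising on the first problem lets a caller report every misconfiguration at once.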
