Skip to main content

Intro

Amazon Athena is a server-less, interactive query service that lets you analyze big data in S3 using standard SQL. Use Domo’s Amazon Athena JDBC 2.0 connector to query data from your Athena instance. Use Domo’s Amazon Athena JDBC 2.0 connector to visualize your Amazon Athena data using JDBC 2.0. To learn more about the Amazon Athena API, visit their page (https://aws.amazon.com/documentation/athena/). You connect to your Amazon Athena account in the Data Center. This topic discusses the fields and menus that are specific to the Amazon Athena JDBC 2.0 connector user interface. General information for adding DataSets, setting update schedules, and editing DataSet information is discussed in Adding a DataSet Using a Data Connector.

Prerequisites

To connect to your Amazon Athena account and create a DataSet, you must have the following:
  • Your AWS access key.
  • Your AWS secret key.
  • The Amazon S3 location to which the query output is written (e.g. s3://aws-athena-query-results-1234-us-west-2/ ). You can find this by going to Settings in the Athena web console.
  • Your AWS region.
For information about obtaining AWS credentials, visit http://docs.aws.amazon.com/general/latest/gr/aws-sec-cred-types.html.

Required Permission Policies

To gain the access to AWS services and resources, such as Athena and the Amazon S3 buckets, provide the JDBC driver credentials to your application. If you are using the JDBC driver, ensure that the IAM permissions policy includes all actions listed in AmazonAthenaFullAccess Managed Policy. The AmazonAthenaFullAccess managed policy grants full access to Athena. Managed policy contents change, so the policy shown here may be out-of-date. Check the IAM console for the most up-to-date policy.
{
    "Version": "2012-10-17",
    "Statement": [
        {
            "Effect": "Allow",
            "Action": [
                "athena:*"
            ],
            "Resource": [
                "*"
            ]
        },
        {
            "Effect": "Allow",
            "Action": [
                "glue:CreateDatabase",
                "glue:DeleteDatabase",
                "glue:GetDatabase",
                "glue:GetDatabases",
                "glue:UpdateDatabase",
                "glue:CreateTable",
                "glue:DeleteTable",
                "glue:BatchDeleteTable",
                "glue:UpdateTable",
                "glue:GetTable",
                "glue:GetTables",
                "glue:BatchCreatePartition",
                "glue:CreatePartition",
                "glue:DeletePartition",
                "glue:BatchDeletePartition",
                "glue:UpdatePartition",
                "glue:GetPartition",
                "glue:GetPartitions",
                "glue:BatchGetPartition"
            ],
            "Resource": [
                "*"
            ]
        },
        {
            "Effect": "Allow",
            "Action": [
                "s3:GetBucketLocation",
                "s3:GetObject",
                "s3:ListBucket",
                "s3:ListBucketMultipartUploads",
                "s3:ListMultipartUploadParts",
                "s3:AbortMultipartUpload",
                "s3:CreateBucket",
                "s3:PutObject"
            ],
            "Resource": [
                "arn:aws:s3:::aws-athena-query-results-*"
            ]
        },
        {
            "Effect": "Allow",
            "Action": [
                "s3:GetObject",
                "s3:ListBucket"
            ],
            "Resource": [
                "arn:aws:s3:::athena-examples*"
            ]
        },
        {
            "Effect": "Allow",
            "Action": [
                "s3:ListBucket",
                "s3:GetBucketLocation",
                "s3:ListAllMyBuckets"
            ],
            "Resource": [
                "*"
            ]
        },
        {
            "Effect": "Allow",
            "Action": [
                "sns:ListTopics",
                "sns:GetTopicAttributes"
            ],
            "Resource": [
                "*"
            ]
        },
        {
            "Effect": "Allow",
            "Action": [
                "cloudwatch:PutMetricAlarm",
                "cloudwatch:DescribeAlarms",
                "cloudwatch:DeleteAlarms"
            ],
            "Resource": [
                "*"
            ]
        }
    ]
}
To gain access to AWS services and resources, you need to provide the JDBC or ODBC driver credentials to your application. For more information refer to Access through JDBC and ODBC connections.

Connecting to Your Amazon Athena Account

This section enumerates the options in the Credentials and Details panes in the Amazon Athena JDBC 2.0 Connector page. The components of the other panes in this page, Scheduling and Name & Describe Your DataSet, are universal across most connector types and are discussed in greater length in Adding a DataSet Using a Data Connector.

Credentials Pane

This pane contains fields for entering credentials to connect to your Amazon Athena account. The following table describes what is needed for each field:

Field

Description

Access Key Enter your AWS access key.
Secret Key Enter your AWS secret key.
S3 Staging Directory Enter the Amazon S3 location where your query output is written. From the Athena web console, this can be found by going to Settings . Example: s3://aws-athena-query-results-1234-us-west-2/
Region Select your AWS region.
Workgroup Enter the workgroup.
Once you have entered valid Amazon Athena credentials, you can use the same account any time you go to create a new Amazon Athena JDBC 2.0 DataSet. You can manage connector accounts in the Accounts tab in the Data Center. For more information about this tab, see Managing User Accounts for Connectors.

Details Pane

This pane contains a number of options for entering your query and configuring the report.

Menu

Description

Report

Select the Amazon Athena JDBC 2.0 report you want to run.

Query

Let’s you query data from a specified S3 database and table.

Query Parameter Enter an SQL query with a parameter.

Catalog Name / Data Source

Select the specific catalog or data source in the dropdown you want to query.

Database Name

Select the database you want to query.

Table Name

Select the table in the database you want to query.

Table Columns

Select all the columns you want to query.

Query Helper

Based on the selected database name, table name and column names, the Query Helper will build a sample query for you.

Query

Enter the SQL query you want to execute (e.g. select * from table_name).

Query Parameter

Enter the query parameter value. It is the initial value for query parameter. The last run date is optional. By default, it is ‘02/01/1700’ if is not provided.

Example: !{lastvalue:_id}!=1,!{lastvalue:gender}!='male',!{lastrundate:start_date}!=02/01/1944

Connection Timeout

Enter the maximum amount of time, in milliseconds, to make a successful connection to Athena before an attempt is terminated. The default value is 10000ms.

Socket Timeout

Enter the maximum amount of time, in milliseconds, to wait for a socket in order to send data to Athena. The default value is 10000ms.

Maximum Error Retries

The maximum number of retries that the JDBC client attempts to make a request to Athena. The default value is 10.

Convert BigInt to Integer Select this check box if you want to convert the BIGINT columns into Integer.

Other Panes

For information about the remaining sections of the connector interface, including how to configure scheduling, retry, and update options, see Adding a DataSet Using a Data Connector.