
Intro

BigQuery is a data platform for creating, managing, sharing, and querying data. Domo’s Google BigQuery High Bandwidth Service connector lets you bring larger result sets into Domo from BigQuery by staging them through the Google Cloud Storage service. To learn more about the BigQuery API, go to https://cloud.google.com/bigquery/docs/reference/v2/.

The Google BigQuery High Bandwidth Service connector is a “Cloud App” connector, meaning it retrieves data stored in the cloud. In the Data Center, you can access the connector page for this and other Cloud App connectors by clicking Cloud App in the toolbar at the top of the window.

This topic discusses the fields and menus that are specific to the Google BigQuery High Bandwidth Service connector user interface. For general information about adding DataSets, setting update schedules, and editing DataSet information, see Adding a DataSet Using a Data Connector.

Prerequisites

To connect to BigQuery and create a DataSet, you must have the following:
  • A JSON BigQuery service account key
  • A JSON Google Cloud Storage service account key
To generate the necessary service account keys, do the following:
  1. In the Google Cloud Platform Console, open the IAM & Admin page.
  2. Click Service accounts.
  3. Select your project and click Open.
  4. Click Create Service Account.
  5. Enter a name and description for the service account.
  6. Click Create.
  7. To grant this service account access to the project, click the Select a role drop-down.
  8. Click Project, then click Owner. Owner appears in the Role field.
  9. Click Continue.
  10. Click Create key.
  11. Select JSON as the key type.
  12. Click Create.
A private key is saved to your computer.
The service accounts need the following permissions:

BigQuery permissions
  • bigquery.datasets.get
  • bigquery.jobs.create
  • bigquery.jobs.get
  • bigquery.jobs.list
  • bigquery.tables.list
  • bigquery.tables.export
  • bigquery.tables.create
  • bigquery.tables.delete

Cloud Storage permissions
  • storage.buckets.get
  • storage.buckets.list
  • storage.objects.create
  • storage.objects.delete
  • storage.objects.get
  • storage.objects.list

Connecting to BigQuery

This section enumerates the options in the Credentials and Details panes on the Google BigQuery High Bandwidth Service Connector page. The components of the other panes on this page, Scheduling and Name & Describe Your DataSet, are universal across most connector types and are discussed in greater length in Adding a DataSet Using a Data Connector.

Note that you cannot export nested and repeated data in CSV format; nested and repeated data are supported only for Avro, JSON, and Parquet exports. If a column has type ARRAY, BigQuery cannot unload that data to CSV, so you must exclude the ARRAY columns or UNNEST them for the unload to succeed, as in the sketch below.
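As a sketch, suppose a hypothetical table my_dataset.orders has an ARRAY<STRING> column named tags (both names are placeholders). Either remedy can be expressed in Standard SQL:

    -- Option 1: exclude the ARRAY column entirely.
    SELECT * EXCEPT (tags)
    FROM my_dataset.orders;

    -- Option 2: flatten the ARRAY so each element becomes its own row,
    -- which CSV can represent.
    SELECT o.order_id, o.order_date, tag
    FROM my_dataset.orders AS o,
      UNNEST(o.tags) AS tag;

Note that flattening repeats the non-array columns once per array element, so the row count grows accordingly.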

Credentials Pane

This pane contains fields for entering the credentials used to connect to your BigQuery and Google Cloud Storage accounts. The following describes what is needed for each field:

Service Account Key JSON BigQuery: Enter your Google BigQuery JSON service account key.
Service Account Key JSON Google Cloud Storage: Enter your Google Cloud Storage JSON service account key.
Once you have entered valid credentials, you can reuse the same account any time you create a new Google BigQuery High Bandwidth Service DataSet. You can manage connector accounts in the Accounts tab in the Data Center. For more information about this tab, see Managing User Accounts for Connectors.

Details Pane

This pane contains a primary Report menu, along with other menus that may or may not appear depending on the report type you select.

Report: Select the report type.
  • Query: Runs a BigQuery SQL query and returns results if the query completes.
Dataset ID: Select the BigQuery dataset ID for your data. For more information about BigQuery datasets, visit https://cloud.google.com/bigquery/docs/datasets-intro.
Google Cloud Storage Bucket Selection Criteria: Select whether you want to provide a Google Cloud Storage bucket name or select one from discovery.
Google Cloud Storage bucket name: Select the Google Cloud Storage bucket used for temporary storage as your data is transferred into Domo.
Query: Enter a query to execute. Only Standard SQL is supported. Example: Select * from Table_Name.
Partition query to determine partition tags: Enter a partition query that returns the distinct partition tags. The column containing the Date data is your partition column. Example: Select Date from Table_Name.
Force to string: Enter a comma-separated list of fields to treat as STRING in Domo.
Force to number: Enter a comma-separated list of fields to treat as NUMBER in Domo.
How would you like to pull your data?: Choose how you would like to pull your data.
Partition column name: Enter the partition column name. The column containing the Date data is your partition column.
Upsert key columns: Enter a comma-separated list of upsert key column names.
Partition Criteria: Select whether to partition your data using date keys, non-date keys, or a meta query.
Partition Query to determine partition tags: Enter a partition query that returns the distinct partition tags.
Partition Non Date Key Column Name: Select the partition column name.
Partition Support Format: Select the partition format.
Past Days: Enter the number of past days to get data for. The value must be a positive integer, for example, 30.
Past Years: Enter the number of past years to get data for. The value must be a positive integer, for example, 30.
Past Months: Enter the number of past months to get data for. The value must be a positive integer, for example, 30.
Future Years: Enter the number of future years to get data for. The value must be a positive integer, for example, 30.
Future Months: Enter the number of future months to get data for. The value must be a positive integer, for example, 30.
Date Format: Select the date format that matches the dates in the partition column. By default, yyyy-MM-dd is used.
Days Back: The number of days back to get data from. Example: specify 7 to get data for the last 7 days.
Data Rolling Window: The number of days for which data is retained. Example: specify 60 to retain data for 60 days.
Entering the SQL Query and Partition Query
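As a sketch, for a hypothetical table my_dataset.sales whose DATE column Date serves as the partition column (both names are placeholders), the two queries might look like this:

    -- Query: the full result set to bring into Domo.
    SELECT *
    FROM my_dataset.sales;

    -- Partition query: returns the distinct values of the partition column.
    SELECT DISTINCT Date
    FROM my_dataset.sales;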
Entering the Partition column name and other parameters
Data is fetched for the number of days specified in the Days Back field and retained for the number of days specified in the Data Rolling Window field.
Note: When using the Partition option, your DataSet must be set to Append and NOT Replace.
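Conceptually, a Days Back value of 7 restricts each run to recent partitions, roughly as if the following date filter were applied to the hypothetical my_dataset.sales table (a sketch only; the connector handles this internally, so you do not add this filter yourself):

    -- Rough equivalent of Days Back = 7.
    SELECT *
    FROM my_dataset.sales
    WHERE Date >= DATE_SUB(CURRENT_DATE(), INTERVAL 7 DAY);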
Using the Upsert mode
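Upsert mode updates existing rows and inserts new ones, matched on the upsert key columns. The behavior is comparable to a BigQuery MERGE keyed on those columns, illustrated below with hypothetical target and staging tables and an order_id key (Domo performs the upsert for you; this is only a sketch of the semantics):

    -- Illustration of upsert semantics with upsert key column order_id.
    MERGE my_dataset.target AS t
    USING my_dataset.staging AS s
    ON t.order_id = s.order_id
    WHEN MATCHED THEN
      UPDATE SET t.amount = s.amount, t.order_date = s.order_date
    WHEN NOT MATCHED THEN
      INSERT (order_id, amount, order_date)
      VALUES (s.order_id, s.amount, s.order_date);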

Other Panes

For information about the remaining sections of the connector interface, including how to configure scheduling, retry, and update options, see Adding a DataSet Using a Data Connector.
Note: Currently this connector only supports date partitions.

Troubleshooting

What if no rows of data are returned, but there are rows in the run history?

Google’s API documentation states that BigQuery and Cloud Storage must follow these co-location rules:
  • If the BigQuery dataset is in a specific location, the Cloud Storage bucket must be in the same location.
  • If the BigQuery dataset is in a multi-region, the Cloud Storage bucket must be in the same multi-region or in a location contained within that multi-region.
For the list of bucket locations, visit https://cloud.google.com/storage/docs/locations.
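To check where your BigQuery datasets live, you can query the INFORMATION_SCHEMA.SCHEMATA view; the region-us qualifier below is an assumption, so substitute the region or multi-region you use:

    -- Lists each dataset in the us multi-region along with its location.
    SELECT schema_name, location
    FROM `region-us`.INFORMATION_SCHEMA.SCHEMATA;

Compare the location column against your bucket’s location in the Cloud Storage console; if they differ, runs can complete while returning no rows.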