Intro
Databricks is a cloud-based collaborative data science, data engineering, and data analytics platform that combines the best of data warehouses and data lakes into a lakehouse architecture. With Databricks you can access all your data, analytics, and AI on one lake house platform. The simple, open, and collaborative environment helps reduce the infrastructure complexity, keeps control of your data, and makes it easy for your teams to partner across the entire data and workflow. For more information about the Databricks API, visit their website. (https://docs.databricks.com/dev-tools/api/index.html ) The Databricks connector is a “Database” connector, meaning it retrieves data from a database using a query. In the Data Center, you can access the connector page for this and other Database connectors by clicking Database in the toolbar at the top of the window. You connect to your Databricks database in the Data Center. This topic discusses the fields and menus that are specific to the Databricks connector user interface. General information for adding DataSets, setting update schedules, and editing DataSet information is discussed in Adding a DataSet Using a Connector. Note: Contact your AE or CSM to have it enabled in your instance.Prerequisites
To connect to a Databricks database and create a DataSet, you must have the following:- The hostname for the database
- The port number for the database
- The database name or schema name
- Databricks personal access token
- The HTTP Path
Connecting to Your Databricks Database
This section enumerates the options in the Credentials and Details panes on the Databricks Connector page. The components of the other panes on this page, Scheduling, and Name & Describe Your DataSet, are universal across most connector types and are discussed in greater length in Adding a DataSet Using a Connector.Credentials Pane
This pane contains fields for entering credentials to connect to your database. The following table describes what is needed for each field:Field | Description |
|---|---|
Host | Enter the hostname for the Databricks database. Example: |
Port | Enter the port number for the Databricks database. |
Database | Enter the name of the Databricks database. |
Personal Access Token | Enter your Databricks personal access token. |
| HTTP Path | Enter the HTTP path. |
Details Pane
In this pane you create an SQL query to pull data from your database, with or without a parameter.Menu | Description | ||||||
|---|---|---|---|---|---|---|---|
Query Type | Select the desired query type.
| ||||||
Query | Enter the Structured Query Language (SQL) query to use in selecting the data you want. Example: You can use the Query Helper parameter to help you write a usable SQL query. To use the Query Helper , do the following:
| ||||||
Database Tables | Select the database table you want to import into Domo. | ||||||
Table Columns | Select the table columns you want to import into Domo. | ||||||
Query Helper | Copy and paste the SQL statement in this field into the Query field. For more information, see Query , above. | ||||||
Fetch Size | Enter the fetch size for memory performance. The default value will be used if no fetch size is specified. If an ” out of memory ” error occurs, retry decreasing the fetch size. | ||||||
Partition Support Format | Select the Year, Day and Month format of the data you would like to retrieve. | ||||||
Fetch Size | Enter the fetch size for memory performance. The default value will be used if no fetch size specified. If an “out of memory” error occurs, retry decreasing the fetch size. | ||||||
How you would like to import data into Domo? | You can import the data from the following:
|