Intro
Apache Impala is an open source massively parallel processing (MPP) SQL query engine for data stored in a computer cluster running Apache Hadoop. It consists of different processes that run on specific hosts within your CDH cluster. The Domo Apache Impala SSH connector brings your data from the Apache server securely through an SSH tunnel into Domo. The ApacheImpala Connector is a “Cloud App” Connector, meaning it retrieves data stored in the cloud. In the Data Center, you can access the Connector page for this and other Cloud App Connectors by clicking Cloud App in the toolbar at the top of the window. You connect to your ApacheImpala account in the Data Center. This topic discusses the fields and menus that are specific to the ApacheImpala Connector user interface. General information for adding DataSets, setting update schedules, and editing DataSet information is discussed in Adding a DataSet Using a Data Connector.Prerequisites
To connect to your Apache Impala database and create a DataSet, you must have the following:- The username and password you use to log into your Apache Impala database
- The host name or IP address for the database server (e.g. db.company.com ).
- The port number for the database
- The database name
- Connecting String Parameter
Connecting to Your ApacheImpala Connector Account
This section enumerates the options in the Credentials and Details panes in the [insert Connector name here] Connector page. The components of the other panes in this page, Scheduling and Name & Describe Your DataSet, are universal across most connector types and are discussed in greater length in Adding a DataSet Using a Data Connector.Credentials Pane
This pane contains fields for entering credentials to connect to your (third-party tool) account. The following table describes what is needed for each field:Field | Description |
|---|---|
| Host | Enter the hostname or IP address of your database server. Example: db.company.com |
| Database Port | Enter your Apache Impala port number. |
| Database Name | Enter your Apache Impala database/schema name. |
| Username | Enter your Apache Impala username. |
| Password | Enter your Apache Impala password. |
| Database Connection String Parameter(s) | Enter the parameter(s) you want to include in the database connection string. Multiple parameters are separated by a semicolon. (Example: AuthMech=3;SSL=1;AllowSelfSignedCerts=1) |
Details Pane
This pane contains a primary Reports menu, along with various other menus which may or may not appear depending on the report type you select.Menu | Description | ||||
|---|---|---|---|---|---|
Query Type | Select a query type.
| ||||
| Query | Enter the SQL query to execute. The query will execute on the Apache Impala server and fetch the data from it. | ||||
| Query Parameter | Enter the query parameter value. It is the initial value for query parameter. The last run date is optional. The default value for the last date is ‘02/01/1700’ if not provided. Example: | ||||
| Database Table | Select the database table. | ||||
| Table Columns | Select the table columns. | ||||
| Query Helper | This query is automatically generated when you select a table and columns in the Database Table and Table Columns fields, respectively. Copy and paste this query into the Query field if you need help building a query. |
Other Panes
For information about the remaining sections of the Connector interface, including how to configure scheduling, retry, and update options, see Adding a DataSet Using a Data Connector.FAQs
What kind of credentials do I need to power up the Apache Impala connector?
What kind of credentials do I need to power up the Apache Impala connector?
How frequently will my data update?
How frequently will my data update?
Are there any API limits that I need to be aware of?
Are there any API limits that I need to be aware of?
Can I use the same Apache Impala account to create multiple datasets?
Can I use the same Apache Impala account to create multiple datasets?
What do I need to be aware of while writing a query?
What do I need to be aware of while writing a query?
Troubleshooting
- Make sure your authentication remains valid.
- Review the configuration to make sure that all required items have been selected.
- Review the Connector history for error messages.
- In rare cases, you may be requesting too much information and reaching API limitations or timeouts. If this is the case, you can review the history of the Connector run to see the error message and duration. If this is the case, you can reduce the number of accounts that are being pulled, choose a smaller number of metrics for the report that you are pulling, or reduce the timeframe that you are trying to pull.