Intro
There are two available scripting tiles in Magic ETL: Python Script and R Script.Note:
These tiles are available by default to users on the Domo Consumption agreement.For non-consumption users, the tiles are available
on demand
and
paid
. To enable scripting tiles, contact your Domo account team. You may need to complete training before you can use a tile.

- General information
- Configuration steps
- API Reference
- Project Directory and /tmp Directory
- Troubleshooting
General Information
This section provides information about all scripting tiles in Magic ETL. For information about specific tiles, use one of the following links to jump to the API reference for that tile: Scripting tiles are a powerful tool within Magic ETL. They allow you to write custom R or Python algorithms and implement them directly into DataFlows. Use them to create complex data science analyses that run every time your data updates.Take a Tour
All scripting tiles have the same look. The only difference between them is the supported language. The main body of the tile is the code editor. This is where you write your script. It includes syntax highlighting for the specific language the tile supports.


. Clicking this button resets the script to the example code template that exists when you first open a scripting tile. Be careful though—if you have already written a script, clicking this button overwrites your script (after a confirmation prompt). However, as long as you do not close the tile, you can retrieve your overwritten script by placing the cursor within the code editor and pressing Command+z or CTRL+z.
In the Schema tab, you tell the tile what the results of your script looks like. To use this, click Schema in the tile editor panel toolbar. For a deep dive into this tab, see the Populating the Resulting Schema section of this article.

stdout ) and standard error (stderr ) that your script produces. This tab is not available until after a preview is run on the tile, when it will appear as an option in the tile editor panel toolbar.


Get Data In and Out of the Scripting Tile
Domo provides a basic API for importing data (read) from the Domo ecosystem into your script and then for exporting your results out (writing) of the script and back into the Domo ecosystem. The API is different for each scripting language and tries to follow the semantics of the language.Note:
Be aware of the following data type issues when using scripting tiles in Domo:
- In Python, LONG becomes DOUBLE in Python because NumPY does not support NULLs in integer columns.
- In both Python and R, DECIMAL becomes DOUBLE because there is no analogous type in NumPY or R.
Step 1: Import The Domo Package
The first line in any script should be used for importing the language-specific Domo package into the script. This follows standard package import semantics for the chosen language. It is part of the initial code template. Alternatively, you can search for “domo” in the Packages list to the right of the code editor and click on the package to inject the code snippet into the editor.Step 2: Read Data Into The Script
The next step in writing a script is getting access to the data. Each language will have its own semantic version of a Domo “read” method. This is included in the initial code template. Alternatively, you can click on one of the inputs in the input list to the right of the code editor. This injects a code snippet that reads the input data and stores it in a variable. You can rename this variable if you like.Step 3: Export Data Out Of The Script
The last step in writing a script is returning the results to the Domo ecosystem. Each language has its own semantic version of a Domo “write” method. This is also included in the initial code template. There is no other way of injecting a code snippet for this, however. Provide the name of the variable that is storing the results of your script to this “write” method. There can only be one export per script.Populate the Resulting Schema
There are three ways to tell the tile what schema your script produces. These methods can be mixed and matched as necessary to get the proper result. You perform all of these in the Schema tab of the tile.Manually Adding the Columns
Follow these steps to add the columns manually:-
Type the name of a column that your script will produce into the
Column Name field.

-
In the list of available data types, select the data type for this column.

-
Add another column by clicking the
Add Column button.

Bulk Add Columns
Follow these steps to add the columns in bulk:- Add all the columns from a particular “Input DataSet” tile in bulk by clicking on the Add From DataSet button and selecting the “Input DataSet” tile you want to populate from.

- Remove any excess columns using the “X” button on the far right side of the column.

Run Script
Follow these steps to have Domo run the script for you:-
Click either the
button on the right side of the tile editor panel toolbar or the Run Preview button in the toolbar at the top of the canvas. (If there is an issue with your preview, see the Troubleshooting section of this article.)
After the preview has finished running, a sentence appears under the text inputs for creating new columns. This tells you how many columns the preview returned that are not currently listed as being part of the tile’s schema.
- Click the button at the end of the sentence.
Configuration Steps
If you have not done so already, check out the Take a Tour section of this article to become familiar with the different features of the Scripting Tile.- Start a new Magic ETL DataFlow.
- Drag an Input DataSet tile onto the canvas and select the DataSet you want to use. You will find the Scripting tiles under the “Data Science” section of the left sidebar.
-
Drag the Scripting tile for the language you want to write onto the canvas and then drag a connection from the “Input DataSet” to this Scripting tile.

-
(Conditional) If the editor panel is not open at the bottom of the screen, click the Scripting tile you just added to the canvas so that it is selected; the editor panel will then open.
The code editor will be auto-populated with an initial code template. For more information on what is included in the template, see Get Data In and Out of the Scripting Tile, above. A yellow banner also displays. Ignore this for now; it will be addressed later on in the configuration steps.
You will see that the template is pulling the data from the “Input DataSet” tile that is connected to the Scripting tile and storing it in a variable. This is the variable that your script should manipulate.

-
(Conditional) If you have multiple data sources, simply drag out another “Input DataSet,” select the next DataSet you need, connect this tile to the Scripting tile, and add another import statement to the script.
Underneath the line that is creating the variable to hold your data, you will see a space for you to write your script.


- Write your script in the line provided, making sure that the export statement at the very bottom of the template is exporting the variable that represents the final product of your script. You will notice that your tile is still not marked as configured and there is still a yellow banner at the top of the tile telling you this. This is because the tile does not yet know what the result of your script looks like.
-
To provide the tile with this information…
- Select the Schema tab in the toolbar of the tile editor panel.
- Choose one of the methods listed in the Populate the Resulting Schema section of this article and follow the steps there. Afterwards, continue the rest of the steps listed below.
- Drag an “Output DataSet” tile from the sidebar and drop it onto the canvas.
-
Drag a connection from the Scripting tile to this new “Output DataSet” tile.

- Select the Output DataSet tile to open the tile editor and give the tile a name and description. The tiles within the DataFlow are now fully configured.
- Next, enter a name for the DataFlow in the field in the top left corner.
- Configure the settings in the panel using the Settings button in the canvas toolbar.
- Save the DataFlow.
API Reference
domomagic is the simple API provided to both the Magic ETL Python Tile and R Tile authors, allowing them to load and unload data to and from the Python and R contexts.Python Script Tile Module API Reference

R Script Tile Package API Reference

Method Signature | Parameters | Returns |
|---|---|---|
|
| The tiles input data.frame. |
|
| N/A |
read.dataframe function also supports the following parameters that allow you to customize its behavior:
Project Directory and /tmp Directory in Scripting Tiles
In Magic ETL’s Python and R scripting environment, you can use two special directories to store and share data between scripting tiles in a DataFlow—the Project Directory and the /tmp Directory. These directories simplify the experience of working with complex data processing tasks.Project Directory
The Project Directory is a shared space for all scripting tiles within a DataFlow, and its contents persist between executions. This means that you can store things like statistical models in the Project Directory and continue working on them across multiple executions. The Project Directory is located at /home/domo/project, but you can simply refer to it asproject in your script. Any changes you make to the Project Directory are only saved after successful executions. Changes from preview or failed executions are discarded.
The total size of a Project Directory may not exceed eight gigabytes (8 GB). Contact Domo Support
if you need to increase this limit, restore a directory from a backup, or copy a directory from one DataFlow to another.
Use Cases
Possible use cases for the project directory include the following:- Use Case 1 — Train a statistical model over time, which allows you to keep multiple versions while you figure out which one works best.
- Use Case 2 — Update complex calculations or aggregate statistics with new data instead of re-computing everything from the beginning when using the Input DataSet option that loads only new batches since the last successful execution.
Set Up a Project Directory
- Create your DataFlow with the necessary scripting tiles (Python and/or R) and any required input and output DataSet tiles.
-
In the first scripting tile, whether Python or R, write your script to process the data incrementally and save the intermediate results or models to the Project Directory.

-
In another scripting tile, whether Python or R, read the intermediate results or models saved in the Project Directory from the previous tile and continue processing as needed.

- Save the final results or updated models to the Project Directory, if needed, and use the Output DataSet tile to store the final results.
/tmp Directory
The /tmp directory is a shared space for all scripting tiles within a DataFlow. If you create a file in the /tmp Directory in one scripting tile, you can access it from another scripting tile within the same DataFlow. Unlike the Project Directory, the contents of the /tmp Directory are temporary and are deleted between executions of the DataFlow.Use Cases
Possible use cases for the /tmp Directory include the following:- Use Case 1 — Create multiple output DataFrames in one script as files in /tmp and use them in separate tables in downstream scripting tiles.
- Use Case 2 — Share an object between scripting tiles that can’t be easily represented as a DataFrame.
- Use Case 3 — Improve communication efficiency between scripting tiles.
- Create your DataFlow with the necessary scripting tiles, whether Python or R, and any required input and output DataSet tiles.
-
In the first scripting tile, write your script to process the data as needed and save the resulting data or objects to the /tmp Directory.

-
In the second scripting tile, read the data saved in the /tmp Directory from the previous tile.

- Continue processing the data as needed, and use the Output DataSet tile to save the final results.
Troubleshooting
- If your tile is not marked as configured after writing the script, make sure you have provided a resulting schema. See Populate the Resulting Schema above to learn more.
-
To get feedback on the behavior of your script, run a preview. You can run a preview by selecting the
Preview in the toolbar at the top of the canvas. After running a preview, a Console tab appears in the tile editor panel toolbar. This displays the standard out (
stdout) and standard error (stderr) that your script produces. If the preview fails, check this console for errors. - If a script is failing, check that you have properly finished all three of the steps listed in the Get Data In and Out of the Scripting Tile section of this article. Check the variable name that you are writing back to Domo to make sure it is correct.
-
Previews run on only a sample of your DataSet. If your script requires a certain variability in data to function properly, the sample used by the preview might not meet this variability requirement. You can control the size of the preview sample by changing the
Row Limit in the canvas toolbar.
Note: Note that larger sample sizes will result in longer preview run times.
- Make sure that you have explicitly imported the packages/libraries that you need for your script to run. A list of available packages displays in the Package tab.
- If a column is resulting in all null values, it could mean that the column is not actually produced by your script. If a column is provided in the list in the Schema tab, that column is added to the results regardless of whether the script actually produces it or not. If the script does not produce that column, then all of the values for that column are set to null. You can see which of your columns are and are not created by the script by running a preview and then looking at the Schema tab. The status of each column displays to the right of the column data type.
- If your script is failing during runtime with an out of memory error, your input DataSet could be too large or your script could be doing some sort of join that is causing the data to explode. Try filtering the data before it gets injected into the scripting tile and/or refactoring your script to prevent data explosion. For more information on troubleshooting memory errors, see Troubleshooting Scripting Tiles Memory Errors.