This feature is available only in Enterprise Edition
Databricks plugin
The Databricks Lakehouse Platform
The SlashDB Databricks plugin adds support for Databricks databases, further extending SlashDB's generated REST API.
Installation
SlashDB version >= 1.5.0 must be installed before the plugin can be installed. If you don't have SlashDB yet, see the chapter "Installation" for details.
From System Package
If SlashDB was installed from the DEB or RPM package, or if you have SlashDB running from a virtual machine or cloud images, then the easiest way to add Databricks support is by downloading and installing the corresponding plugin package of the same type.
Debian or Ubuntu
wget https://downloads.slashdb.com/versions/1.8.90/slashdb-databricks_1.2.1_amd64.deb
sudo apt-get update
sudo apt-get install -y ./slashdb-databricks_1.2.1_amd64.deb
RedHat or CentOS
wget https://downloads.slashdb.com/versions/1.8.90/slashdb-databricks-1.2.1.x86_64.rpm
sudo dnf install -y ./slashdb-databricks-1.2.1.x86_64.rpm
Once installation is complete SlashDB will automatically be restarted for changes to take the effect.
From Python Wheel
If SlashDB was installed as a Python package then it's best to also install the Databricks plugin as a Python package in the same environment.
Available wheel packages:
Download and install the package that matches your version of Python, for example:
wget https://downloads.slashdb.com/versions/1.8.90/slashdb_databricks-1.2.1-cp38-cp38-manylinux2014_x86_64.whl
/opt/slashdb/bin/pip install ./slashdb_databricks-1.2.1-cp38-cp38-manylinux2014_x86_64.whl
After installation SlashDB has to be restarted accordingly to the custom setup.
Configuration
License
Uploading a license valid for Databricks
Log in as admin and in the top menu go to Configure > License.
Unless your license allows connecting to Databricks databases you will see a padlock icon which means you need to upload a valid license.
If you need a license please contact us at licensing@slashdb.com, if you already have a license please see how to correctly install it at the chapter "License".
When the new license is uploaded you will see Databricks highlighted in green which means you will be able to add connections to Databricks data lakes.
Acquiring Databricks credentials
Token
Generate and copy the token as described in the databricks docs
Hostname, Http Path
Copy Server Hostname and HTTP Path from the cluster configuration
For details please see the Databricks documentation
Database Name
In the azure workspace, in the sidebar go to data > databases to list them. Or keep blank to use the default.
More details can be found in the Databricks documentation
Add new database configuration to SlashDB
Most steps for adding a connection to Databricks are the same as for other database types with a few differences mixed in.
- Inside of Configuration -> Databases click the blue button in the top right that says "+ New".
- From the list of available database connections click on the Databricks logo.
- Fill the form with Databricks credentials.
- Click "Test Connection" to verify the connection to the server is valid.
- Proceed according to the chapter "Adding a new database".
- When the reflection process is done you will be able to browse your data through Data Discovery or SQL Pass-thru
Updating
To update install a newer version of the package.
Note After updating the core slashdb rpm/deb package you have to reinstall plugins