This feature is available only in Enterprise Edition

Databricks plugin

The Databricks Lakehouse Platform

The SlashDB Databricks plugin adds support for Databricks databases, further extending SlashDB's generated REST API.

Installation

SlashDB version 1.5.0 or later must be installed before you can install the plugin. If you don't have SlashDB yet, see the chapter "Installation" for details.

From System Package

If SlashDB was installed from the DEB or RPM package, or if you run SlashDB from a virtual machine or cloud image, the easiest way to add Databricks support is to download and install the plugin package of the same type.

Debian or Ubuntu

wget https://downloads.slashdb.com/versions/1.8.0/slashdb-databricks_1.1.0_amd64.deb
sudo apt-get update
sudo apt-get install -y ./slashdb-databricks_1.1.0_amd64.deb

RedHat or CentOS

wget https://downloads.slashdb.com/versions/1.8.0/slashdb-databricks-1.1.0.x86_64.rpm
sudo dnf install -y ./slashdb-databricks-1.1.0.x86_64.rpm

Once installation is complete, SlashDB restarts automatically so that the changes take effect.

From Python Wheel

If SlashDB was installed as a Python package, it's best to install the Databricks plugin as a Python package in the same environment.

Available wheel packages:

Download and install the package that matches your version of Python, for example:

wget https://downloads.slashdb.com/versions/1.8.0/slashdb_databricks-1.1.0-cp38-cp38-manylinux2014_x86_64.whl
/opt/slashdb/bin/pip install ./slashdb_databricks-1.1.0-cp38-cp38-manylinux2014_x86_64.whl

After installation, restart SlashDB in whatever way your custom setup requires.
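
To see which wheel matches your interpreter, you can print the CPython tag that appears in the wheel file name (`cp38` in the example above). This is a minimal sketch: `python_tag` is a hypothetical helper, and the path `/opt/slashdb/bin/python` is an assumption based on the `pip` path used above.

```shell
# Hypothetical helper: print the CPython tag (for example cp38) of an interpreter.
# The interpreter is passed as the first argument; defaults to python3 on PATH.
python_tag() {
    "${1:-python3}" -c 'import sys; print("cp%d%d" % sys.version_info[:2])'
}

# Assumed usage, if SlashDB's environment lives under /opt/slashdb:
# python_tag /opt/slashdb/bin/python
```

Pick the wheel whose file name contains the printed tag.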

Configuration

License

Uploading a license valid for Databricks

Log in as admin and in the top menu go to Configure > License.

Unless your license allows connections to Databricks databases, you will see a padlock icon, which means you need to upload a valid license.

If you need a license, please contact us at licensing@slashdb.com. If you already have one, see the chapter "License" for instructions on installing it correctly.

When the new license is uploaded, you will see Databricks highlighted in green, which means you can add connections to Databricks data lakes.

Acquiring Databricks credentials

Token

Generate and copy the token as described in the Databricks documentation.

Generate token

Hostname, HTTP Path

Copy the Server Hostname and HTTP Path from the cluster configuration.

Cluster connection details

For details, please see the Databricks documentation.

Database Name

In the Azure workspace sidebar, go to Data > Databases to list the available databases. Leave the field blank to use the default database.

More details can be found in the Databricks documentation.
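
Before entering the credentials in SlashDB, it can help to confirm that the hostname and token are accepted by Databricks. The sketch below is not part of SlashDB: `databricks_token_status` is a hypothetical helper that calls the Databricks REST API endpoint `/api/2.0/clusters/list` with `curl` and prints only the HTTP status code. It assumes the token is a personal access token sent as a bearer token.

```shell
# Hypothetical helper: print the HTTP status returned by Databricks for a token.
# $1 = Server Hostname (without https://), $2 = personal access token
databricks_token_status() {
    curl -s -o /dev/null -w '%{http_code}' \
        -H "Authorization: Bearer $2" \
        "https://$1/api/2.0/clusters/list"
}

# Assumed usage with placeholder values:
# databricks_token_status "<server-hostname>" "<token>"
```

A status of 200 suggests the token is accepted, while 401 or 403 usually points to a wrong, expired, or insufficiently privileged token. This check does not validate the HTTP Path or the database name.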

Add new database configuration to SlashDB

Most steps for adding a connection to Databricks are the same as for other database types, with a few differences.

  1. In Configure > Databases, click the blue "+ New" button in the top right.
  2. From the list of available database connections click on the Databricks logo.
  3. Fill the form with Databricks credentials.
  4. Click "Test Connection" to verify the connection to the server is valid.
  5. Proceed according to the chapter "Adding a new database".
  6. When the reflection process is done, you will be able to browse your data through Data Discovery or SQL Pass-thru.
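
Once the database is connected, its tables should be reachable through Data Discovery URLs. As a rough sketch, the hypothetical helper below builds such a URL, assuming the usual SlashDB pattern `/db/<database ID>/<table>.json`. The host name, database ID, and table name are placeholders, and any authentication your instance requires is not shown.

```shell
# Hypothetical helper: build a Data Discovery URL for one table.
# $1 = SlashDB base URL, $2 = database ID chosen in the form, $3 = table name
slashdb_table_url() {
    printf '%s/db/%s/%s.json\n' "${1%/}" "$2" "$3"
}

# Assumed usage with placeholder values:
# curl -s "$(slashdb_table_url https://slashdb.example.com my_databricks my_table)"
```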

Updating

To update, install a newer version of the plugin package.

Note: After updating the core SlashDB RPM/DEB package, you have to reinstall the plugins.
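
To check whether the plugin is still installed after a core update, and in which version, you can query the system package manager. This is a minimal sketch: `plugin_version` is a hypothetical helper, and the package name `slashdb-databricks` is an assumption inferred from the package file names above.

```shell
# Hypothetical helper: print the installed plugin version, or nothing if absent.
plugin_version() {
    pkg="slashdb-databricks"   # assumed package name, inferred from the file names
    if command -v dpkg-query >/dev/null 2>&1; then
        # Debian or Ubuntu
        v="$(dpkg-query -W -f='${Version}' "$pkg" 2>/dev/null)" && printf '%s\n' "$v"
    elif command -v rpm >/dev/null 2>&1; then
        # RedHat or CentOS
        v="$(rpm -q --queryformat '%{VERSION}' "$pkg" 2>/dev/null)" && printf '%s\n' "$v"
    else
        return 1
    fi
}

# Assumed usage:
# plugin_version || echo "plugin not installed, reinstall it"
```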