Databricks - Unity Catalog
Databricks is a cloud-based data analytics platform that integrates data engineering, data science, and machine learning tools to enable seamless processing, analysis, and exploration of large datasets, offering services like Databricks Delta Lake and Databricks Runtime. Unity Catalog is a unified governance solution build on top of Databricks.
The connector is available here and supports
- Synchronizing databrick users to an identity store in Raito Cloud.
- Synchronizing Databricks Unity Catalog meta data (data structure, known permissions, …) to a data source in Raito Cloud.
- Synchronizing Databricks Unity Catalog grants from an to Raito.
- Synchronize the data usage information to Raito Cloud.
Prerequisites
Unity Catalog
Databricks Unity Catalog should be enabled on the account and workspaces, as this is essential to the Raito Databricks plugin.
Authentication
Currently, we support OAuth authentication (recommended), as well as, basic authentication by email and password. The associated account should be admin in the Databricks account and on all workspaces. There are no required permissions within the Databricks Unity catalog.
OAuth
Authentication using OAuth, requires a valid client_id
and client_secret
.
The client_id
and client_secret
should be provided in the databricks-client-id
and databricks-client-secret
parameter respectively.
If using a service principal, the service principal should be admin of the account and all workspaces. On top of that, he should be (included) as owner to all metastores.
Basic Authentication
To authenticate by email and password, email and password can be provided in the databricks-user
and databricks-password
parameters respectively.
We recommend to use basic authentication only for testing purposes.
Databricks-specific CLI parameters
To see all parameters, type
$> raito info raito-io/cli-plugin-databricks
in a terminal window.
Currently, the following configuration parameters are available:
- databricks-account-id (mandatory): The ID of your Databricks account.
- databricks-client-id: The client ID of the databricks account used within the plugin, to authenticate by using oauth.
- databricks-client-secret: The client secret of the databricks account used within, to authenticate by using oauth.
- databricks-user: The email-address of the user that should be used within the plugin, to authenticate by using basic authentication.
- databricks-password:: The password of the user that should be used within the plugin, to authenticate by using basic authentication.