Getting Started with Databricks Unity Catalog

The how-to guides linked on this page illustrate how to integrate Databricks Unity Catalog with Immuta. See the reference guide for information about the Databricks Unity Catalog integration.

Requirements:

  • Unity Catalog metastore created and attached to a Databricks workspace. Immuta supports configuring a single metastore for each configured integration, and that metastore may be attached to multiple Databricks workspaces.

  • Unity Catalog enabled on your Databricks cluster or SQL warehouse. All SQL warehouses have Unity Catalog enabled if your workspace is attached to a Unity Catalog metastore.

1

Connect your technology

These guides provide instructions on getting your data set up in Immuta.

  1. Register your Databricks Unity Catalog connection: Using a single setup process, connect Databricks Unity Catalog to Immuta. This will register your data objects into Immuta and allow you to start dictating access through global policies.

  2. Organize your data sources into domains and assign domain permissions to accountable teams: Use domains to segment your data and assign responsibilities to the appropriate team members. These domains will then be used in policies, audit, and sensitive data discovery.

Connections are generally available on all 2025.1+ tenants. If you do not have connections enabled on your tenant, configure Databricks Unity Catalog and register data sources using the legacy workflow.

2

Register your users

These guides provide instructions on getting your users set up in Immuta.

  1. Connect an IAM: Bring the IAM your organization already uses and allow Immuta to register your users for you.

  2. Map external user IDs from Databricks to Immuta: Ensure the user IDs in Immuta, Databricks, and your IAM are aligned so that the right policies impact the right users.

3

Add data metadata

These guides provide instructions on getting your data metadata set up in Immuta.

  1. Connect an external catalog: Bring the external catalog your organization already uses and allow Immuta to continually sync your tags with your data sources for you.

  2. Run sensitive data discovery: Sensitive data discovery (SDD) allows you to automate data tagging using identifiers that detect certain data patterns.

4

Start using the Governance app

These guides provide instructions on using the Governance app for the first time.

  1. Author a global subscription policy: Once you add your data metadata to Immuta, you can immediately create policies that utilize your tags and apply to your tables. Subscription policies can be created to dictate access to data sources.

  2. Author a global data policy: Data metadata can also be used to create data policies that apply to data sources as they are registered in Immuta. Data policies dictate what data a user can see once they are granted access to a data source. Using catalog and SDD tags you can create proactive policies, knowing that they will apply to data sources as they are added to Immuta with the automated tagging.

  3. Configure audit: Once you have your data sources and users, and policies granting them access, you can set up audit export. This will export the audit logs from user queries, policy changes, and tagging updates.

Last updated