# Migrate From Legacy to Native SDD

This guide provides information and best practices for migrating from the deprecated legacy sensitive data discovery (SDD) option to the improved native SDD. This guide is for users who have already enabled SDD on their tenant and have Discovered tags on their data sources.

## Before you begin

### Native vs legacy SDD

Legacy SDD is deprecated and will be removed and replaced by native SDD. Native SDD significantly improves on legacy SDD at discovering and tagging your data, with upgrades to the built-in patterns. Its greatest benefit is that it respects data residency: native SDD doesn't move any of your data when it runs. Discovery happens in your data platform, and the platform returns only the matching patterns and column names to Immuta.

See the [Sensitive data discovery reference page](/2024.2/discover-your-data/data-discovery.md) for more information on native SDD.

### Requirements

* Native SDD requires Snowflake, Databricks, Redshift, or Starburst (Trino) data sources
* Legacy SDD enabled on your tenant
* Legacy SDD tags applied to your data sources: To find out if you have legacy SDD tags applied, create a governance report as described in the [understand the context of your tags section](#understand-the-context-of-your-tags).

## Enable native SDD

Contact your Immuta representative to enable native SDD on your Immuta tenant. Note that unless specifically disabled, all Immuta installations after the 2024.2 LTS have native SDD automatically enabled. To check for yourself whether native SDD is already running and tagging your data before you contact your representative, proceed to [understand the context of your tags](#understand-the-context-of-your-tags).

This action will not change anything immediately on your tenant; however, anytime SDD runs in the future, it will be native SDD instead of the legacy version.

To assess native SDD for your data, proceed with the steps below. If you do not review native SDD, the legacy SDD tags will all remain on your data source columns. However, when [SDD automatically runs](/2024.2/discover-your-data/architecture.md#frequency) on new data sources and columns, it will apply native SDD tags, and because of the improvements to SDD, it may tag different data than legacy SDD.

## Understand the context of your tags

**Requirement**: Immuta permission `GOVERNANCE`

1. [Manually run SDD globally](/2024.2/discover-your-data/data-discovery/how-to-guides/enable-sdd.md#run-sdd-on-all-data-sources) to run native SDD on your data sources.
2. To check the tags on an individual data source, navigate to the data source data dictionary and select a Discovered tag. On the tag side sheet, you can determine the context of the tag. When patterns match data, native SDD will apply tags, and their tag context will be `Sensitive Data Discovery`. Any tags with the context `Legacy Sensitive Data Discovery` were not matched by native SDD but will remain on the data source.
3. To check your tags globally, navigate to the governance reports page and build a report for sensitive data discovery. This report will present the legacy tags on your data sources' columns and native SDD tags that are also on those columns. Use this report to assess the context of the Discovered tags and understand if native SDD is matching the data you want it to.

These actions will allow you to understand the differences between how native SDD and legacy SDD tag your data and whether your data is recognized as expected by native SDD or if legacy SDD was over-tagging your data. This way you can better tune SDD to your data.

If there are any legacy SDD tags that you want native SDD to catch, you need to tune native SDD so that this type of data is discovered in future tables and columns; see guidance on that in the next section.
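The comparison described above can also be done outside the UI if you have the report rows in tabular form. The following is a minimal sketch, not an Immuta API: it assumes a hypothetical CSV export with the columns `data_source`, `column`, `tag`, and `context`, and the tag names in the sample are made up for illustration. Only the two context values come from the tag side sheet described above.

```python
import csv
import io
from collections import defaultdict

# Tag contexts as shown on the tag side sheet.
LEGACY = "Legacy Sensitive Data Discovery"
NATIVE = "Sensitive Data Discovery"


def legacy_only_tags(rows):
    """Return {(data_source, column): [tags]} for tags that carry only the legacy context.

    `rows` is an iterable of dicts with the hypothetical keys
    data_source, column, tag, and context.
    """
    by_column = defaultdict(lambda: {LEGACY: set(), NATIVE: set()})
    for row in rows:
        context = row["context"]
        if context in (LEGACY, NATIVE):
            key = (row["data_source"], row["column"])
            by_column[key][context].add(row["tag"])

    gaps = {}
    for key, tags in by_column.items():
        # Tags legacy SDD applied that native SDD did not apply to the same column.
        missing = tags[LEGACY] - tags[NATIVE]
        if missing:
            gaps[key] = sorted(missing)
    return gaps


# Hypothetical export; the column names and tag names are assumptions.
SAMPLE_REPORT = """data_source,column,tag,context
customers,email,Discovered.Example.Email,Legacy Sensitive Data Discovery
customers,email,Discovered.Example.Email,Sensitive Data Discovery
customers,notes,Discovered.Example.Phone,Legacy Sensitive Data Discovery
orders,account_ref,Discovered.Example.Account,Legacy Sensitive Data Discovery
orders,account_ref,Discovered.Example.Email,Sensitive Data Discovery
"""

rows = list(csv.DictReader(io.StringIO(SAMPLE_REPORT)))
gaps = legacy_only_tags(rows)
for (data_source, column), tags in sorted(gaps.items()):
    print(f"{data_source}.{column}: legacy-only tags {tags}")
```

Each entry in the result is a candidate for review: either legacy SDD was over-tagging the column, or native SDD needs a new pattern and rule to catch that data.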

## Tune SDD

**Requirement**: Immuta permission `GOVERNANCE`

Using the report you built above, complete these actions to tune SDD:

1. Focus on a legacy SDD tag that was properly applied to your data. Assess whether the native SDD tag on the same column was applied more accurately than the legacy tag. If the native tag is missing or was applied incorrectly, proceed to the next step.
2. [Create a new regex or dictionary pattern](/2024.2/discover-your-data/data-discovery/how-to-guides/manage-patterns.md#create-a-pattern) to discover this data. Ensure it is specific and will match your data with 90% confidence.
3. [Create a new rule](/2024.2/discover-your-data/data-discovery/how-to-guides/manage-rules.md#create-a-rule) in your framework using the new pattern and the Discovered tag you want applied to the data.
4. Complete the steps above for all legacy SDD tags.
5. Retest your updated rules and patterns by [re-running SDD on the selected data sources](/2024.2/discover-your-data/data-discovery/how-to-guides/manage-sdd-tags.md#run-sdd-on-a-data-source) and continue refining to the level of accuracy you want.

Completing the actions above will bring what native SDD tags in the future into parity with what legacy SDD tagged on your data.
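Before creating a regex pattern in Immuta, it can help to try the expression against a handful of known values locally. The sketch below is illustrative only: it assumes that "90% confidence" can be approximated as at least 90% of non-empty sample values matching the expression in full, and it uses Python's `re` module, whose syntax and sampling behavior may differ from how SDD evaluates patterns in your data platform. The `EMP-` identifier format and all sample values are hypothetical.

```python
import re


def match_ratio(pattern, values):
    """Return the fraction of non-empty values that the pattern matches in full."""
    compiled = re.compile(pattern)
    non_empty = [v for v in values if v]
    if not non_empty:
        return 0.0
    hits = sum(1 for v in non_empty if compiled.fullmatch(v))
    return hits / len(non_empty)


def meets_threshold(pattern, values, threshold=0.9):
    """Check whether the match ratio reaches the threshold (assumed to be 0.9 here)."""
    return match_ratio(pattern, values) >= threshold


# Hypothetical pattern for an internal employee ID such as EMP-000123.
PATTERN = r"EMP-\d{6}"

# Nine well-formed IDs and one placeholder value.
SAMPLE_VALUES = [f"EMP-{n:06d}" for n in range(1, 10)] + ["n/a"]

# Values from an unrelated column; a specific pattern should not match these.
UNRELATED_VALUES = ["2024-01-15", "ORD-000123", "alice@example.com"]

print("target column ratio:", match_ratio(PATTERN, SAMPLE_VALUES))
print("meets threshold:", meets_threshold(PATTERN, SAMPLE_VALUES))
print("unrelated column ratio:", match_ratio(PATTERN, UNRELATED_VALUES))
```

Checking the expression against unrelated values as well as the target values is a quick way to confirm the pattern is specific before re-running SDD on the data source.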

