Adjust Identification and Classification Framework Tags

Requirements:

Registered data sources; see the reference page for supported technologies
Immuta permission GOVERNANCE

Immuta provides identifiers out-of-the-box to recognize and tag data. Users can then utilize classification frameworks and build them to apply tags based off those identifier tags and their own catalog tags.

Tune identifiers first to adjust where the tags are applied. Because classification frameworks can apply classification tags from the identification applied tags, tuning identification should come first and will have trickle-down effects on classification. Customizing identification requires some initial work but will automate data tagging for all data sources in the future.

Follow the steps below to tune identification for your data:

Create new identifiers
Add identifiers to a domain
Add data sources to your domain
Run identification for the domain: This will remove the tags from any previous identification runs and re-run identification with your new identifiers. From here, either continue to edit identifiers to reconfigure the applied tags, or you're finished if you are happy with the results.

Assess

After identification has applied entity tags, any active classification frameworks will automatically reapply their tags to account for any changes to tags. It may be necessary to adjust the classification tags based on your organization's data, security, and compliance needs.

After identification runs, you will receive a notification that the job is complete. Then, you can view the results from the data source dictionary.

Navigate to the data source overview page of the data source you added to the framework.
Click the Data Dictionary tab.
Assess whether the tags are applied as expected.
If you are happy with the tags, add the rest of your data sources to the domain and run identification on the rest of your data sources.
If you want additional tags, follow the Create an identifier guide to create identifiers that matter to your data.

Assess your data source tags

Requirement: Immuta permission GOVERNANCE or data owner

Target some data sources to manually review tags:

Navigate to the data dictionary for the data source by opening the Data Sources page and selecting a data source. Click the Data Dictionary tab to open the data dictionary.
The data dictionary lists the data source columns, with details about the name, data type, and a list of the tags on each column. Assess whether the tags are accurate to your data.

If you find that too many tags are applied

Tags may be unexpected but still accurate to your data. Additionally, they may have been applied because they were found to be the best match from the identifiers in the framework.

If you want to improve identification and personalize it to your data, assess why the tag was applied to your data:

Is the identifier incorrectly matching your data and irrelevant to your organization? Delete the identifier that applied the tag from the domain.
Is the identifier incorrectly matching this specific column, but correct in other places? It must have been the most correct match found by identification. Create a better match by completing the following steps:
1. Create an identifier specific to the column with a new tag.
2. Add the identifier to the domain so this column is correctly matched by identification.

If you want to remove the unexpected tags, use one of the following how-to guides:

Remove unnecessary identifiers from the domain.
Ensure the tags are applied properly by adjusting identification.
Remove any excess tags. Note that classification tags build off of other tags, so removing a single classification or identification applied tag can have trickle-down effects on the data source.
Adjust the classification framework rules using the frameworks API.

If you find that tags are missing

If you were expecting some sensitive data to be tagged and it is not, enable additional tags using one of the following how-to guides:

Create classification frameworks relevant to your organization.
Ensure the tags are applied properly by adjusting identification.
Add additional tags. Note that classification tags build off of other tags, so adding a single tag can have trickle-down effects on the data source.
Adjust the classification framework rules using the frameworks API.

Tune your data dictionaries

Requirement: Immuta permissions GOVERNANCE and AUDIT

Tags can be edited on an individual basis for each data source. If broad changes to the classification framework are necessary to re-tag your data, use the frameworks API.

Navigate to the Data Sources page and select the data sources that you assessed and noted issues.
Click the Data Dictionary tab.
Delete unnecessary tags by clicking on the tag you want to remove from the column, and select Disable from the tag side sheet.
To add tags,
1. Click Add Tags in the Actions column.
2. Begin typing the name of the tag you want to add in the Search by Name field and select the tag from the dropdown list.
3. Click Add.

Last updated 5 months ago

Was this helpful?