Earlier documentation referred to identifiers as classifiers. The term is being changed to identifier to be more accurate and to avoid confusion with the Immuta data classification and frameworks feature. The API paths and payload fields shown on this page still use classifier.
Create a template
Generate your API key on the API Keys tab on your profile page and save the API key somewhere secure. You will include this API key in the authorization header when you make a request to the Immuta API.
Find identifiers to include in your template using one of these methods:
Immuta CLI
immuta api 'sdd/classifier?sortField=name&sortOrder=asc&limit=25&searchText=IDENTIFIER'
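When the search is scripted, the query string can be assembled programmatically instead of typed by hand. The following is a minimal sketch that rebuilds the path from the command above with Python's standard library. The helper name `identifier_search_path` is hypothetical, and it assumes only the query parameters that appear in that command.

```python
from urllib.parse import urlencode


def identifier_search_path(search_text: str, limit: int = 25) -> str:
    """Build the sdd/classifier search path used in the CLI command above.

    Hypothetical helper: it only assembles a string and never calls Immuta.
    """
    params = {
        "sortField": "name",
        "sortOrder": "asc",
        "limit": limit,
        "searchText": search_text,
    }
    # urlencode escapes special characters, so search text with spaces is safe.
    return "sdd/classifier?" + urlencode(params)


print(identifier_search_path("IDENTIFIER"))
```

The resulting string contains `?` and `&`, so wrap it in quotes when passing it to a shell command.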
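Create a payload for your template and save it to a file, such as the example-payload.json file used in the create request below. The following example payloads each contain one identifier:
{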
"name": "EMPLOYEE_DESK_LOCATION_TEMPLATE",
"displayName": "Employee Desk Location Template",
"description": "This template contains the identifier that detects when the name of the room an employee's desk is in appears in a dataset.",
"classifiers": [
{
"name": "EMPLOYEE_DESK_LOCATION_IDENTIFIER"
}
],
"sampleSize": 100
}
{
"name": "SOCIAL_SECURITY_NUMBERS_TEMPLATE",
"displayName": "Social Security Numbers Template",
"description": "This template contains the identifier that matches social security number column names with the defined regex.",
"classifiers": [
{
"name": "SOCIAL_SECURITY_NUMBER_COLUMNS_IDENTIFIER"
}
],
"sampleSize": 100
}
{
"name": "STUDENT_LOCATION_TEMPLATE",
"displayName": "Student Location Template",
"description": "This template contains the identifier that detects when a student's residence hall, floor, or room appears in a dataset.",
"classifiers": [
{
"name": "STUDENT_LOCATION_IDENTIFIER"
}
],
"sampleSize": 100
}
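The example payloads share one shape, so a payload file can also be generated from a few values. This sketch assumes only the fields shown in the examples above; the function name `build_template_payload` is hypothetical, and the output file name matches the one used by the create command.

```python
import json
from pathlib import Path


def build_template_payload(name, display_name, description,
                           identifier_names, sample_size=100):
    """Return a dict shaped like the example template payloads above."""
    return {
        "name": name,
        "displayName": display_name,
        "description": description,
        # The payload field is still called "classifiers";
        # each entry names one identifier.
        "classifiers": [{"name": n} for n in identifier_names],
        "sampleSize": sample_size,
    }


payload = build_template_payload(
    "EMPLOYEE_DESK_LOCATION_TEMPLATE",
    "Employee Desk Location Template",
    "Detects when the name of the room an employee's desk is in appears in a dataset.",
    ["EMPLOYEE_DESK_LOCATION_IDENTIFIER"],
)

# Write the payload to the file that the create command reads.
Path("example-payload.json").write_text(json.dumps(payload, indent=2))
```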
Create the template:
Immuta CLI
immuta api sdd/template -X POST --input ./example-payload.json
If the request is successful, you will receive a response that contains details about the template. The example responses below show what is returned for different templates.
After the template is applied to data sources and sensitive data discovery is run, the Discovered.account-number tag will be applied to columns that Immuta identifies with 50% confidence, as configured in the identifier.
{
"name": "EMPLOYEE_DESK_LOCATION_TEMPLATE",
"displayName": "Employee Desk Location Template",
"description": "This template contains the identifier that detects when the name of the room an employee's desk is in appears in a dataset.",
"sampleSize": 100,
"createdBy": {
"id": 1,
"name": "John",
"email": "john@example.com"
},
"id": 1,
"createdAt": "2021-10-21T18:03:58.967Z",
"updatedAt": "2021-10-21T18:03:58.967Z",
"classifiers": [{
"name": "EMPLOYEE_DESK_LOCATION_IDENTIFIER",
"overrides": {}
}]
}
After the template is applied to data sources and sensitive data discovery is run, the Discovered.desk-location tag will be applied to columns when Immuta detects the values Research Lab, Blue Room or Purple Room with 60% confidence, as configured in the identifier.
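To make the confidence threshold concrete, the sketch below treats confidence as the fraction of sampled values that appear in the identifier's list of values, and applies the tag when that fraction reaches 60%. This definition is an illustrative assumption, not Immuta's documented scoring method. The function name and the case-insensitive comparison are also hypothetical.

```python
# Values named in the description above, lowercased for comparison.
DESK_LOCATIONS = {"research lab", "blue room", "purple room"}


def match_confidence(sampled_values, dictionary=DESK_LOCATIONS):
    """Fraction of sampled values found in the dictionary (assumed definition)."""
    if not sampled_values:
        return 0.0
    hits = sum(1 for v in sampled_values if v.strip().lower() in dictionary)
    return hits / len(sampled_values)


sample = ["Research Lab", "Blue Room", "Purple Room", "Cafeteria", "Blue Room"]
confidence = match_confidence(sample)

# 4 of 5 values match, so the assumed 0.6 threshold is met.
print(confidence, confidence >= 0.6)
```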
{
"name": "SOCIAL_SECURITY_NUMBERS_TEMPLATE",
"displayName": "Social Security Numbers Template",
"description": "This template contains the identifier that matches social security number column names with the defined regex.",
"sampleSize": 100,
"createdBy": {
"id": 1,
"name": "John",
"email": "john@example.com"
},
"id": 2,
"createdAt": "2021-10-21T19:12:22.092Z",
"updatedAt": "2021-10-21T19:12:22.092Z",
"classifiers": [
{
"name": "SOCIAL_SECURITY_NUMBER_COLUMNS_IDENTIFIER",
"overrides": {}
}
]
}
After the template is applied to data sources and sensitive data discovery is run, the Discovered.social-security-number tag will be applied to columns whose names match the ssn|social ?security regex, such as ssn, socialsecurity, or social security.
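The behavior of this pattern can be checked locally before it is used in an identifier. The sketch below runs the regex quoted above against a few column names using Python's `re` module. It assumes the pattern may match anywhere in the name and is case-sensitive; how Immuta anchors or normalizes column names is not stated here.

```python
import re

# Pattern quoted in the text above.
SSN_COLUMN_PATTERN = re.compile(r"ssn|social ?security")


def matches_ssn_column(column_name: str) -> bool:
    """True if the pattern occurs anywhere in the column name (assumed semantics)."""
    return SSN_COLUMN_PATTERN.search(column_name) is not None


for name in ["ssn", "socialsecurity", "social security", "social_security", "zip_code"]:
    print(name, matches_ssn_column(name))
```

Under these assumptions, social_security does not match, because ` ?` allows only an optional space between the two words.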
{
"name": "STUDENT_LOCATION_TEMPLATE",
"displayName": "Student Location Template",
"description": "This template contains the identifier that detects when a student's residence hall, floor, or room appears in a dataset.",
"sampleSize": 100,
"createdBy": {
"id": 1,
"name": "John",
"email": "john@example.com"
},
"id": 1,
"createdAt": "2021-10-21T18:03:58.967Z",
"updatedAt": "2021-10-21T18:03:58.967Z",
"classifiers": [{
"name": "STUDENT_LOCATION_IDENTIFIER",
"overrides": {}
}]
}
After the template is applied to data sources and sensitive data discovery is run, the Discovered.residence-hall tag will be applied to columns when Immuta detects values that match those listed in the Residence Halls data source with 70% confidence, as configured in the identifier.
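A script that creates templates may want to confirm which identifiers ended up in the result. The following sketch parses an abbreviated copy of one example response above and lists its identifiers. The function name `summarize_template` is hypothetical, and only fields visible in the example responses are assumed.

```python
import json

# Abbreviated copy of one example response above.
response_text = """
{
  "name": "STUDENT_LOCATION_TEMPLATE",
  "sampleSize": 100,
  "id": 1,
  "classifiers": [{"name": "STUDENT_LOCATION_IDENTIFIER", "overrides": {}}]
}
"""


def summarize_template(response: dict) -> dict:
    """Collect the template name, sample size, and identifier names."""
    identifiers = response["classifiers"]
    return {
        "name": response["name"],
        "sampleSize": response["sampleSize"],
        "identifiers": [c["name"] for c in identifiers],
        # Identifiers whose "overrides" object is non-empty.
        "overridden": [c["name"] for c in identifiers if c.get("overrides")],
    }


summary = summarize_template(json.loads(response_text))
print(summary)
```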
Apply a template to data sources
Attributes of all custom identifiers and templates are provided on the Sensitive data discovery API page. However, attributes specific to this section are outlined in the table below.
Attribute | Description
template | string. The name of the template to apply to the data sources; null clears the current template.
sources | string. The names of the data sources to apply the template to.
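The two attributes above can be combined into a request payload as sketched below. This is a hedged example: it assumes that sources is sent as a JSON array of data source names, which the table does not confirm, and "Example Data Source" is a placeholder name. Passing `None` for the template serializes to null, which clears the current template.

```python
import json


def build_apply_payload(template_name, source_names):
    """Build a payload from the two attributes described above.

    Assumption: "sources" is a JSON array of data source names.
    A template_name of None becomes null, which clears the template.
    """
    return {"template": template_name, "sources": list(source_names)}


# "Example Data Source" is a placeholder, not a real data source.
apply_payload = build_apply_payload("EMPLOYEE_DESK_LOCATION_TEMPLATE", ["Example Data Source"])
clear_payload = build_apply_payload(None, ["Example Data Source"])

print(json.dumps(apply_payload))
print(json.dumps(clear_payload))
```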
You can now modify the template, such as changing the identifiers (classifiers) included and the sampleSize.
Configure entity tags and confidence
To prevent an entity tag from being applied, you can create a template that configures the identifier that contains that tag.
For example, the built-in PERSON_NAME identifier contains the following tags: Discovered.PHI, Discovered.PII, Discovered.Entity.Person Name, and Discovered.Identifier Indirect. However, your organization doesn't have any health data, so you don't want the PHI tag to be applied to your data sources but you do want all the other tags within that identifier.
To override the Discovered.PHI tag, you would create a template that includes the PERSON_NAME identifier and removes the Discovered.PHI tag from the list of tags in the template payload.
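The tag removal described above amounts to filtering a list. The sketch below builds a classifiers entry for PERSON_NAME that keeps every tag except Discovered.PHI. It assumes that the overrides object accepts a "tags" list, which is not confirmed in this section, so check the field name against the Sensitive data discovery API page. The function name is hypothetical.

```python
# Tags listed above for the built-in PERSON_NAME identifier.
PERSON_NAME_TAGS = [
    "Discovered.PHI",
    "Discovered.PII",
    "Discovered.Entity.Person Name",
    "Discovered.Identifier Indirect",
]


def identifier_without_tags(identifier_name, all_tags, unwanted):
    """Return a classifiers entry that keeps every tag except the unwanted ones.

    Assumption: "overrides" accepts a "tags" list.
    """
    kept = [t for t in all_tags if t not in unwanted]
    return {"name": identifier_name, "overrides": {"tags": kept}}


entry = identifier_without_tags("PERSON_NAME", PERSON_NAME_TAGS, {"Discovered.PHI"})
print(entry)
```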
View the details about the PERSON_NAME identifier so you know what to include in your template using one of these methods:
Immuta CLI
immuta api 'sdd/classifier?sortField=name&sortOrder=asc&limit=25&searchText=PERSON_NAME'