If you want to disable the metadata collection that requires sampling data, you must
These steps will ensure that Immuta queries no data, under any circumstances. Without this sample data, some Immuta features will be unavailable. Sensitive data discovery (SDD) cannot be used to automatically detect sensitive data in your data sources, and the following masking policies will not work:
Masking with format preserving masking
Masking with k-anonymization
Masking using randomized response
To stop Immuta from running fingerprints on all data sources,
Navigate to the App Settings page, and scroll to the Advanced Configuration section.
Enter the following YAML:
Click Save.
To stop Immuta from running data source health checks on all data sources,
Navigate to the App Settings page, and scroll to the Advanced Configuration section.
Enter the following YAML:
Click Save.
Tag each data source with the seeded Skip Stats Job
tag to stop Immuta from collecting a sample and running table stats on the sample. You can tag data sources as you create them in the UI or via the Immuta API.
Note that data sources automatically skip the stats job upon registration, without the Skip Stats Job
tag, as long as there are no active policies requiring them. The following policies require stats:
Column masking with randomized response
Column masking with format preserving masking
Column masking with k-anonymization
Column masking with rounding
Column masking with reversibility
Row minimization