1 of 7

Automate Data Access Control Decisions

This section focuses on how to use Immuta to automate decisions that determine whether users should have access to data objects. The image below illustrates where tags, groups and attributes, data identification and classification, and policy sit within your data ecosystem and how they interact to automate access controls.

You will learn about each of these features, how they interact to automate and enforce access controls on your data, and how to implement them to meet your business objectives.

Who is this for?

This guide is intended for users who want to build table access control policies in a scalable manner using Immuta.

Prerequisites

and registered in Immuta

Goals

This use case is the most common across Immuta users. With it, you solve the problem of entitlement of users to data based on metadata about the user (attributes and groups) and metadata about the data (tags).

This use case is unique to Immuta, because rather than coupling an access decision with a role, you are able to instead decouple access decisions from user and data metadata. This is powerful because it means when metadata about the users or metadata about the data changes, the access for a user or set of users may also change - it is a dynamic decision.

Decoupling access decisions from metadata also eliminates the classic problem of role explosion. In a world where policy decisions are coupled to a role, you must manage a new role for every permutation of access, causing an explosion of roles to manage. Instead, with this use case you decouple that logic from the user metadata and data metadata, so real-time, dynamic decisions are possible. Immuta’s method of building policies allows for these many permutations in a clear, concise manner through the decoupling of policy logic from the user and data metadata.

This use case also eliminates the need to have a human in-the-loop for approval to data access. If you can describe clearly why the approver would approve an access request - that can instead be expressed as an Immuta subscription policy, as you’ll see below. Removing humans from this process increases the speed to access data, makes access decisions consistent, and removes error and favoritism.

Want to learn more? Check out this on the methodology.

Table vs column access

Lastly, this use case is primarily focused on automating table grants, termed in Immuta. We recommend you also read the to learn about column masking, which will allow you to enforce more granular controls so that you can open more data. It's common to see users mix the automate data access control decisions use case with the compliantly open more sensitive data for ML and analytics use case, so it's recommended to read both.

Business value

Following this use case will reap huge operational cost savings. You will have to manage far fewer data policies; those policies will be more granular, accurate, and easily authored; and you will be able to prove compliance more easily. Furthermore, instead of an explosion of incomprehensible roles, the users and data will have meaningful metadata matched with clear policies.

Quantified benefits:

75x fewer policy changes required to accomplish the same use cases
70% reduction in dedicated resources to data access management and policy administration
Onboarding new employees two weeks faster and provisioning new data one week faster

Unquantified benefits:

Improved compliance standard and security posture
Enhanced employee satisfaction
Better user experiences

More details on the business value can be found in these reports:

GIGAOM: ABAC vs RBAC: (Immuta is an OT-ABAC approach)
Forrester:

Next steps

Managing User Metadata

This guide describes how to organize and manage user metadata, which is used by Immuta to identify users targeted by policy.

No matter if you choose orchestrated RBAC or ABAC as described in the , you must have metadata on your users. This can be done , via your , or other .

Doing so allows you to build scalable policies that do not reference individual users and instead use metadata about them to target them with access decisions. In Immuta, user metadata is termed user attributes and groups.

Manage User Metadata How-to Guide

Before authoring global subscription policies to automate access controls, user metadata must exist in Immuta so that it can be used in the policy to identify the users that should be granted or revoked access to data.

This how-to guide demonstrates how to manually add groups and attributes or use existing groups in external identity managers to identify users that should be targeted by a subscription policy.

For detailed explanations and examples of how to manage user metadata, see the .

Managing Data Metadata

This guide describes how to organize and manage data metadata, which is used by Immuta to identify data targeted by policy.

Considerations

Now that we’ve enriched facts about our users, let’s focus on the second point on the policy triangle: the data tags.

Just like you need user metadata, you need metadata on your data (tags) in order to decouple policy logic from referencing physical tables or columns. You must choose between the or method of data access:

Orchestrated RBAC method: tag data sources at the table level
ABAC method: tag data at the table and column level

While it is possible to target policies using both table- and column-level tags, for ABAC it’s more common to target column tags because they represent more granularly what is in the table. Just like user metadata needs to be facts about your users, the data metadata must be facts about the data. The tags on your tables should not contain any policy logic.

Fact-based column tags are descriptive (recommended):

Column ssn has column tag social security number
Column f_name has column tag name

Logic-based column tags requires subjective decisions (not recommended):

Column ssn has column tag PII
Column f_name has column tag sensitive

But can't I get policy authoring scalability by tagging things with higher level classifications, like PII, so I can build broader policies? This is what Immuta’s are for.

Entity tags are facts about the contents of individual columns in isolation. Entity tags are what we listed above: social security number, name, date, and data of birth. Entity tags do not attempt to contextualize column contents with neighboring columns' contents. Instead, categorization and classification tags describe the sensitive contents of a table with the context of all its columns, which is what is listed in the logic-based tags above, things like PII, sensitive, and indirect identifier.

For example, under the HIPAA framework a list of procedures a doctor performed is only considered protected health information (PHI) if it can be associated with the identity of patients. Since entity tagging operates on a single column-by-column basis, it can’t reason whether or not a column containing procedure codes merits classification as PHI. Therefore, entity tagging will not tag procedure codes as PHI. But categorization tagging will tag it PHI if it detects patient identity information in the other columns of the table.

Additionally, entity tagging does not indicate how sensitive the data is, but categorization tags carry a sensitivity level, the classification tag. For example, an entity tag may identify a column that contains telephone numbers, but the entity tag alone cannot say that the column is sensitive. A phone number associated with a person may be classified as sensitive, while the publicly-listed phone number of a company might not be considered sensitive.

Contextual tags are really what you should target with policy where possible. This provides a way to create higher level objects for more scalable and generic policy. Rather than building a policy like “allow access to tables with columns tagged person name and phone number,” it would be much easier to build it like “allow access to tables with columns tagged PII.”

In short, you must tag your entities, and then rely on a classification framework (provided by Immuta or customized by you) to provide the higher level context, also as tags. Remember, the owners of the tables (those who created them) can tag the data with facts about what is in the columns without having to understand the higher level implications of those tags (categorization and classification). This allows better separation of duty.

For orchestrated-RBAC, the data tags are no longer facts about your data, they are instead a single variable that determines access. As such, they should be table-level tags (which also improves the amount of processing Immuta must do).

Applying data tags

There are several options for applying data tags:

Identification: This is the most powerful option. Immuta is able to , and you are able to extend what types of entities are discovered to those specific to your business. Identification can run completely within your data platform, with no data leaving at all for Immuta to analyze. Identification is more relevant for the ABAC approach because the tags are facts about the data.
Tags from an external source: You may have already done all the work tagging your data in some external catalog or your own homegrown tool. If so, Immuta can pull those tags in and use them. See the for a list of the supported external catalogs. But remember, just like user metadata, these should represent facts about your data and not policy decisions.

Data tag hierarchy

Just like hierarchy has an impact with user metadata, so can data tag hierarchy. We discussed the matching of user metadata to data metadata in the guide. However, there are even simpler approaches that can leverage data tag hierarchy beyond matching. This will be covered in more detail in the guide, but is important to understand as you think through data tagging.

As a quick example, it is possible to tag your data with Cars and then also tag that same data with more specific tags (in the hierarchy) such as Cars.Nissan.Xterra. Then, when you build policies, you could allow access to tables tagged Cars to administrators, but only those tagged Cars.Nissan.Xterra to suv_inspectors. This will result in two separate policies landing on the same table, and the beauty of Immuta is that it will handle the conflict of those two separate policies. This provides a large amount of scalability because you have to manage far fewer policies.

Imagine if you didn’t have this capability? You would have to include administrators access to every policy you created for the different vehicle makes - and if that policy needed to evolve, such as adding more than administrators to all cars, it would be an enormous effort to make that change. With Immuta, it’s one policy change.

Next steps

Manage Data Metadata How-to Guide

Before authoring global subscription policies to automate access controls, data metadata must exist in Immuta so that it can be used in the policy to identify the data that should be governed.

This how-to guide demonstrates how to manually manage tags, use data identification, or use existing tags in external catalogs to identify data that should be governed by a subscription policy.

For detailed explanations and examples of how to manage data metadata, see the .

Author Policy

Once user and data metadata have been added in Immuta, you can use that metadata to create subscription policies that automate granting or revoking access to users. This guide describes how to author policies for orchestrated RBAC and ABAC models.

Now we are ready to focus on the third point of the triangle for data access decisions: access control policies, and, more specifically, how you can author them with Immuta now that you have the foundation with your data and user metadata in place.

It’s important to understand that you only need a starting point to begin onboarding data access control use cases - you do not have to have every user perfectly associated with metadata for all use cases, nor do you need all data tagged perfectly for all use cases. Start small on a focused access control use case, and grow from there.

The policy authoring approaches are broken into the you should choose between discussed previously.

Author Policy How-to Guide

Authoring global subscription policies to automate access controls involves using the data metadata and user metadata in Immuta to identify the data that should be governed and the users the policy should target.

This how-to guide demonstrates how to author a global subscription policy in Immuta to automat access decisions.

For detailed explanations and examples of how to author subscription policies, see the Author policy guide.

Requirements

Immuta permission: GOVERNANCE global permission, Manage Policies domain permission, or own the data source

Prerequisites

Understand your metadata

How you author policies is dictated by how your user and data metadata is organized to grant access:

: Many variables determine access, and data sources are tagged at the column and table level.
: A

Author a subscription policy

ABAC policy authoring

Determine why someone should be given access to data. For example, let’s say that to have access to Strictly Confidential, you have determined that someone should be
- an employee (not contractor)

Orchestrated RBAC policy authoring

Determine how user metadata and data metadata is organized. What variable determines access?
to target tables. Since orchestrated RBAC is all about one-to-one matching of user metadata to data metadata, use the special functions in the subscription policy builder for managing this:

Next steps

Managing Data Metadata

This guide describes how to organize and manage data metadata, which is used by Immuta to identify data targeted by policy.

Considerations

Now that we’ve enriched facts about our users, let’s focus on the second point on the policy triangle: the data tags.

Orchestrated RBAC method: tag data sources at the table level
ABAC method: tag data at the table and column level

Fact-based column tags are descriptive (recommended):

Column ssn has column tag social security number
Column f_name has column tag name

Logic-based column tags requires subjective decisions (not recommended):

Column ssn has column tag PII
Column f_name has column tag sensitive

But can't I get policy authoring scalability by tagging things with higher level classifications, like PII, so I can build broader policies? This is what Immuta’s are for.

Applying data tags

There are several options for applying data tags:

Identification: This is the most powerful option. Immuta is able to , and you are able to extend what types of entities are discovered to those specific to your business. Identification can run completely within your data platform, with no data leaving at all for Immuta to analyze. Identification is more relevant for the ABAC approach because the tags are facts about the data.
Tags from an external source: You may have already done all the work tagging your data in some external catalog or your own homegrown tool. If so, Immuta can pull those tags in and use them. See the for a list of the supported external catalogs. But remember, just like user metadata, these should represent facts about your data and not policy decisions.

Data tag hierarchy

Next steps

Automate Data Access Control Decisions

You will learn about each of these features, how they interact to automate and enforce access controls on your data, and how to implement them to meet your business objectives.

Who is this for?

This guide is intended for users who want to build table access control policies in a scalable manner using Immuta.

Prerequisites

and registered in Immuta

Goals

Want to learn more? Check out this on the methodology.

Table vs column access

Business value

Quantified benefits:

75x fewer policy changes required to accomplish the same use cases
70% reduction in dedicated resources to data access management and policy administration
Onboarding new employees two weeks faster and provisioning new data one week faster

Unquantified benefits:

Improved compliance standard and security posture
Enhanced employee satisfaction
Better user experiences

More details on the business value can be found in these reports:

GIGAOM: ABAC vs RBAC: (Immuta is an OT-ABAC approach)
Forrester:

Automate Data Access Control Decisions

hashtagWho is this for?

hashtagPrerequisites

hashtagGoals

hashtagTable vs column access

hashtagBusiness value

hashtagNext steps

Managing User Metadata

hashtag

Manage User Metadata How-to Guide

hashtag

Managing Data Metadata

hashtagConsiderations

hashtagApplying data tags

hashtagData tag hierarchy

hashtagNext steps

Manage Data Metadata How-to Guide

hashtag

Author Policy

Author Policy How-to Guide

hashtagRequirements

hashtagPrerequisites

hashtagUnderstand your metadata

hashtagAuthor a subscription policy

hashtagNext steps

Managing Data Metadata

hashtagConsiderations

hashtagApplying data tags

hashtagData tag hierarchy

hashtagNext steps

Automate Data Access Control Decisions

hashtagWho is this for?

hashtagPrerequisites

hashtagGoals

hashtagTable vs column access

hashtagBusiness value

hashtagNext steps

Author Policy How-to Guide

hashtagRequirements

hashtagPrerequisites

hashtagUnderstand your metadata

hashtagAuthor a subscription policy

hashtagNext steps

Managing User Metadata

hashtag

Manage User Metadata How-to Guide

hashtag

hashtagOrchestrated RBAC: user attributes and groups

hashtagABAC: user attributes and groups

hashtagApplying user metadata

hashtagNext steps

hashtagPrerequisite

hashtagSelect your metadata strategy

hashtagOrganize your user metadata

hashtagAdd user metadata to Immuta

hashtagNext steps

Author Policy

Manage Data Metadata How-to Guide

hashtag

hashtagPath 2: ABAC policy authoring

hashtagPolicy #1: Employee Access

hashtagPolicy #2: Country Access

hashtagPolicy #3: Legal Team Access

hashtagManual overrides of grant subscription policies

hashtagNext steps

hashtagPrerequisites

hashtagSelect your strategy

hashtagOrganize your data metadata

hashtagEnable schema monitoring

hashtagApply tags to data in Immuta

hashtagNext steps

Who is this for?

Prerequisites

Goals

Table vs column access

Business value

Next steps

Considerations

Applying data tags

Data tag hierarchy

Next steps

Requirements

Prerequisites

Understand your metadata

Author a subscription policy

Next steps

Considerations

Applying data tags

Data tag hierarchy

Next steps

Who is this for?

Prerequisites

Goals

Table vs column access

Business value

Next steps

Requirements

Prerequisites

Understand your metadata

Author a subscription policy

Next steps

Orchestrated RBAC: user attributes and groups

ABAC: user attributes and groups

Applying user metadata

Next steps

Prerequisite

Select your metadata strategy

Organize your user metadata

Add user metadata to Immuta

Next steps

Path 2: ABAC policy authoring

Policy #1: Employee Access

Policy #2: Country Access

Policy #3: Legal Team Access

Manual overrides of grant subscription policies

Next steps

Prerequisites

Select your strategy

Organize your data metadata

Enable schema monitoring

Apply tags to data in Immuta

Next steps