> For the complete documentation index, see [llms.txt](https://documentation.immuta.com/2025.1/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://documentation.immuta.com/2025.1/configuration/integrations/snowflake/reference-guides/overview.md).

# Snowflake Lineage Tag Propagation

{% hint style="info" %}
**Private preview**: This feature is available to select accounts. Contact your Immuta representative to enable this feature.
{% endhint %}

Snowflake column lineage specifies how data flows from source tables or columns to the target tables in write operations. When Snowflake lineage tag propagation is enabled in Immuta, Immuta automatically applies tags added to a Snowflake table to its descendant data source columns in Immuta so you can build policies using those tags to restrict access to sensitive data.

Snowflake Access History tracks user read and write operations. Snowflake column lineage extends this Access History to specify how data flows from source columns to the target columns in write operations, allowing data stewards to understand how sensitive data moves from ancestor tables to target tables so that they can

* trace data back to its source to validate the integrity of dashboards and reports,
* identify who performed write operations to meet compliance requirements,
* evaluate data quality and pinpoint points of failure, and
* tag sensitive data on source tables without having tag columns on their descendant tables.

However, tagging sensitive data doesn’t innately protect that data in Snowflake; users need Immuta to disseminate these lineage tags automatically to descendant tables registered in Immuta so data stewards can build policies using the semantic and business context captured by those tags to restrict access to sensitive data. When Snowflake lineage tag propagation is enabled, Immuta propagates tags applied to a data source to its descendant data source columns in Immuta, which keeps your data inventory in Immuta up-to-date and allows you to protect your data with policies without having to manually tag every new Snowflake data source you register in Immuta.

## Data flow

1. An application administrator enables the feature on the Immuta app settings page.
2. Snowflake lineage metadata (column names and tags) for the Snowflake tables is stored in the metadata database.
3. A data owner creates a new data source (or adds a new column to a Snowflake table) that initiates a job that applies all tags for each column from its ancestor columns.
4. A data owner or governor adds a tag to a column in Immuta that has descendants, which initiates a job that propagates the tag to all descendants.
5. An audit record is created that includes which tags were applied and from which columns those tags originated.

## Snowflake access history view and Immuta lineage job

The Snowflake Account Usage `ACCESS_HISTORY` view contains column lineage information.

To appropriately propagate tags to descendant data sources, Immuta fetches Access History metadata to determine what column tags have been updated, stores this metadata in the Immuta metadata database, and then applies those tags to relevant descendant columns of tables registered in Immuta.

Consider the following example using the Customer, Customer 2, and Customer 3 tables that were all registered in Immuta as data sources.

* Customer: source table
* Customer 2: descendant of Customer
* Customer 3: descendant of Customer 2

If the `Discovered.Electronic Mail Address` tag is added to the Customer data source in Immuta, that tag will propagate through lineage to the Customer 2 and Customer 3 data sources.

## Data source registration

After an application administrator has enabled Snowflake lineage tag propagation, data owners can register data in Immuta and have tags in Snowflake propagated from ancestor tables to descendant data sources. Whenever new tags are added to those tables in Immuta, those upstream tags will propagate to descendant data sources.

By default, all tags are propagated, but these tags can be filtered on the app settings page or using the Immuta API.

## Managing tags

Lineage tag propagation works with any tag added to the data dictionary. Tags can be manually added, synced from an external catalog, or discovered by identification.

Consider the following example using the tables that were all registered in Immuta as data sources:

| Data source    | Parent table       | Tag applied                          | Application type           |
| -------------- | ------------------ | ------------------------------------ | -------------------------- |
| **Customer**   | None, source table | `Discovered.Electronic Mail Address` | Manually applied           |
| **Customer 2** | **Customer**       | `Discovered.Electronic Mail Address` | Propagated through lineage |
| **Customer 3** | **Customer 2**     | `Discovered.Electronic Mail Address` | Propagated through lineage |

Immuta added the `Discovered.Electronic Mail Address` tag to the **Customer** data source, and that tag propagated through lineage to the **Customer 2** and **Customer 3** data sources.

### Deleting tags

When a tag is deleted, downstream lineage tags are removed, unless another parent data source still has that tag. The tag remains visible, but it will not be re-added if a future propagation event specifies the same tag again. *Immuta prevents you from removing Snowflake object tags from data sources. You can only remove Immuta-managed tags.* *To remove Snowflake object tags from tables, you must remove them in Snowflake.*

Removing the `Discovered.Electronic Mail Address` tag from the **Customer 2** table soft deletes it from the **Customer 2** data source. However the `Discovered.Electronic Mail Address` tag still applies to the **Customer 3** data source because **Customer** still has the tag applied.

| Data source    | Parent table       | Tag applied                          | Application type           |
| -------------- | ------------------ | ------------------------------------ | -------------------------- |
| **Customer**   | None, source table | `Discovered.Electronic Mail Address` | Manually applied           |
| **Customer 2** | **Customer**       | None, manually removed               | Manually removed           |
| **Customer 3** | **Customer 2**     | `Discovered.Electronic Mail Address` | Propagated through lineage |

The only way a tag will be removed from descendant data sources is if no other ancestor of the descendant still prescribes the tag.

If the Snowflake lineage tag propagation feature is disabled, tags will remain on Immuta data sources.

## Identification

[Identification](/2025.1/configuration/manage-data-metadata/data-discovery.md) will still run on data sources and can be manually triggered. Tags applied through identification will propagate as tags added through lineage to descendant Immuta data sources.

## Snowflake lineage audit

Immuta audit records include Snowflake lineage tag events when a tag is added or removed.

The example audit record below illustrates the `SNOWFLAKE_TAGS.pii` tag successfully propagating from the Customer table to Customer 2:

```json
{
  "id": "c8e020cb-232c-4ba9-a0d8-f3a84ba6808d",
  "dateTime": "1670355170336",
  "month": 1475,
  "profileId": 1,
  "userId": "immuta_system_account",
  "dataSourceId": 2,
  "dataSourceName": "Customer 2",
  "count": 1,
  "recordType": "nativeLineageDataSourceTagUpdate",
  "success": true,
  "component": "dataSource",
  "extra": {
    "sourceColumn": {
      "nativeColumnName": "\"MY_DATABASE\".\"PUBLIC\".\"CUSTOMER\".\"C_FIRST_NAME\"",
      "dataSourceId": 1,
      "columnName": "c_first_name"
    },
    "dataSourceId": 2,
    "columnName": "c_first_name",
    "tagPropagationDirection": "downstream",
    "tags": [
      {
        "name": "SNOWFLAKE_TAGS.pii",
        "source": "immuta-us-east-1"
      }
    ]
  },
  "newAuditServiceFields": {
    "actorIp": null,
    "sessionId": null
  },
  "createdAt": "2022-12-06T19:32:50.372Z",
  "updatedAt": "2022-12-06T19:32:50.372Z"
}
```

## Limitations

* Without `tableFilter` set, Immuta will ingest lineage for every table on the Snowflake instance.
* Tag propagation based on lineage is not retroactive. For example, if you add a table, add tags to that table, and then run the lineage ingestion job, tags will not get propagated. However, if you add a table, run the lineage ingestion job, and then add tags to the table, the tags will get propagated.
* The lineage job needs to pull in lineage data before any tag is applied in Immuta. When Immuta gets new lineage information from Snowflake, Immuta does not update existing tags in Immuta.
* There can be up to a 3-hour delay in Snowflake for a lineage event to make it into the `ACCESS_HISTORY` view.
* Immuta does not ingest lineage information for views.
* Snowflake only captures lineage events for `CTAS`, `CLONE`, `MERGE`, and `INSERT` write operations. Snowflake does not capture lineage events for `DROP`, `RENAME`, `ADD`, or `SWAP`. Instead of using these latter operations, you need to recreate a table with the same name if you need to make changes.
* Immuta cannot enforce coherence of your Snowflake lineage. If a column, table, or schema in the middle of the lineage graph gets dropped, Immuta will not do anything unless a table with that same name gets recreated. This means a table that gets dropped but not recreated could live in Immuta’s system indefinitely.


---

# Agent Instructions
This documentation is published with GitBook. GitBook is the documentation platform designed so that both humans and AI agents can read, navigate, and reason over technical content effectively. Learn more at gitbook.com.

## Querying This Documentation
If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter, and the optional `goal` query parameter:

```
GET https://documentation.immuta.com/2025.1/configuration/integrations/snowflake/reference-guides/overview.md?ask=<question>&goal=<endgoal>
```

`ask` is the immediate question: it should be specific, self-contained, and written in natural language.
`goal` is optional and describes the broader end goal you are ultimately trying to accomplish on behalf of the user. GitBook uses it to tailor the answer towards what is most useful for that goal.

The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.