> For the complete documentation index, see [llms.txt](https://documentation.immuta.com/latest/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://documentation.immuta.com/latest/configuration/integrations/snowflake/reference-guides/warehouse-sizing-recommendations.md).

# Warehouse Sizing Recommendations

The warehouse you select when configuring the Snowflake integration uses compute resources to set up the integration, register data sources, orchestrate policies, and run jobs like identification. Snowflake credit charges are based on the size of and amount of time the warehouse is active, not the number of queries run.

This document prescribes how and when to adjust the size and scale of clusters for your warehouse to manage workloads so that you can use Snowflake compute resources the most cost effectively.

In general, increase the size of and number of clusters for the warehouse to handle heavy workloads and multiple queries. Workloads are typically lighter after data sources are onboarded and policies are established in Immuta, so compute resources can be reduced after those workloads complete.

## Integration and data source registration warehouse use

The Snowflake integration uses warehouse compute resources to sync policies created in Immuta to the Snowflake objects registered as data sources and, if enabled, to run [identification](/latest/configuration/manage-data-metadata/data-discovery.md) and [schema monitoring](/latest/configuration/integrations/registering-metadata/data-sources/schema-monitoring/reference-guides/schema-monitoring.md). Follow the guidelines below to adjust the warehouse size and scale according to your needs.

* Increase the [size](https://docs.snowflake.com/en/user-guide/warehouses-tasks#resizing-a-warehouse) of and [number](https://docs.snowflake.com/en/user-guide/warehouses-multicluster#increasing-or-decreasing-clusters-for-a-multi-cluster-warehouse) of clusters for the warehouse during large policy syncs, updates, and changes.
* Enable [auto-suspend and auto-resume](https://docs.snowflake.com/en/user-guide/warehouses-overview#auto-suspension-and-auto-resumption) to optimize resource use in Snowflake. In the Snowflake UI, the lowest auto suspend time setting is 5 minutes. However, through SQL query, you can set `auto_suspend` to 61 seconds (since the minimum uptime for a warehouse is 60 seconds). For example,

  ```sql
  ALTER WAREHOUSE "WH_NAME" SET WAREHOUSE_SIZE = 'XSMALL' AUTO_SUSPEND = 61 AUTO_RESUME = TRUE MIN_CLUSTER_COUNT = 1 MAX_CLUSTER_COUNT = 2 SCALING_POLICY = 'STANDARD' COMMENT = '';
  ```
* Identification uses compute resources for each table it runs on. Consider [turning off autoscanning for your domains with identifiers and dynamic assignment](/latest/configuration/manage-data-metadata/data-discovery/how-to-guides/manage-sdd-tags.md#configure-a-domains-autoscanning-setting) when registering data sources if you have an [external catalog available](/latest/configuration/manage-data-metadata/catalogs/reference-guides/pre-configuration.md) or a tagging strategy in place.
* Register data before creating global policies. Immuta does not apply a subscription policy on registered data unless an existing global policy applies to it, which allows Immuta to only pull metadata instead of also applying policies when data sources are created. Registering data before policies are created reduces the workload and the Snowflake compute resources needed.
* Begin onboarding with a small dataset of tables, and then review and monitor query performance in the [Snowflake Query Monitor](https://docs.snowflake.com/en/user-guide/ui-snowsight-activity). Adjust the virtual warehouse accordingly to handle heavier loads.
* [Schema monitoring](/latest/configuration/integrations/registering-metadata/data-sources/schema-monitoring.md) uses the compute warehouse that was employed during the initial ingestion to periodically monitor the schema for changes. If you expect a low number of new tables or minimal changes to the table structure, consider scaling down the warehouse size.
* Resize the warehouse after data sources are registered and policies are established. For example,

{% code overflow="wrap" %}

```sql
ALTER WAREHOUSE "INTEGRATION_WH" SET WAREHOUSE_SIZE = 'XSMALL' AUTO_SUSPEND = 120 AUTO_RESUME = TRUE MIN_CLUSTER_COUNT = 1 MAX_CLUSTER_COUNT = 2 SCALING_POLICY = 'STANDARD'; 
```

{% endcode %}

For more details and guidance about warehouse sizing, see the [Snowflake Warehouse Considerations documentation](https://docs.snowflake.com/en/user-guide/warehouses-considerations).

## Identifying bulk jobs and heavy workloads

Even after your integration is configured, data sources are registered, and policies are established, changes to those data sources or policies may initiate heavy workloads. Follow the guidelines below to adjust your warehouse size and scale according to your needs.

* Review your [Snowflake query history](https://docs.snowflake.com/en/user-guide/ui-snowsight-activity) to identify query performance and bottlenecks.
* Check how many credits queries have consumed:

  ```sql
  SELECT h.* FROM "SNOWFLAKE"."ACCOUNT_USAGE"."QUERY_HISTORY" h
  INNER JOIN "SNOWFLAKE"."ACCOUNT_USAGE"."SESSIONS" s
  ON s.session_id = h.session_id
  WHERE GET(parse_json(s.client_environment), 'APPLICATION') = 'IMMUTA' limit 25;
  ```
* After reviewing query performance and cost, implement [strategies above](#integration-and-data-source-registration-warehouse-use) to adjust your warehouse.


---

# Agent Instructions
This documentation is published with GitBook. GitBook is the documentation platform designed so that both humans and AI agents can read, navigate, and reason over technical content effectively. Learn more at gitbook.com.

## Querying This Documentation
If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://documentation.immuta.com/latest/configuration/integrations/snowflake/reference-guides/warehouse-sizing-recommendations.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.