> For the complete documentation index, see [llms.txt](https://documentation.immuta.com/2024.2/llms.txt). Markdown versions of documentation pages are available by appending `.md` to page URLs; this page is available as [Markdown](https://documentation.immuta.com/2024.2/data-and-integrations/databricks-spark/how-to-guides/project-udfs.md).

# Configure Project UDFs Cache Settings

This page outlines the configuration for setting up project UDFs, which allow users to set their current project in Immuta through Spark. For details about the specific functions available and how to use them, see the [Use Project UDFs (Databricks) page](/2024.2/secure-your-data/projects-and-purpose-based-access-control/writing-to-projects/reference-guides/project-udfs.md).

{% hint style="info" %}
**Use project UDFs in Databricks**

Currently, caches are not all invalidated outside of Databricks because Immuta caches information pertaining to a user's current project in the NameNode plugin and in Vulcan. Consequently, this feature should only be used in Databricks.
{% endhint %}

## Web Service and On-Cluster Caches

Immuta caches a mapping of user accounts and users' current projects in the Immuta Web Service and on-cluster. When users change their project with UDFs instead of the Immuta UI, Immuta invalidates all the caches on-cluster (so that everything changes immediately) and the cluster submits a request to change the project context to a web worker. Immediately after that request, another call is made to a web worker to refresh the current project.

To allow use of project UDFs in Spark jobs, raise the caching on-cluster and lower the cache timeouts for the Immuta Web Service. Otherwise, caching could cause dissonance among the requests and calls to multiple web workers when users try to change their project contexts.

## Recommended Configuration

### 1 - Lower Web Service Cache Timeout

1. Click the **App Settings** icon in the left sidebar and scroll to the **HDFS Cache Settings** section.
2. Lower the **Cache TTL of HDFS user names (ms)** to **0**.
3. Click **Save**.

### 2 - Raise Cache Timeout On-Cluster

In the Spark environment variables section, set the `IMMUTA_CURRENT_PROJECT_CACHE_TIMEOUT_SECONDS` and `IMMUTA_PROJECT_CACHE_TIMEOUT_SECONDS` to high values (like `10000`).

*Note: These caches will be invalidated on cluster when a user calls `immuta.set_current_project`, so they can effectively be cached permanently on cluster to avoid periodically reaching out to the web service.*

## Blocking UDFs

If your compliance requirements restrict users from changing projects within a session, you can block the use of Immuta's project UDFs on a Databricks Spark cluster. To do so, configure the `immuta.spark.databricks.disabled.udfs` option as described on the [Databricks environment variables page](/2024.2/data-and-integrations/databricks-spark/reference-guides/configuration-settings/configuration.md).


---

# Agent Instructions
This documentation is published with GitBook. GitBook is the documentation platform designed so that both humans and AI agents can read, navigate, and reason over technical content effectively. Learn more at gitbook.com.

## Querying This Documentation
If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter, and the optional `goal` query parameter:

```
GET https://documentation.immuta.com/2024.2/data-and-integrations/databricks-spark/how-to-guides/project-udfs.md?ask=<question>&goal=<endgoal>
```

`ask` is the immediate question: it should be specific, self-contained, and written in natural language.
`goal` is optional and describes the broader end goal you are ultimately trying to accomplish on behalf of the user. GitBook uses it to tailor the answer towards what is most useful for that goal.

The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.