Subscribe to and Manage Data Sources
Data Source API reference guide
This page describes the dataSource
endpoint, through which users can subscribe to data sources, make unmasking requests, and manage data source tasks. To create data sources, see the specific handler endpoints.
Additional fields may be included in some responses you receive; however, these attributes are for internal purposes and are therefore undocumented.
Data source workflow
Search data sources and data source details
GET
Get parent and child relationship records for derived data sources using a specified data source ID.
GET
Get all users with the provided access level for this data source.
GET
Retrieves the visibilities, masking information, and filters that the passed in user has access to in the specified data source.
GET
Retrieves a summary of total records, total visibilities, and visibilities a given user has access to.
GET
Retrieves a summary of total records, total visibilities, and visibilities the current user has access to for a specified data source.
Search for data sources
GET /dataSource
Search for data sources.
Query parameters
blobHandlerType
array[string]
Describes the type of underlying blob handler that will be used with this data source (e.g., Custom, MS SQL).
No
subscription
array[string]
The requesting user's subscription status: pending
, owner
, subscribed
, not_subscribed
, expert
, or ingest
.
No
status
array[string]
The data source status: passed
or failed
.
No
tag
array[string]
Filters data sources by tags associated with the data sources.
No
searchText
string
Searches for data source names using the provided string
.
No
column
array[string]
Searches for data source column names.
No
connectionString
array[string]
Searches by connection string.
No
schema
string
Searches for data source schema.
No
nameOnly
boolean
When true
, searchText
will only search data source names. Default is false
.
No
idOnly
boolean
When true
, only returns the ID Of the data source and the user's subscription status.
No
dataSourceIds
array[integer]
Searches for the provided data source IDs.
No
selectFields
array[string]
This field accepts the values id
, name
, and columnEvolutionEnabled
. When id
or name
are provided, the request will return only the ID or name of the data source and the subscription status. If columnEvolutionEnabled
is provided, the response will also include information about the policies, policy conflicts, and workspaces associated with the data sources.
No
offset
integer
Used in combination with size
to fetch pages. Default is 0
.
No
size
integer
The number of results to return per page. Default is 10
.
No
sortField
string
Used to sort results by field, which must be createdAt
, name
, blobHandlerType
, subscriptionStatus
, recordCount
, status
, policy
, or editable
.
No
sortOrder
string
Sorts results by order, which must be asc
or desc
.
No
excludedProjects
array[integer]
Filter out any data sources that belong to the specified projects.
No
ephemeral
boolean
When true
, returns ephemeral data sources.
No
clusterName
string
The name of the remote cluster the data source is connected to.
No
mode
integer
Specifies the query mode, which must be 0
(FULL
), 1
(COUNT
), 4
(TAG
), 5
(MIN_MAX
), or 6
(STATUS
).
No
globalPolicy
string
Filter by data sources that have this Global Policy applied.
No
hostname
string
Searches data sources by hostname.
No
determinePolicyContflicts
boolean
When true
, filters results to return the data sources with policy conflicts.
No
collectionId
string
Filter by data sources in the domain with this ID.
No
collectionExclude
string
When true
, filters results to return the data sources in the domain of the provided ID.
No
sddTemplateName
string
Filter by data sources that have the specified SDD identification framework applied.
No
excludeSddTemplateName
string
Filter by data sources that do not have the specified SDD identification framework applied.
No
Response schema
name
string
Data source name.
id
integer
Data source ID.
deleted
boolean
If true
the data source is a deleted data source.
description
string
The data source description.
createdAt
timestamp
The date and time the data source was created.
subscriptionPolicy
array
Details the type of Subscription Policy applied to the data source.
schemaEvolutionId
integer
The schema evolution ID.
recordCount
integer
The record count.
status
array[string]
Accepted statuses are passed
or failed
.
subscriptionStatus
array[string]
Accepted statuses are subscribed
or unsubscribed
.
blobHandlerType
array[string]
Describes the type of underlying blob handler of this data source (e.g., Custom, MS SQL).
subscriptionType
string
The type of subscription policy on the project. The type can be automatic
(which allows anyone to subscribe), approval
(which requires the subscriber to be manually approved), policy
(which only allows users with specific groups or attributes to subscribe), or manual
(which requires users to be manually added).
connectionString
string
The connection string information.
sqlSchemaName
string
The schema name.
policy
string
When this value is none
, there are no data policies applied to the data source. Otherwise, this field indicates whether or not there are policy conflicts among the data policies applied to the data source.
policyHandlerType
string
The policy handler type, such as None
or Builder
.
Request example
The following request returns 2 data sources.
Response example
Get a data source by ID
GET
/dataSource/{dataSourceId}
Get a data source based on the ID.
Query parameters
dataSourceId
integer
The data source ID .
Yes
Response Schema
name
string
The data source name.
recordFormat
string
The data format of blobs in the data source, such as json
, xml
, html
, or jpeg
.
description
string
The description of the data source.
policyHandler
array
The ID of the policy handler and details about the data policies enforced on the data source.
sqlSchemaName
string
A string that represents this data source's schema name in Immuta.
sqlTableName
string
The SQL table name in Immuta.
blobHandler
array[object]
A list of full URLs providing the locations of all blob store handlers to use with this data source.
blobHandlerType
string
Describes the type of underlying blob handler that will be used with this data source (e.g., MS SQL
).
createdBy
integer
The ID of the profile creating the data source.
deleted
boolean
If true, the data source was deleted.
type
string
The data source type, such as queryable
or ingested
.
rowCount
integer
The number of rows.
documentation
string
Documentation associated with the data source.
id
integer
The data source ID.
policyHandlerType
string
The type of policy handler applied to the data source: Builder
.
subscriptionType
string
The type of subscription policy on the data source. The type can be automatic
(which allows anyone to subscribe), approval
(which requires the subscriber to be manually approved), policy
(which only allows users with specific groups or attributes to subscribe), or manual
(which requires users to be manually added).
subscriptionPolicy
array
Details about the Subscription Policy applied to the data source.
globalPolicies
string
Details about the Global Policies applied to the data source.
status
string
The data source health status.
Request example
The following request gets a data source based on the ID 22
.
Response example
Get data source by name
GET
/dataSource/name/{dataSourceName}
Get a data source based on the name.
Query parameters
dataSourceName
string
The data source name.
Yes
Response schema
name
string
The data source name.
recordFormat
string
The data format of blobs in the data source, such as json
, xml
, html
, or jpeg
.
description
string
The description of the data source.
policyHandler
array
The ID of the policy handler and details about the data policies enforced on the data source.
sqlSchemaName
string
A string that represents this data source's schema name in Immuta.
sqlTableName
string
The SQL table name in Immuta.
blobHandler
array[object]
A list of full URLs providing the locations of all blob store handlers to use with this data source.
blobHandlerType
string
Describes the type of underlying blob handler that will be used with this data source (e.g., MS SQL
).
createdBy
integer
The ID of the profile creating the data source.
deleted
boolean
If true, the data source was deleted.
type
string
The data source type, such as queryable
or ingested
.
rowCount
integer
The number of rows.
documentation
string
Documentation associated with the data source.
id
integer
The data source ID.
policyHandlerType
string
The type of policy handler applied to the data source: Builder
.
subscriptionType
string
The type of subscription policy on the data source. The type can be automatic
(which allows anyone to subscribe), approval
(which requires the subscriber to be manually approved), policy
(which only allows users with specific groups or attributes to subscribe), or manual
(which requires users to be manually added).
subscriptionPolicy
array
Details about the Subscription Policy applied to the data source.
globalPolicies
string
Details about the Global Policies applied to the data source.
status
string
The data source health status.
Request example
The following request gets a data source based on the name Public Barfoo
.
Response example
Get a data source by the short name
GET
/dataSource/sqlTableName/{shortName}
Get a data source based on the SQL table name.
Query parameters
shortName
string
The data source SQL table name.
Yes
Response schema
name
string
The data source name.
recordFormat
string
The data format of blobs in the data source, such as json
, xml
, html
, or jpeg
.
description
string
The description of the data source.
policyHandler
array
The ID of the policy handler and details about the data policies enforced on the data source.
sqlSchemaName
string
A string that represents this data source's schema name in Immuta.
sqlTableName
string
The SQL table name in Immuta.
blobHandler
array[object]
A list of full URLs providing the locations of all blob store handlers to use with this data source.
blobHandlerType
string
Describes the type of underlying blob handler that will be used with this data source (e.g., MS SQL
).
createdBy
integer
The ID of the profile creating the data source.
deleted
boolean
If true, the data source was deleted.
type
string
The data source type, such as queryable
or ingested
.
rowCount
integer
The number of rows.
documentation
string
Documentation associated with the data source.
id
integer
The data source ID.
policyHandlerType
string
The type of policy handler applied to the data source: Builder
.
subscriptionType
string
The type of subscription policy on the data source. The type can be automatic
(which allows anyone to subscribe), approval
(which requires the subscriber to be manually approved), policy
(which only allows users with specific groups or attributes to subscribe), or manual
(which requires users to be manually added).
subscriptionPolicy
array
Details about the Subscription Policy applied to the data source.
globalPolicies
string
Details about the Global Policies applied to the data source.
status
string
The data source health status.
Request example
The following request gets a data source based on the SQL table name customer_data
.
Response example
Get data source relationships
GET
/dataSource/{dataSourceId}/lineage/{type}
Get parent and child relationship records for derived data sources using a specified data source ID.
Query parameters
type
string
The type of lineage records to return. Options include: parents
, children
, and all
.
Yes
dataSourceId
integer
The target data source ID.
Yes
Response schema
children
array
Details of the child data source, including dataSourceId
, dataSourceName
, projectId
, policyHandlerDiff
, deleted
, createdBy
, and createdAt
.
parents
array
Details of the parent data source, including dataSourceId
, dataSourceName
, projectId
, policyHandlerDiff
, deleted
, createdBy
, and createdAt
.
Request example
The following request gets the parent relationship records for the derived data source with the data source ID 4
.
Response example
Retrieve a Blob
GET
/dataSource/{dataSourceId}/blob/{blobid*}
Retrieve a blob.
Query parameters
dataSourceId
integer
The data source ID.
Yes
blobId
string
The blob ID.
Yes
Response schema
The response will download the blobs in a file you specify.
Request example
The following request retrieves a blob.
Response example
Get users by access level
GET
/dataSource/{dataSourceId}/access
Get all users with the provided access level for this data source.
Query parameters
dataSourceId
integer
The data source ID.
Yes
states
Array[string]
The status levels to include when querying for user access.
No
approved
boolean
Denotes whether the returned access objects should be approved.
No
searchText
string
A string used to filter returned users. The query is executed with a wildcard prefix and suffix.
No
size
integer
The number of results to return.
No
offset
integer
The number of results to skip (for paging).
No
sortField
string
The field on which to sort the result set.
No
sortOrder
string
The order in which to sort the results.
No
expandGroups
boolean
If true will return individual members of any group subscribed.
No
ignoreSystemGenerated
boolean
If true, will not return system generated accounts.
No
filterBySchemaEvolution
boolean
If true, will only return users who have the specified level of access across ALL data sources within the same schema evolution group as this one.
No
Response schema
count
integer
The number of users with access to the data source.
users
string
The metadata regarding the users with access to the data source.
Request example
The following request gets all users with the provided access level for this data source.
Response example
Get user access info for a data source
GET
/dataSource/{dataSourceId}/users/{profileId}/policyInfo
Retrieves the visibilities, masking information, and filters that the passed in user has access to in the specified data source.
Query parameters
dataSourceId
integer
The data source ID.
Yes
profileId
integer
The profile ID of the user.
Yes
projectId
integer
The project ID. If provided, this project will be used when evaluating the user's visibilities.
No
Response schema
visibilities
array
Details of the user's visibilities, including anyKey
.
visibilityRuleApplies
boolean
If true
, a visibility rule exists and the user is not excepted from it.
masked
array
Masking information for the data source, including metadata
, name
, type
, and actionType
.
additionalFilters
array
Policy information for the data source, including customWhere
, differentialPrivacy
, eventTimeColumn
, minimization
, time
, filterSeconds
, and isOlderOrNewer
.
Request example
The following request gets the visibility information for the user with the profile ID 2
on the data source with the data source ID 16
.
Response example
Get user visibility info for a data source
GET
/dataSource/{dataSourceId}/users/{profileId}/visibilityReport
Retrieves a summary of total records, total visibilities (the unique values contained in a column protected by a row-level security policy that allow Immuta to determine whether or not a user can see a given row if they possess an attribute that matches the visibility of that row), and visibilities a given user has access to.
Query parameters
dataSourceId
integer
The data source ID.
Yes
profileId
integer
The profile ID of the user.
Yes
informationOnly
boolean
If true
, the query will just return information for the UI and will skip running some queries for ephemeral data sources.
No
includeNestedColumns
boolean
If true
, the query will return just information for the dictionary page, including the masking policies for nested columns.
No
Response schema
noVisibilities
boolean
If true
, the data source has no row-level security or purpose-based restriction policies applied to it.
dataSourceVisibilitiesCount
integer
The total number of possible visibilities the given data source has.
userVisibilitiesCount
integer
The number of visibilities the current user can see for the given data source.
masked
array
Masking information for the data source, including metadata
, name
, type
, and actionType
.
dataSource
integer
The data source ID.
dataSourceName
string
The data source name.
additionalFilters
array
Policy information for the data source, including customWhere
, differentialPrivacy
, eventTimeColumn
, minimization
, time
, filterSeconds
, and isOlderOrNewer
.
allowMaskedJoins
boolean
If true
the data source allows masked joins.
policySet
array
Details about the policies on the data source.
Request example
The following request gets all of the visibility information for the user with the profile ID 2
on the data source with the data source ID 16
.
Response example
Get current user visibility info
GET
/dataSource/{dataSourceId}/visibilityReport
Retrieves a summary of total records, total visibilities (the unique values contained in a column protected by a row-level security policy that allow Immuta to determine whether or not a user can see a given row if they possess an attribute that matches the visibility of that row), and visibilities the current user has access to for a specified data source.
Query parameters
dataSourceId
integer
The data source ID.
Yes
Response schema
noVisibilities
boolean
If true
, the data source has no row-level security or purpose-based restriction policies applied to it.
dataSourceVisibilitiesCount
integer
The total number of possible visibilities the given data source has.
userVisibilitiesCount
integer
The number of visibilities the current user can see for the given data source.
denialReason
string
Reason the user was denied visibility.
masked
array
Masking information for the data source, including metadata
, name
, type
, and actionType
.
dataSource
integer
The data source ID.
dataSourceName
string
The data source name.
additionalFilters
array
Policy information for the data source, including customWhere
, differentialPrivacy
, eventTimeColumn
, minimization
, time
, filterSeconds
, and isOlderOrNewer
.
allowMaskedJoins
boolean
If true
the data source allows masked joins.
policySet
array
Details about the policies on the data source.
Request example
The following request gets all of the visibility information for the current user on the data source with the data source ID 16
.
Response example
Access data sources and make data source requests
Subscribe to a data source
POST
/dataSource/subscribe
Subscribe to a data source.
Query parameters
dataSourceId
integer
Data source ID number.
Yes
Payload parameters
dataSourceIds
array
The ID of the data source the user is subscribing to.
No
approvals
array
Includes details about the Subscription policy on the data source: requiredPermissions
, specificApproverRequired
, specificApprover
, and ownerModelId
.
No
Response schema
body
array
Contains details about the data source, including the data source ID, subscription status of the user, the profile ID of the user, and the dates the data source was created and updated.
Request example
The following request subscribes to the data source with ID 22
.
Payload example
Response example
Request to unmask values
POST
/dataSource/{dataSourceId}/reverseMask
Makes a request for values to be unmasked.
Query parameters
dataSourceId
integer
The data source ID.
Yes
Payload parameters
column
string
The column to unmask.
Yes
unmaskingReason
string
The reason the values need to be unmasked.
Yes
unmaskingUsers
array[integer]
The profile ID of the users who can unmask the values for the requestor.
Yes
projectId
integer
The ID of the associated project.
No
dataSourceId
integer
The data source ID.
No
Response schema
id
integer
The ID of the request.
requestingUserProfile
integer
The requesting user profile ID.
dataSourceId
integer
The data source ID.
reason
string
The reason for the unmasking request.
metadata
string
Metadata regarding the masking, such as the column
, values
, and maskingConfig
.
type
string
The type of request.
state
string
The state of the task, such as pending
.
createdAt
timestamp
The date and time the task was created.
updatedAt
timestamp
The date and time the task was updated.
Request example
The following requests for values to be unmasked.
Request payload example
Response example
Add a user to a data source
POST
/dataSource/{dataSourceId}/access
Add a user to a specific data source. Requestors cannot add themselves to a data source. To request access to a data source, use the /dataSource/subscribe
endpoint.
Query parameters
dataSourceId
integer
The data source ID.
Yes
Payload parameters
state
string
The status of the user: subscribed
, owner,
expert
, or ingest
.
Yes
profileId
integer
The profile ID of the user being added to the data source.
Yes
groupId
integer
The ID of the group being added to the data source.
No
approvals
array
Details about the user approving access: requiredPermission
, specificApproverRequired
, and specificApprover
.
No
expiration
date
The date the user's data source subscription ends.
No
Response schema
id
integer
The user's subscription ID.
modelId
integer
The model ID.
modelType
string
The model type.
state
string
The user's data source role, such as subscribed
.
denialReasoning
string
If the user was denied access, the reason for denial.
profile
integer
The user's profile ID.
group
integer
If a group was added, the group ID.
expiration
date
The date the user's subscription to the data source will expire.
acknowledgeRequired
boolean
If the data source is associated with a project, this value will be true
if the user needs to confirm they have read the project acknowledgment.
createdAt
timestamp
The date and time of creation.
updatedAt
timestamp
The date and time of update.
approved
boolean
When true
, the user's request has been approved.
Request example
The following request adds a user (saved in example-payload.json
) to this data source.
Request payload example
Response example
Manage data source requests
GET
Get all pending tasks for this user and pending tasks this user has created.
GET
Returns all tasks the user has made/can approve or deny for this data source.
Get pending tasks by user
GET
/dataSource/tasks/pending
Get all pending tasks for this user and pending tasks this user has created.
Query parameters
searchText
string
If specified, will filter results using the specified string.
No
searchModel
string
Will filter the results by model type: dataSource
or schemaEvolution
.
No
offset
integer
The number of results to skip (for paging).
No
size
integer
The number of results to return per page.
No
schemaEvolutionConnectionString
string
The schema evolution connection string to filter by.
No
countBySchemaEvolution
boolean
Iftrue
, will only return the number of tasks, grouped by schema evolution.
No
countByDataSource
boolean
Iftrue
, will only return the number of tasks, grouped by data source.
No
countOnly
boolean
When true
, will only return a count of the pending tasks.
No
groupByDataSource
boolean
If true
, will return the results as an array of { dataSourceId: , rows: }
.
No
types
Array[string]
Filters the results by the type of task: unmask
, dataSourceCreated
, columnAdded
, columnDeleted
, or columnTypeChanged
.
No
Response schema
outgoing
array
Includes details of the tasks or requests created by the user, such as the count
, type
, and targetEmails
.
incoming
array
Includes details about the tasks received by the user, such as the count
, type
, and targetEmails
.
Request example
The following request gets all pending tasks for a user and pending tasks the user has created.
Response example
Mark tasks as complete
GET
/dataSource/tasks/{taskId}
Handles the given task and marks it as complete.
Query parameters
taskId
integer
The task ID.
Yes
Response schema
result
array
Includes details about the task.
Request example
The following request handles the given task and marks it as complete.
Response example
Return tasks for a data source
GET
/dataSource/{dataSourceId}/tasks
Returns all tasks the user has made/can approve or deny for this data source.
Query parameters
dataSourceId
integer
The data source ID.
Yes
states
Array[string]
The state of the tasks: pending
or completed
.
No
targetProfileId
integer
Only returns tasks where the target user has this profile ID.
No
requestingUserProfileId
integer
Only returns tasks where the requesting user has this profile ID.
No
profileId
integer
Returns tasks where either the target or requesting user has this profile ID.
No
searchText
string
A string used to filter returned users. The query is executed with a wildcard prefix and suffix.
No
searchModel
string
A string used to determine how results should be filtered using searchText.
No
types
Array[string]
The type of task: unmask
, dataSourceCreated
, columnAdded
, columnDeleted
, or columnTypeChanged
.
No
size
integer
The number of results to return.
No
offset
integer
The number of results to skip (for paging).
No
sortField
string
The field by which to sort the result set.
No
sortOrder
string
The order in which to sort the results. The default is desc
.
No
countOnly
boolean
If true
, will only return the number of tasks.
No
Response schema
hits
array
Includes details about each task, such as the id
, state
, type
, and requestor
.
count
integer
The total number of tasks.
Request example
The following request returns all tasks the user has made/can approve or deny for this data source.
Response example
Change user status
PUT
/dataSource/{dataSourceId}/access/{subscriptionId}
Change user status for a specific data source. Requestors cannot update their own status for a data source.
Query parameters
dataSourceId
Integer
The data source ID.
Yes
subscriptionId
Integer
The data source member's subscription ID.
Yes
Payload parameters
state
string
The new status for the user: denied
, subscribed
, owner
, expert
or ingest
.
Yes
Response schema
id
integer
The data source member's subscription ID.
modelId
integer
The model ID.
modelType
array
The model type (i.e., datasource
).
state
array
The current state of the user's role: denied
, subscribed
, owner
, expert
, or ingest
.
profile
integer
The profile ID.
group
integer
If a group's status is being updated, this is the group ID.
expiration
timestamp
The date the user will no longer have access to the data source.
acknowledgeRequired
boolean
This attribute is specific to projects. When true
the user needs to confirm they have read the project acknowledgement statement.
createdAt
timestamp
The date and time created.
updatedAt
timestamp
The date and time updated.
originalState
array
The user's previous status for the data source.
approved
boolean
If true
, the status is approved.
Request example
The following request changes the user status to subscribed
for the specified data source.
Payload example
Response example
Update data sources
POST
Save blob metadata to Immuta and store raw content in local blob store.
PUT
Trigger the schema monitoring job for the specified detection group, or all groups if no ID is given.
Update a data source
PUT
/dataSource/{dataSourceId}
Update a data source.
Query parameters
dataSourceId
integer
The data source ID.
Yes
Payload parameters
blobHandler
array[object]
A list of full URLs providing the locations of all blob store handlers to use with this data source.
No
blobHandlerType
string
Describes the type of underlying blob handler that will be used with this data source (e.g., MS SQL
).
No
recordFormat
string
The data format of blobs in the data source, such as json
, xml
, html
, or jpeg
.
No
type
string
The type of data source: ingested
(metadata will exist in Immuta) or queryable
(metadata is dynamically queried).
No
name
string
The name of the data source. It must be unique within the Immuta tenant.
No
sqlTableName
string
A string that represents this data source's table in Immuta.
No
organization
string
The organization that owns the data source.
No
category
string
The category of the data source.
No
description
string
The description of the data source.
No
hasExamples
boolean
When true
, the data source contains examples.
No
Response schema
private
boolean
When false
, the data source will be publicly available in the Immuta UI.
blobHandler
array[object]
A list of full URLs providing the locations of all blob store handlers to use with this data source.
blobHandlerType
string
Describes the type of underlying blob handler that will be used with this data source (e.g., MS SQL
).
recordFormat
string
The data format of blobs in the data source, such as json
, xml
, html
, or jpeg
.
type
string
The type of data source: ingested
(metadata will exist in Immuta) or queryable
(metadata is dynamically queried).
name
string
The name of the data source. It must be unique within the Immuta tenant.
sqlTableName
string
A string that represents this data source's table in Immuta.
organization
string
The organization that owns the data source.
description
string
The description of the data source.
policyHandler
array
The ID of the policy handler and details about the data policies enforced on the data source.
subscriptionPolicy
array
Details about the subscription policy enforced on the data source, including the type of policy and exceptions.
Request example
The following request updates the data source's documentation (saved in example-payload.json
).
Request payload example
Response example
Update multiple data sources
PUT
/dataSource/bulk/{type}
Update data sources.
Query parameters
type
string
The action to perform on the data sources: add-users
, disable
, restore
, delete
, or tags
.
Yes
Payload parameters
ids
array[integer
] The IDs of the data sources to update.
Yes
update
array[object]
Only required for add-users
(includes metadata about the users' profiles
: id
and state
) and tags
(includes metadata about the tags
: name
and source
) types.
No
Response schema
bulkId
string
The ID of the bulk data source update.
jobsCreated
integer
The number of jobs created.
Request example
The following request adds the Address.email
tag to two data sources.
Payload example
Response example
Refresh native views
POST
/dataSource/bulkRefreshViews
Refresh native views.
Payload parameters
dataSourceIds
array[integer]
The IDs of the data sources of the native views to update.
Yes
Request example
The following request with the payload below refreshes the view for the data source with the ID 202.
Payload example
Save blob metadata to Immuta
POST
/dataSource/{dataSourceId}/blobs
Save blob metadata to Immuta.
Query parameters
dataSourceId
integer
The data source ID.
Yes
Payload parameters
blobId
string
The unique ID used to identify this blob within its data source.
Yes
file
string
The binary file to add to the data source.
Yes
filename
string
The name that will display in the filesystem.
No
tags
array[string]
Tags to apply to the blob.
No
date
data
A date that corresponds to a date within the record itself.
No
filesize
integer
The size of the file in bytes.
No
Response schema
blobsWithoutIds
integer
The number of blobs added without IDs.
blobsInError
array
The blobs that were not added because of an error.
blobsInserted
array
The blobs added to the data source.
tags
array[string]
Tags applied to the blobs.
Request example
The following request saves blob metadata to Immuta.
Payload example
Response example
Store blob metadata locally
POST
/dataSource/{dataSourceId}/persistBlob
Save blob metadata to Immuta and store raw content in local blob store.
Query parameters
dataSourceId
integer
The data source ID.
Yes
Payload parameters
blobId
string
The unique ID used to identify this blob within its data source.
Yes
file
string
The binary file to add to the data source.
Yes
filename
string
The name that will display in the filesystem.
No
tags
array[string]
Tags to apply to the blob.
No
date
data
A date that corresponds to a date within the record itself.
No
filesize
integer
The size of the file in bytes.
No
Response schema
blobsWithoutIds
integer
The number of blobs added without IDs.
blobsInError
array
The blobs that were not added because of an error.
blobsInserted
array
The blobs added to the data source.
tags
array[string]
Tags applied to the blobs.
Request example
The following request saves blob metadata to Immuta and stores raw content in local blob stores.
Payload example
Response example
Trigger schema monitoring jobs
PUT
/dataSource/detectRemoteChanges
Trigger the schema monitoring job for the specified detection group, or all groups if no payload parameters are given.
Payload parameters
dataSourceIds
array[integer]
The data source IDs to run the column detection job on. Leave empty to run this job globally on all data sources. This parameter cannot be included in the payload if schemaEvolutionId or any combination of hostname, database, port, or table is included.
No
hostname
string
The hostname of the data sources. This parameter cannot be included in the payload if dataSourceIds or schemaEvolutionId is included.
No
port
integer
The port used to connect the data sources to Immuta. This parameter cannot be included in the payload if dataSourceIds or schemaEvolutionId is included.
No
database
string
The database name. This runs schema monitoring on the database provided. If data sources were initially registered via the V2 API, including this parameter will locate new schemas that contain tables Immuta has the ability to access, and Immuta will create a new schema project associated with these newly discovered schemas and create data sources for each table located. If data sources were initially registered via the V1 API, including this parameter will only update the columns and tables of registered schema and tables of the specified database; it will not register any new schemas. This parameter cannot be included in the payload if dataSourceIds or schemaEvolutionId is included.
No
table
string
The table name. This will run column detection to just update the columns in this table. This parameter cannot be included in the payload if dataSourceIds or schemaEvolutionId is included.
No
schemaEvolutionId
integer
The ID of the schema to run the schema monitoring job on. This will run on all tables associated with the specified ID. The schema ID can be found in the response body of /dataSource/{dataSourceId}
. This parameter cannot be included in the payload if dataSourceIds or any combination of hostname, database, port, or table is included.
No
skipColumnDetection
boolean
When true
, Immuta will only pull new tables from the source server. This parameter can only be paired with schemaEvolutionId.
No
overrides.httpPath
string
If Databricks ephemeral overrides are configured, provide the alternative HTTP path to trigger schema monitoring on that ephemeral cluster.
No
Response schema
schemaDetection
object
Includes details about the resulting schema detection jobs.
columnDetection
object
Includes details about the resulting column detection jobs.
bulkId
string
The unique identifier of the jobs running schema monitoring and column detection.
Responses may include bulkId
, schemaDetection
, or columnDetection
objects, depending on the payload.
Request example
The following request triggers the schema monitoring job for the specified detection group.
Payload example
The tabs below illustrate payloads for triggering schema monitoring on a host, database, or table. The request will run schema monitoring for all databases registered under the hostname provided in the payload.
The request will run schema monitoring for all databases registered under the hostname provided in the payload.
Response examples
The tabs below illustrate the example response for each example payload provided above.
View and Review Data Sources
GET
Get all of the recent policy activities for a given data source.
GET
Return the users who can unmask the given column.
Run a data source health check
GET
/dataSource/{dataSourceId}/test
Run a health check on the data source.
Query parameters
dataSourceId
integer
The data source ID.
Yes
Response schema
blob
object
Indicates whether or not the blob was successfully crawled.
columnEvolution
object
Indicates whether or not the job run to check for columns added or removed from the data source passed
and when it was last run.
externalCatalog
object
Indicates whether or not the external catalog was successfully linked to the data source.
fingerprint
object
Indicates whether or not the fingerprint job was successful (passed
) and when it was last run. The fingerprint captures summary statistics of the data source.
framework
object
Indicates whether or not the classification was successfully run on the data source to determine its sensitivity.
globalPolicy
object
Indicates whether or not global policies were successfully applied to the data source.
highCardinality
object
Indicates whether or not the job run to calculate the data source's high cardinality column passed
and when it was last run.
schemaEvolution
object
Indicates whether or not the job run to check if a new table had been added in the remote database passed
and when it was last run. If a new table was added, Immuta automatically creates a new data source. Correspondingly, if a remote table is removed, that data source will be disabled in the console.
sdd
object
Indicates whether or not sensitive data discovery was successfully run on the data source.
sql
object
Indicates whether or not the SQL query run to check the data source's health passed
and when it was last run.
stats
object
Indicates whether or not the job run to calculate the number of rows in the data source passed
and when it was last run.
Request example
The following request tests a data source.
Response example
Retrieve blob handlers
GET
/dataSource/blobHandlerTypes
Retrieve all blob handlers the current user is allowed to create.
Response schema
name
string
The name of the blob handler.
baseUrl
string
The base URL for the data source.
config
array
Includes information about the connection configuration.
port
integer
The port number.
driver
string
The name of the driver.
Request example
The following request retrieves all blob handlers the current user is allowed to create.
Response example
Get data sources by purpose
GET
/dataSource/byPurposes
Get data sources that match a set of purposes.
Query parameters
purposes
array[string]
The purposes to filter the data sources by.
Yes
excludedProjects
array[integer]
Excludes data sources associated with specified project IDs.
No
Response schema
id
integer
The data source ID.
name
array
The name of the data source.
policyId
integer
The policy ID.
restrictions
array
Details regarding the operator
(and
or or
) and purposes
.
Request example
The following request gets data sources that match a set of purposes.
Response example
Retrieve data sources by user
GET
/dataSource/rpc/mine
Retrieves all the data sources the current user has access to.
Response schema
id
integer
The data source ID.
name
string
The data source name.
type
string
The type of data source, such as ingested
.
sqlTableName
string
The name of the table in Immuta.
sqlSchemaName
string
The name of the schema in Immuta.
blobHandlerType
string
The type of handler, such as Snowflake
.
sparkUseJDBC
boolean
When true
, uses a JDBC driver.
Request example
The following request retrieves all the data sources the current user has access to.
Response example
Get recent policy activities for a data source
GET
/dataSource/{dataSourceId}/activities
Get all of the recent policy activities for a given data source.
Query parameters
dataSourceId
integer
The data source ID.
Yes
offset
integer
The number of results to skip (for paging).
No
size
integer
The number of results to return per page.
No
Response schema
count
integer
The number of results.
activities
array
Includes details about the policy and the data source, including the policy and data source type, when the activity notification was triggered, and whether or not the policy change was triggered by a Global policy.
actionBy
array
Details about who triggered the action.
targetUser
array
Information about the user who received the notification.
Request example
The following request gets all of the recent policy activities for a given data source.
Response example
Get profiles for data source owners and experts
GET
/dataSource/{dataSourceId}/contacts
Gets the profiles for the data source owners and experts.
Query parameters
dataSourceId
integer
The data source ID.
Yes
Response schema
id
integer
The data source ID.
state
string
The user's data source role, such as owner
or subscribed
.
name
string
The user's name.
string
The user's email.
profile
integer
The user's profile ID.
Request example
The following request gets all the profiles for the data source owners and experts.
Response example
Get tags by data source
GET
/dataSource/{dataSourceId}/tags
Get the tags for a data source.
Query parameters
dataSourceId
integer
The data source ID.
Yes
blobId
string
Returns the tags for the specified blob.
No
blobTagsOnly
boolean
When true
, will only display blob tags associated with a data source.
No
Response schema
tags
array
Includes details about the tags, such as the name
, source
, and the profile ID of the user who added the tag.
Request example
The following request gets the tags for data source 4
.
Response example
Get users who can unmask columns
GET
/dataSource/{dataSourceId}/{columnName}/unmaskUsers
Return the users who can unmask the given column.
Query parameters
dataSourceId
integer
The data source ID.
Yes
columnName
string
The name of the column to unmask.
Yes
Response schema
name
array
The name of the user who can unmask the value.
profileId
integer
The profile ID of the user who can unmask the value.
iamid
string
The IAM ID of the user who can unmask the value.
Request example
The following request returns the users who can unmask the given column.
Response example
Delete Data Sources and More
DELETE
Delete a data source. This will perform a soft delete on the first call and a hard delete the second time.
Delete a data source
DELETE
/dataSource/{dataSourceId}
Delete a data source. This will perform a soft delete on the first call and a hard delete the second time.
Query parameters
dataSourceId
integer
The data source ID.
Yes
Response schema
success
boolean
If true
, the data source is deleted.
id
integer
The data source ID.
schemaEvolutionId
integer
The schema evolution ID.
name
string
The data source name.
disabled
boolean
If true
, the data source is disabled.
handlerDeleteErrorMessage
string
The delete error message.
Request example
The following request deletes the data source 23
.
Response example
Delete a task
DELETE
/dataSource/tasks/{taskId}
Delete the specified task.
Query parameters
taskId
integer
Target task ID.
Yes
Response schema
id
integer
The deleted task ID.
state
array
The state of the deleted task, such as pending
.
type
array
The type of deleted task, such as columnAdded
.
targetNames
string
The name of the user who received the request.
targetEmails
string
The email of the user who received the request.
requester
metadata
Details regarding the requesting profile.
dataSource
metadata
details regarding the data source.
metadata
array
Details about the deleted task.
Request example
The following request deletes a specified task.
Response example
Delete a blob
DELETE
/dataSource/{dataSourceId}/blob/{blobId*}
Delete a blob.
Query parameters
dataSourceId
integer
The data source ID.
Yes
blobId
string
The blob ID.
Yes
Response schema
When the blob is successfully deleted, there will be no response.
Request example
The following request deletes a blob.
Unsubscribe from a data source
DELETE
/dataSource/{dataSourceId}/unsubscribe
Unsubscribe from a data source.
Query parameters
dataSourceId
integer
The data source ID.
Yes
Response schema
status
boolean
If true
, the requesting user is unsubscribed from the data source.
Request example
The following request unsubscribes the user from data source 23
.
Response example
Last updated
Was this helpful?