Data Source API reference guide
This page describes the dataSource
endpoint, through which users can subscribe to data sources and manage data source tasks. To create data sources, see the specific handler endpoints.
Additional fields may be included in some responses you receive; however, these attributes are for internal purposes and are therefore undocumented.
GET
Search for data sources.
GET
Get a data source based on the ID.
GET
Get data source based on the name.
GET
Get data source based on the short name.
GET
Get parent and child relationship records for derived data sources using a specified data source ID.
GET
Retrieve a blob.
GET
Get all users with the provided access level for this data source.
GET
Retrieves the visibilities, masking information, and filters that the passed in user has access to in the specified data source.
GET
Retrieves a summary of total records, total visibilities, and visibilities a given user has access to.
GET
Retrieves a summary of total records, total visibilities, and visibilities the current user has access to for a specified data source.
GET /dataSource
Search for data sources.
blobHandlerType
array[string]
Describes the type of underlying blob handler that will be used with this data source (e.g., Custom, MS SQL).
No
subscription
array[string]
The requesting user's subscription status: pending
, owner
, subscribed
, not_subscribed
, expert
, or ingest
.
No
status
array[string]
The data source status: passed
or failed
.
No
tag
array[string]
Filters data sources by tags associated with the data sources.
No
searchText
string
Searches for data source names using the provided string
.
No
column
array[string]
Searches for data source column names.
No
connectionString
array[string]
Searches by connection string.
No
schema
string
Searches for data source schema.
No
nameOnly
boolean
When true
, searchText
will only search data source names. Default is false
.
No
idOnly
boolean
When true
, only returns the ID Of the data source and the user's subscription status.
No
dataSourceIds
array[integer]
Searches for the provided data source IDs.
No
selectFields
array[string]
This field accepts the values id
, name
, and columnEvolutionEnabled
. When id
or name
are provided, the request will return only the ID or name of the data source and the subscription status. If columnEvolutionEnabled
is provided, the response will also include information about the policies, policy conflicts, and workspaces associated with the data sources.
No
offset
integer
Used in combination with size
to fetch pages. Default is 0
.
No
size
integer
The number of results to return per page. Default is 10
.
No
sortField
string
Used to sort results by field, which must be createdAt
, name
, blobHandlerType
, subscriptionStatus
, recordCount
, status
, policy
, or editable
.
No
sortOrder
string
Sorts results by order, which must be asc
or desc
.
No
excludedProjects
array[integer]
Filter out any data sources that belong to the specified projects.
No
ephemeral
boolean
When true
, returns ephemeral data sources.
No
clusterName
string
The name of the remote cluster the data source is connected to.
No
mode
integer
Specifies the query mode, which must be 0
(FULL
), 1
(COUNT
), 4
(TAG
), 5
(MIN_MAX
), or 6
(STATUS
).
No
globalPolicy
string
Filter by data sources that have this Global Policy applied.
No
hostname
string
Searches data sources by hostname.
No
determinePolicyContflicts
boolean
When true
, filters results to return the data sources with policy conflicts.
No
collectionId
string
Filter by data sources in the domain with this ID.
No
collectionExclude
string
When true
, filters results to return the data sources in the domain of the provided ID.
No
sddTemplateName
string
Filter by data sources that have the specified SDD identification framework applied.
No
excludeSddTemplateName
string
Filter by data sources that do not have the specified SDD identification framework applied.
No
name
string
Data source name.
id
integer
Data source ID.
deleted
boolean
If true
the data source is a deleted data source.
description
string
The data source description.
createdAt
timestamp
The date and time the data source was created.
subscriptionPolicy
array
Details the type of Subscription Policy applied to the data source.
schemaEvolutionId
integer
The schema evolution ID.
recordCount
integer
The record count.
status
array[string]
Accepted statuses are passed
or failed
.
subscriptionStatus
array[string]
Accepted statuses are subscribed
or unsubscribed
.
blobHandlerType
array[string]
Describes the type of underlying blob handler of this data source (e.g., Custom, MS SQL).
subscriptionType
string
The type of subscription policy on the project. The type can be automatic
(which allows anyone to subscribe), approval
(which requires the subscriber to be manually approved), policy
(which only allows users with specific groups or attributes to subscribe), or manual
(which requires users to be manually added).
connectionString
string
The connection string information.
sqlSchemaName
string
The schema name.
policy
string
When this value is none
, there are no data policies applied to the data source. Otherwise, this field indicates whether or not there are policy conflicts among the data policies applied to the data source.
policyHandlerType
string
The policy handler type, such as None
or Builder
.
The following request returns 2 data sources.
GET
/dataSource/{dataSourceId}
Get a data source based on the ID.
Query parameters
dataSourceId
integer
The data source ID .
Yes
Response Schema
name
string
The data source name.
recordFormat
string
The data format of blobs in the data source, such as json
, xml
, html
, or jpeg
.
description
string
The description of the data source.
policyHandler
array
The ID of the policy handler and details about the data policies enforced on the data source.
sqlSchemaName
string
A string that represents this data source's schema name in Immuta.
sqlTableName
string
The SQL table name in Immuta.
blobHandler
array[object]
A list of full URLs providing the locations of all blob store handlers to use with this data source.
blobHandlerType
string
Describes the type of underlying blob handler that will be used with this data source (e.g., MS SQL
).
createdBy
integer
The ID of the profile creating the data source.
deleted
boolean
If true, the data source was deleted.
type
string
The data source type: queryable
.
rowCount
integer
The number of rows.
documentation
string
Documentation associated with the data source.
id
integer
The data source ID.
policyHandlerType
string
The type of policy handler applied to the data source: Builder
.
subscriptionType
string
The type of subscription policy on the data source. The type can be automatic
(which allows anyone to subscribe), approval
(which requires the subscriber to be manually approved), policy
(which only allows users with specific groups or attributes to subscribe), or manual
(which requires users to be manually added).
subscriptionPolicy
array
Details about the Subscription Policy applied to the data source.
globalPolicies
string
Details about the Global Policies applied to the data source.
status
string
The data source health status.
The following request gets a data source based on the ID 22
.
GET
/dataSource/name/{dataSourceName}
Get a data source based on the name.
dataSourceName
string
The data source name.
Yes
name
string
The data source name.
recordFormat
string
The data format of blobs in the data source, such as json
, xml
, html
, or jpeg
.
description
string
The description of the data source.
policyHandler
array
The ID of the policy handler and details about the data policies enforced on the data source.
sqlSchemaName
string
A string that represents this data source's schema name in Immuta.
sqlTableName
string
The SQL table name in Immuta.
blobHandler
array[object]
A list of full URLs providing the locations of all blob store handlers to use with this data source.
blobHandlerType
string
Describes the type of underlying blob handler that will be used with this data source (e.g., MS SQL
).
createdBy
integer
The ID of the profile creating the data source.
deleted
boolean
If true, the data source was deleted.
type
string
The data source type:queryable
.
rowCount
integer
The number of rows.
documentation
string
Documentation associated with the data source.
id
integer
The data source ID.
policyHandlerType
string
The type of policy handler applied to the data source: Builder
.
subscriptionType
string
The type of subscription policy on the data source. The type can be automatic
(which allows anyone to subscribe), approval
(which requires the subscriber to be manually approved), policy
(which only allows users with specific groups or attributes to subscribe), or manual
(which requires users to be manually added).
subscriptionPolicy
array
Details about the Subscription Policy applied to the data source.
globalPolicies
string
Details about the Global Policies applied to the data source.
status
string
The data source health status.
The following request gets a data source based on the name Public Barfoo
.
GET
/dataSource/sqlTableName/{shortName}
Get a data source based on the SQL table name.
shortName
string
The data source SQL table name.
Yes
name
string
The data source name.
recordFormat
string
The data format of blobs in the data source, such as json
, xml
, html
, or jpeg
.
description
string
The description of the data source.
policyHandler
array
The ID of the policy handler and details about the data policies enforced on the data source.
sqlSchemaName
string
A string that represents this data source's schema name in Immuta.
sqlTableName
string
The SQL table name in Immuta.
blobHandler
array[object]
A list of full URLs providing the locations of all blob store handlers to use with this data source.
blobHandlerType
string
Describes the type of underlying blob handler that will be used with this data source (e.g., MS SQL
).
createdBy
integer
The ID of the profile creating the data source.
deleted
boolean
If true, the data source was deleted.
type
string
The data source type: queryable
.
rowCount
integer
The number of rows.
documentation
string
Documentation associated with the data source.
id
integer
The data source ID.
policyHandlerType
string
The type of policy handler applied to the data source: Builder
.
subscriptionType
string
The type of subscription policy on the data source. The type can be automatic
(which allows anyone to subscribe), approval
(which requires the subscriber to be manually approved), policy
(which only allows users with specific groups or attributes to subscribe), or manual
(which requires users to be manually added).
subscriptionPolicy
array
Details about the Subscription Policy applied to the data source.
globalPolicies
string
Details about the Global Policies applied to the data source.
status
string
The data source health status.
The following request gets a data source based on the SQL table name customer_data
.
GET
/dataSource/{dataSourceId}/lineage/{type}
Get parent and child relationship records for derived data sources using a specified data source ID.
type
string
The type of lineage records to return. Options include: parents
, children
, and all
.
Yes
dataSourceId
integer
The target data source ID.
Yes
children
array
Details of the child data source, including dataSourceId
, dataSourceName
, projectId
, policyHandlerDiff
, deleted
, createdBy
, and createdAt
.
parents
array
Details of the parent data source, including dataSourceId
, dataSourceName
, projectId
, policyHandlerDiff
, deleted
, createdBy
, and createdAt
.
The following request gets the parent relationship records for the derived data source with the data source ID 4
.
GET
/dataSource/{dataSourceId}/blob/{blobid*}
Retrieve a blob.
dataSourceId
integer
The data source ID.
Yes
blobId
string
The blob ID.
Yes
The response will download the blobs in a file you specify.
The following request retrieves a blob.
GET
/dataSource/{dataSourceId}/access
Get all users with the provided access level for this data source.
dataSourceId
integer
The data source ID.
Yes
states
Array[string]
The status levels to include when querying for user access.
No
approved
boolean
Denotes whether the returned access objects should be approved.
No
searchText
string
A string used to filter returned users. The query is executed with a wildcard prefix and suffix.
No
size
integer
The number of results to return.
No
offset
integer
The number of results to skip (for paging).
No
sortField
string
The field on which to sort the result set.
No
sortOrder
string
The order in which to sort the results.
No
expandGroups
boolean
If true will return individual members of any group subscribed.
No
ignoreSystemGenerated
boolean
If true, will not return system generated accounts.
No
filterBySchemaEvolution
boolean
If true, will only return users who have the specified level of access across ALL data sources within the same schema evolution group as this one.
No
count
integer
The number of users with access to the data source.
users
string
The metadata regarding the users with access to the data source.
The following request gets all users with the provided access level for this data source.
GET
/dataSource/{dataSourceId}/users/{profileId}/policyInfo
Retrieves the visibilities, masking information, and filters that the passed in user has access to in the specified data source.
dataSourceId
integer
The data source ID.
Yes
profileId
integer
The profile ID of the user.
Yes
projectId
integer
The project ID. If provided, this project will be used when evaluating the user's visibilities.
No
visibilities
array
Details of the user's visibilities, including anyKey
.
visibilityRuleApplies
boolean
If true
, a visibility rule exists and the user is not excepted from it.
masked
array
Masking information for the data source, including metadata
, name
, type
, and actionType
.
additionalFilters
array
Policy information for the data source, including customWhere
, differentialPrivacy
, eventTimeColumn
, minimization
, time
, filterSeconds
, and isOlderOrNewer
.
The following request gets the visibility information for the user with the profile ID 2
on the data source with the data source ID 16
.
GET
/dataSource/{dataSourceId}/users/{profileId}/visibilityReport
Retrieves a summary of total records, total visibilities (the unique values contained in a column protected by a row-level security policy that allow Immuta to determine whether or not a user can see a given row if they possess an attribute that matches the visibility of that row), and visibilities a given user has access to.
dataSourceId
integer
The data source ID.
Yes
profileId
integer
The profile ID of the user.
Yes
informationOnly
boolean
If true
, the query will just return information for the UI and will skip running some queries for ephemeral data sources.
No
includeNestedColumns
boolean
If true
, the query will return just information for the dictionary page, including the masking policies for nested columns.
No
noVisibilities
boolean
If true
, the data source has no row-level security or purpose-based restriction policies applied to it.
dataSourceVisibilitiesCount
integer
The total number of possible visibilities the given data source has.
userVisibilitiesCount
integer
The number of visibilities the current user can see for the given data source.
masked
array
Masking information for the data source, including metadata
, name
, type
, and actionType
.
dataSource
integer
The data source ID.
dataSourceName
string
The data source name.
additionalFilters
array
Policy information for the data source, including customWhere
, differentialPrivacy
, eventTimeColumn
, minimization
, time
, filterSeconds
, and isOlderOrNewer
.
allowMaskedJoins
boolean
If true
the data source allows masked joins.
policySet
array
Details about the policies on the data source.
The following request gets all of the visibility information for the user with the profile ID 2
on the data source with the data source ID 16
.
GET
/dataSource/{dataSourceId}/visibilityReport
Retrieves a summary of total records, total visibilities (the unique values contained in a column protected by a row-level security policy that allow Immuta to determine whether or not a user can see a given row if they possess an attribute that matches the visibility of that row), and visibilities the current user has access to for a specified data source.
dataSourceId
integer
The data source ID.
Yes
noVisibilities
boolean
If true
, the data source has no row-level security or purpose-based restriction policies applied to it.
dataSourceVisibilitiesCount
integer
The total number of possible visibilities the given data source has.
userVisibilitiesCount
integer
The number of visibilities the current user can see for the given data source.
denialReason
string
Reason the user was denied visibility.
masked
array
Masking information for the data source, including metadata
, name
, type
, and actionType
.
dataSource
integer
The data source ID.
dataSourceName
string
The data source name.
additionalFilters
array
Policy information for the data source, including customWhere
, differentialPrivacy
, eventTimeColumn
, minimization
, time
, filterSeconds
, and isOlderOrNewer
.
allowMaskedJoins
boolean
If true
the data source allows masked joins.
policySet
array
Details about the policies on the data source.
The following request gets all of the visibility information for the current user on the data source with the data source ID 16
.
POST
Subscribe to a data source.
POST
Make a request for values to be unmasked.
POST
Add a user to a specific data source.
POST
/dataSource/subscribe
Subscribe to a data source.
dataSourceId
integer
Data source ID number.
Yes
dataSourceIds
array
The ID of the data source the user is subscribing to.
No
approvals
array
Includes details about the Subscription policy on the data source: requiredPermissions
, specificApproverRequired
, specificApprover
, and ownerModelId
.
No
body
array
Contains details about the data source, including the data source ID, subscription status of the user, the profile ID of the user, and the dates the data source was created and updated.
The following request subscribes to the data source with ID 22
.
Payload example
Deprecation notice
Support for unmask requests has been deprecated.
POST
/dataSource/{dataSourceId}/reverseMask
Makes a request for values to be unmasked.
dataSourceId
integer
The data source ID.
Yes
column
string
The column to unmask.
Yes
unmaskingReason
string
The reason the values need to be unmasked.
Yes
unmaskingUsers
array[integer]
The profile ID of the users who can unmask the values for the requestor.
Yes
projectId
integer
The ID of the associated project.
No
dataSourceId
integer
The data source ID.
No
id
integer
The ID of the request.
requestingUserProfile
integer
The requesting user profile ID.
dataSourceId
integer
The data source ID.
reason
string
The reason for the unmasking request.
metadata
string
Metadata regarding the masking, such as the column
, values
, and maskingConfig
.
type
string
The type of request.
state
string
The state of the task, such as pending
.
createdAt
timestamp
The date and time the task was created.
updatedAt
timestamp
The date and time the task was updated.
The following requests for values to be unmasked.
Request payload example
POST
/dataSource/{dataSourceId}/access
Add a user to a specific data source. Requestors cannot add themselves to a data source. To request access to a data source, use the /dataSource/subscribe
endpoint.
dataSourceId
integer
The data source ID.
Yes
state
string
The status of the user: subscribed
, owner,
expert
, or ingest
.
Yes
profileId
integer
The profile ID of the user being added to the data source.
Yes
groupId
integer
The ID of the group being added to the data source.
No
approvals
array
Details about the user approving access: requiredPermission
, specificApproverRequired
, and specificApprover
.
No
expiration
date
The date the user's data source subscription ends.
No
id
integer
The user's subscription ID.
modelId
integer
The model ID.
modelType
string
The model type.
state
string
The user's data source role, such as subscribed
.
denialReasoning
string
If the user was denied access, the reason for denial.
profile
integer
The user's profile ID.
group
integer
If a group was added, the group ID.
expiration
date
The date the user's subscription to the data source will expire.
acknowledgeRequired
boolean
If the data source is associated with a project, this value will be true
if the user needs to confirm they have read the project acknowledgment.
createdAt
timestamp
The date and time of creation.
updatedAt
timestamp
The date and time of update.
approved
boolean
When true
, the user's request has been approved.
The following request adds a user (saved in example-payload.json
) to this data source.
Request payload example
GET
Get all pending tasks for this user and pending tasks this user has created.
GET
Handles the given task and marks it as complete.
GET
Returns all tasks the user has made, can approve or deny, or validate for this data source.
PUT
Change user status for a specific data source.
GET
/dataSource/tasks/pending
Get all pending tasks for this user and pending tasks this user has created.
searchText
string
If specified, will filter results using the specified string.
No
searchModel
string
Will filter the results by model type: dataSource
or schemaEvolution
.
No
offset
integer
The number of results to skip (for paging).
No
size
integer
The number of results to return per page.
No
schemaEvolutionConnectionString
string
The schema evolution connection string to filter by.
No
countBySchemaEvolution
boolean
Iftrue
, will only return the number of tasks, grouped by schema evolution.
No
countByDataSource
boolean
Iftrue
, will only return the number of tasks, grouped by data source.
No
countOnly
boolean
When true
, will only return a count of the pending tasks.
No
groupByDataSource
boolean
If true
, will return the results as an array of { dataSourceId: , rows: }
.
No
types
No
outgoing
array
Includes details of the tasks or requests created by the user, such as the count
, type
, and targetEmails
.
incoming
array
Includes details about the tasks received by the user, such as the count
, type
, and targetEmails
.
The following request gets all pending tasks for a user and pending tasks the user has created.
GET
/dataSource/tasks/{taskId}
Handles the given task and marks it as complete.
taskId
integer
The task ID.
Yes
result
array
Includes details about the task.
The following request handles the given task and marks it as complete.
GET
/dataSource/{dataSourceId}/tasks
Returns all tasks the user has made, can approve or deny, or validate for this data source.
dataSourceId
integer
The data source ID.
Yes
states
Array[string]
The state of the tasks: pending
or completed
.
No
targetProfileId
integer
Only returns tasks where the target user has this profile ID.
No
requestingUserProfileId
integer
Only returns tasks where the requesting user has this profile ID.
No
profileId
integer
Returns tasks where either the target or requesting user has this profile ID.
No
searchText
string
A string used to filter returned users. The query is executed with a wildcard prefix and suffix.
No
searchModel
string
A string used to determine how results should be filtered using searchText.
No
types
No
size
integer
The number of results to return.
No
offset
integer
The number of results to skip (for paging).
No
sortField
string
The field by which to sort the result set.
No
sortOrder
string
The order in which to sort the results. The default is desc
.
No
countOnly
boolean
If true
, will only return the number of tasks.
No
hits
array
Includes details about each task, such as the id
, state
, type
, and requestor
.
count
integer
The total number of tasks.
The following request returns all tasks the user has made, can approve or deny, or validate for this data source.
PUT
/dataSource/{dataSourceId}/access/{subscriptionId}
Change user status for a specific data source. Requestors cannot update their own status for a data source.
dataSourceId
Integer
The data source ID.
Yes
subscriptionId
Integer
The data source member's subscription ID.
Yes
state
string
The new status for the user: subscribed
, owner
, expert
or ingest
.
Yes
id
integer
The data source member's subscription ID.
modelId
integer
The model ID.
modelType
array
The model type (i.e., datasource
).
state
array
The current state of the user's role: subscribed
, owner
, expert
, or ingest
.
profile
integer
The profile ID.
group
integer
If a group's status is being updated, this is the group ID.
expiration
timestamp
The date the user will no longer have access to the data source.
acknowledgeRequired
boolean
This attribute is specific to projects. When true
the user needs to confirm they have read the project acknowledgement statement.
createdAt
timestamp
The date and time created.
updatedAt
timestamp
The date and time updated.
originalState
array
The user's previous status for the data source.
approved
boolean
If true
, the status is approved.
The following request changes the user status to subscribed
for the specified data source.
Payload example
PUT
Update a data source.
PUT
Update multiple data sources.
POST
Refresh native views.
POST
Save blob metadata to Immuta.
POST
Save blob metadata to Immuta and store raw content in local blob store.
PUT
Trigger the schema monitoring job for the specified detection group, or all groups if no ID is given.
PUT
/dataSource/{dataSourceId}
Update a data source.
dataSourceId
integer
The data source ID.
Yes
blobHandler
array[object]
A list of full URLs providing the locations of all blob store handlers to use with this data source.
No
blobHandlerType
string
Describes the type of underlying blob handler that will be used with this data source (e.g., MS SQL
).
No
recordFormat
string
The data format of blobs in the data source, such as json
, xml
, html
, or jpeg
.
No
type
string
The type of data source: queryable
(metadata is dynamically queried).
No
name
string
The name of the data source. It must be unique within the Immuta tenant.
No
sqlTableName
string
A string that represents this data source's table in Immuta.
No
organization
string
The organization that owns the data source.
No
category
string
The category of the data source.
No
description
string
The description of the data source.
No
hasExamples
boolean
When true
, the data source contains examples.
No
private
boolean
When false
, the data source will be publicly available in the Immuta UI.
blobHandler
array[object]
A list of full URLs providing the locations of all blob store handlers to use with this data source.
blobHandlerType
string
Describes the type of underlying blob handler that will be used with this data source (e.g., MS SQL
).
recordFormat
string
The data format of blobs in the data source, such as json
, xml
, html
, or jpeg
.
type
string
The type of data source: queryable
(metadata is dynamically queried).
name
string
The name of the data source. It must be unique within the Immuta tenant.
sqlTableName
string
A string that represents this data source's table in Immuta.
organization
string
The organization that owns the data source.
description
string
The description of the data source.
policyHandler
array
The ID of the policy handler and details about the data policies enforced on the data source.
subscriptionPolicy
array
Details about the subscription policy enforced on the data source, including the type of policy and exceptions.
The following request updates the data source's documentation (saved in example-payload.json
).
Request payload example
PUT
/dataSource/bulk/{type}
Update data sources.
type
string
The action to perform on the data sources: add-users
, disable
, restore
, delete
, or tags
.
Yes
ids
array[integer
] The IDs of the data sources to update.
Yes
update
array[object]
Only required for add-users
(includes metadata about the users' profiles
: id
and state
) and tags
(includes metadata about the tags
: name
and source
) types.
No
bulkId
string
The ID of the bulk data source update.
jobsCreated
integer
The number of jobs created.
The following request adds the Address.email
tag to two data sources.
Payload example
POST
/dataSource/bulkRefreshViews
Refresh native views.
dataSourceIds
array[integer]
The IDs of the data sources of the native views to update.
Yes
The following request with the payload below refreshes the view for the data source with the ID 202.
Payload example
POST
/dataSource/{dataSourceId}/blobs
Save blob metadata to Immuta.
dataSourceId
integer
The data source ID.
Yes
blobId
string
The unique ID used to identify this blob within its data source.
Yes
file
string
The binary file to add to the data source.
Yes
filename
string
The name that will display in the filesystem.
No
tags
array[string]
Tags to apply to the blob.
No
date
data
A date that corresponds to a date within the record itself.
No
filesize
integer
The size of the file in bytes.
No
blobsWithoutIds
integer
The number of blobs added without IDs.
blobsInError
array
The blobs that were not added because of an error.
blobsInserted
array
The blobs added to the data source.
tags
array[string]
Tags applied to the blobs.
The following request saves blob metadata to Immuta.
Payload example
POST
/dataSource/{dataSourceId}/persistBlob
Save blob metadata to Immuta and store raw content in local blob store.
dataSourceId
integer
The data source ID.
Yes
blobId
string
The unique ID used to identify this blob within its data source.
Yes
file
string
The binary file to add to the data source.
Yes
filename
string
The name that will display in the filesystem.
No
tags
array[string]
Tags to apply to the blob.
No
date
data
A date that corresponds to a date within the record itself.
No
filesize
integer
The size of the file in bytes.
No
blobsWithoutIds
integer
The number of blobs added without IDs.
blobsInError
array
The blobs that were not added because of an error.
blobsInserted
array
The blobs added to the data source.
tags
array[string]
Tags applied to the blobs.
The following request saves blob metadata to Immuta and stores raw content in local blob stores.
Payload example
PUT
/dataSource/detectRemoteChanges
Trigger the schema monitoring job for the specified detection group, or all groups if no payload parameters are given.
dataSourceIds
array[integer]
The data source IDs to run the column detection job on. Leave empty to run this job globally on all data sources. This parameter cannot be included in the payload if schemaEvolutionId or any combination of hostname, database, port, or table is included.
No
hostname
string
The hostname of the data sources. This parameter cannot be included in the payload if dataSourceIds or schemaEvolutionId is included.
No
port
integer
The port used to connect the data sources to Immuta. This parameter cannot be included in the payload if dataSourceIds or schemaEvolutionId is included.
No
database
string
The database name. This runs schema monitoring on the database provided. If data sources were initially registered via the V2 API, including this parameter will locate new schemas that contain tables Immuta has the ability to access, and Immuta will create a new schema project associated with these newly discovered schemas and create data sources for each table located. If data sources were initially registered via the V1 API, including this parameter will only update the columns and tables of registered schema and tables of the specified database; it will not register any new schemas. This parameter cannot be included in the payload if dataSourceIds or schemaEvolutionId is included.
No
table
string
The table name. This will run column detection to just update the columns in this table. This parameter cannot be included in the payload if dataSourceIds or schemaEvolutionId is included.
No
schemaEvolutionId
No
skipColumnDetection
boolean
When true
, Immuta will only pull new tables from the source server. This parameter can only be paired with schemaEvolutionId.
No
overrides.httpPath
No
schemaDetection
object
Includes details about the resulting schema detection jobs.
columnDetection
object
Includes details about the resulting column detection jobs.
bulkId
string
The unique identifier of the jobs running schema monitoring and column detection.
Responses may include bulkId
, schemaDetection
, or columnDetection
objects, depending on the payload.
The following request triggers the schema monitoring job for the specified detection group.
Payload example
The tabs below illustrate payloads for triggering schema monitoring on a host, database, or table. The request will run schema monitoring for all databases registered under the hostname provided in the payload.
The request will run schema monitoring for all databases registered under the hostname provided in the payload.
The request will run schema monitoring on the database provided in the payload. If data sources were initially registered via the V2 API, this request will locate new schemas that contain tables Immuta has the ability to access, and Immuta will create a new schema project associated with these newly discovered schemas and create data sources for each table located. If data sources were initially registered via the V1 API, this request will only update the columns and tables of registered schema and tables of the specified database; it will not register any new schemas.
The request will run column detection and update the columns on the table specified in the payload.
The tabs below illustrate the example response for each example payload provided above.
GET
Refresh tags from an external catalog on a data source. The external catalog must be linked to the data source.
GET
Retrieve all blob handlers the current user is allowed to create.
GET
Get data sources that match a set of purposes.
GET
Retrieve all the data sources the current user has access to.
GET
Get all of the recent policy activities for a given data source.
GET
Get the profiles for the data source owners and experts.
GET
Get the tags for a data source.
GET
Return the users who can unmask the given column.
GET
/dataSource/{dataSourceId}/test
Refreshes tags from an external catalog on a data source. The external catalog must be linked to the data source.
dataSourceId
integer
The data source ID.
Yes
blob
object
Indicates whether or not the blob was successfully crawled.
columnEvolution
object
Indicates whether or not the job run to check for columns added or removed from the data source passed
and when it was last run.
externalCatalog
object
Indicates whether or not the external catalog was successfully linked to the data source.
fingerprint
object
Indicates whether or not the fingerprint job was successful (passed
) and when it was last run. The fingerprint captures summary statistics of the data source.
framework
object
Indicates whether or not the classification was successfully run on the data source to determine its sensitivity.
globalPolicy
object
Indicates whether or not global policies were successfully applied to the data source.
highCardinality
object
Indicates whether or not the job run to calculate the data source's high cardinality column passed
and when it was last run.
schemaEvolution
object
Indicates whether or not the job run to check if a new table had been added in the remote database passed
and when it was last run. If a new table was added, Immuta automatically creates a new data source. Correspondingly, if a remote table is removed, that data source will be disabled in the console.
sdd
object
Indicates whether or not sensitive data discovery was successfully run on the data source.
sql
object
Indicates whether or not the SQL query run to check the data source's health passed
and when it was last run.
stats
object
Indicates whether or not the job run to calculate the number of rows in the data source passed
and when it was last run.
The following request refreshes external catalog tags on the data source.
GET
/dataSource/blobHandlerTypes
Retrieve all blob handlers the current user is allowed to create.
name
string
The name of the blob handler.
baseUrl
string
The base URL for the data source.
config
array
Includes information about the connection configuration.
port
integer
The port number.
driver
string
The name of the driver.
The following request retrieves all blob handlers the current user is allowed to create.
GET
/dataSource/byPurposes
Get data sources that match a set of purposes.
purposes
array[string]
The purposes to filter the data sources by.
Yes
excludedProjects
array[integer]
Excludes data sources associated with specified project IDs.
No
id
integer
The data source ID.
name
array
The name of the data source.
policyId
integer
The policy ID.
restrictions
array
Details regarding the operator
(and
or or
) and purposes
.
The following request gets data sources that match a set of purposes.
GET
/dataSource/rpc/mine
Retrieves all the data sources the current user has access to.
id
integer
The data source ID.
name
string
The data source name.
type
string
The type of data source: queryable
.
sqlTableName
string
The name of the table in Immuta.
sqlSchemaName
string
The name of the schema in Immuta.
blobHandlerType
string
The type of handler, such as Snowflake
.
sparkUseJDBC
boolean
When true
, uses a JDBC driver.
The following request retrieves all the data sources the current user has access to.
GET
/dataSource/{dataSourceId}/activities
Get all of the recent policy activities for a given data source.
dataSourceId
integer
The data source ID.
Yes
offset
integer
The number of results to skip (for paging).
No
size
integer
The number of results to return per page.
No
count
integer
The number of results.
activities
array
Includes details about the policy and the data source, including the policy and data source type, when the activity notification was triggered, and whether or not the policy change was triggered by a Global policy.
actionBy
array
Details about who triggered the action.
targetUser
array
Information about the user who received the notification.
The following request gets all of the recent policy activities for a given data source.
GET
/dataSource/{dataSourceId}/contacts
Gets the profiles for the data source owners and experts.
dataSourceId
integer
The data source ID.
Yes
id
integer
The data source ID.
state
string
The user's data source role, such as owner
or subscribed
.
name
string
The user's name.
string
The user's email.
profile
integer
The user's profile ID.
The following request gets all the profiles for the data source owners and experts.
GET
/dataSource/{dataSourceId}/tags
Get the tags for a data source.
dataSourceId
integer
The data source ID.
Yes
blobId
string
Returns the tags for the specified blob.
No
blobTagsOnly
boolean
When true
, will only display blob tags associated with a data source.
No
tags
array
Includes details about the tags, such as the name
, source
, and the profile ID of the user who added the tag.
The following request gets the tags for data source 4
.
GET
/dataSource/{dataSourceId}/{columnName}/unmaskUsers
Return the users who can unmask the given column.
dataSourceId
integer
The data source ID.
Yes
columnName
string
The name of the column to unmask.
Yes
name
array
The name of the user who can unmask the value.
profileId
integer
The profile ID of the user who can unmask the value.
iamid
string
The IAM ID of the user who can unmask the value.
The following request returns the users who can unmask the given column.
DELETE
Delete a data source. This will perform a soft delete on the first call and a hard delete the second time.
DELETE
Delete the specified task.
DELETE
Delete a blob.
DELETE
Unsubscribe from a data source.
DELETE
/dataSource/{dataSourceId}
Delete a data source. This will perform a soft delete on the first call and a hard delete the second time.
dataSourceId
integer
The data source ID.
Yes
success
boolean
If true
, the request to disable or delete the data source was successful.
id
integer
The data source ID.
schemaEvolutionId
integer
The schema evolution ID.
name
string
The data source name.
disabled
boolean
If true
, the data source is disabled.
handlerDeleteErrorMessage
string
The delete error message.
The following request deletes the data source 23
.
DELETE
/dataSource/tasks/{taskId}
Delete the specified task.
taskId
integer
Target task ID.
Yes
id
integer
The deleted task ID.
state
array
The state of the deleted task, such as pending
.
type
array
The type of deleted task, such as columnAdded
.
targetNames
string
The name of the user who received the request.
targetEmails
string
The email of the user who received the request.
requester
metadata
Details regarding the requesting profile.
dataSource
metadata
details regarding the data source.
metadata
array
Details about the deleted task.
The following request deletes a specified task.
DELETE
/dataSource/{dataSourceId}/blob/{blobId*}
Delete a blob.
dataSourceId
integer
The data source ID.
Yes
blobId
string
The blob ID.
Yes
When the blob is successfully deleted, there will be no response.
The following request deletes a blob.
DELETE
/dataSource/{dataSourceId}/unsubscribe
Unsubscribe from a data source.
dataSourceId
integer
The data source ID.
Yes
status
boolean
If true
, the requesting user is unsubscribed from the data source.
The following request unsubscribes the user from data source 23
.
Array[string]
Filters the results by the type of task: unmask
, dataSourceCreated
, columnAdded
, columnDeleted
, or columnTypeChanged
. The dataSourceCreated
, columnAdded
, columnDeleted
, and columnTypeChanged
tasks are only created if a policy that contains the New
tag is active and or identifies a change in the remote platform.
Array[string]
The type of task: unmask
, dataSourceCreated
, columnAdded
, columnDeleted
, or columnTypeChanged
. The dataSourceCreated
, columnAdded
, columnDeleted
, and columnTypeChanged
tasks are only created if a policy that contains the New
tag is active and or a identifies a change in the remote platform.
integer
The ID of the schema to run the schema monitoring job on. This will run on all tables associated with the specified ID. The schema ID can be found in the response body of . This parameter cannot be included in the payload if dataSourceIds or any combination of hostname, database, port, or table is included.
string
If Databricks are configured, provide the alternative HTTP path to trigger schema monitoring on that ephemeral cluster.