Data Sources Page
Audience: All Immuta users
Content Summary: The Data Sources page allows Immuta users to view, subscribe to, and create data sources in Immuta. This section highlights the major tabs and features found on the details page for an individual data source.
For details about concepts relating to data sources or instructions for accessing and creating data sources, see the Data Sources section of documentation.
Data Sources Page
Once users navigate to this page, a list of data sources appears in the center window. Users can navigate between the All Data Sources tab and the My Data Sources tab to filter this list.
Additionally, users may use the Search bar in the upper left corner of the Immuta console to filter search results by data source name, tag, project, connection strings, or columns.
To navigate to a specific data source, users click on it from this list, and they are taken to that data source's Overview page.
Data Source Details Page
In addition to the data source's health, this page provides detailed information about the data source and is organized by tabs across the top of the page: Overview, Users, Policies, Data Dictionary, Queries, Metrics, Discussions, Contacts, and Lineage. The visibility and appearance of the tabs vary slightly depending on the type of user accessing the data source.
This tab includes detailed information about the data source in the left side-panel, including its Table Name, Schema Name, Parent Server, Data Catalog, Description, Remote Table, Remote Database, date Created, Technology, File Type, number of Users, and Data Source ID.
In the middle window, the information displayed is divided into three categories:
- Documentation: Data Owners can provide additional information about their data source here. If there is no documentation, only the data source name will appear.
- Connections: This section provides users' SQL connection string and information for connecting the Immuta Query Engine to external analytics tools, including PySpark 1.6, PySpark 2.0, Python+Psycopg2, Python+pyodbc, R, and RStudio.
- Tags: This section lists tags associated with the data source.
This tab contains information about the users associated with the data source, their username, when their access expires, what their role is, and an Actions button for denying access.
This tab is visible to everyone, but Data Owners and Governors can manage users from this page.
This tab lists the policies associated with the data source and includes three components:
- Subscribers: Lists who may access the data source. If a Subscription Policy has already been set by a Global Policy, then a notification and a Disable button will appear at the bottom of this section. Data Owners can click the Disable button to make changes to the Subscription Policy.
- Data Policies: Lists policies that enforce privacy controls on the data source. Data Owners can use this section to manage policies.
- Activity Panel: Records all changes made to policies by Data Owners or Governors, including when the data source was created, the name and type of the policy, when the policy was applied or changed, and if the policy is in conflict on the data source. Global policy changes are identified by the Governance icon; all other updates are labeled by the Data Sources icon.
This tab is visible to everyone, but Data Owners and Governors can manage policies from this page.
Data Dictionary Tab
The Data Dictionary is a table that details information about each column in a data source. The information within the Data Dictionary is generated automatically when the data source is created, if the remote storage technology supports SQL. Otherwise, Data Owners or Experts can manually create Data Dictionaries. The Data Dictionary tab includes four sections:
- Name: The name of the column in the table.
- Type: The type of value (text, integer, decimal, or timestamp).
- Description: May be used to give further information about the data in each column.
- Actions: Users may use the buttons in this column to edit, comment, or tag items in the Data Dictionary.
Discovered and added tags will appear under each entry, and Data Owners or Governors can delete the tags from here as well.
This tab allows users to keep track of their personal queries, share their queries with others, sample public queries, and debug queries.
If users have issues with a query they are running, they can send a request for a query debug to the Data Owner.
This tab details the data source usage and general statistics. Object-backed data sources provide the total number of records available, and query-backed data sources provide the total number of rows.
Users are able to comment on or ask questions about the Data Dictionary columns and definitions, public queries, and the data source in general. Resolved comments and questions are available for review to keep a complete history of all the knowledge sharing that has occurred on a data source.
Contact information for Data Owners is provided for each data source, allowing users to ask questions about accessibility and the attributes required for viewing the data.
This tab lists all projects, derived data sources, or parent data sources associated with the data source and includes the reason the data source was added to a project, who added the data source to the project or created it, and when the data source was added to the project or created.
When users submit a Debug Query or Unmask request in the UI, a Tasks tab appears beside the Lineage tab for the requesting user and the user receiving the request. This tab contains information about the request, and users can view and manage the tasks listed from this tab.