FTP Data Source Creation Tutorial

Audience: Data Owners

Content Summary: This guide contains instructions for configuring an FTP data source in Immuta. To explore data source creation guides for other storage technologies, see the Object-backed and Query-backed guides.

Step 1: Enter Connection Information

  1. Fill out the following fields in the Connection Information window:

    • Server: hostname of your FTP server
    • Port: port configured for FTP, typically port 21
    • SFTP: enable if connecting to a server that supports SFTP
    • Select Authentication Method: the method used to connect to the FTP server (Anonymous, Basic Authentication, or SSH Key)
        • Username: the username used to connect to the FTP server (only applicable if Basic Authentication or SSH Key is selected as the authentication method)
        • Password: the password used to connect to the FTP server (only applicable if Basic Authentication is selected as the authentication method)
        • Private Key: browse to the file containing this user's private SSH key (only applicable if SSH Key is selected as the authentication method)
    • Root Path: path on the FTP server that you want the data source to reference, typically ‘/’

    FTP Connection Information

  2. Click Test Connection.
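
Before entering these values in Immuta, it can help to confirm them outside the interface. The following is a minimal sketch, not part of Immuta: it uses Python's standard `ftplib` to check which credential fields each authentication method needs and to attempt a plain-FTP login. The names `REQUIRED_FIELDS`, `missing_fields`, and `can_connect` are hypothetical, and `ftplib` does not support SFTP, so the connection check only applies to Anonymous and Basic Authentication over plain FTP.

```python
from ftplib import FTP, all_errors

# Credential fields required by each authentication method, per the list above.
# Keys and field names are illustrative labels, not Immuta identifiers.
REQUIRED_FIELDS = {
    "anonymous": [],
    "basic": ["username", "password"],
    "ssh_key": ["username", "private_key"],
}


def missing_fields(auth_method, settings):
    """Return the required credential fields that are absent or empty."""
    required = REQUIRED_FIELDS[auth_method]
    return [name for name in required if not settings.get(name)]


def can_connect(server, port=21, username="", password="", root_path="/", timeout=10):
    """Try a plain-FTP login and change into root_path; return True on success.

    Covers Anonymous and Basic Authentication only; ftplib does not speak SFTP.
    """
    try:
        with FTP() as ftp:
            ftp.connect(server, port, timeout=timeout)
            ftp.login(username, password)  # an empty username logs in anonymously
            ftp.cwd(root_path)
        return True
    except all_errors:
        return False
```

For example, `can_connect("ftp.example.com", 21, "alice", "secret", "/")` (a placeholder host and credentials) returns `False` if the server is unreachable, the login is rejected, or the root path does not exist.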

Step 2: Select Data Format

Object-backed data sources can hold data in any format (images, videos, etc.), but Immuta assumes that some blobs will use common structured formats. If your blobs are comma-separated, tab-delimited, or JSON, you can mask values through the Immuta interface. Specifying the data format allows you to create masking policies for the data source.

Data Format
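
To illustrate why the format has to be declared: masking a value requires knowing where each field begins and ends. The sketch below is purely illustrative and does not reflect how Immuta applies masking policies internally. It assumes a delimited blob with a header row, and the function name `mask_column` and the `REDACTED` placeholder are hypothetical.

```python
import csv
import io


def mask_column(blob_text, column, delimiter=",", mask="REDACTED"):
    """Replace every value in `column` of a delimited blob with `mask`."""
    reader = csv.DictReader(io.StringIO(blob_text), delimiter=delimiter)
    out = io.StringIO()
    writer = csv.DictWriter(
        out,
        fieldnames=reader.fieldnames,
        delimiter=delimiter,
        lineterminator="\n",
    )
    writer.writeheader()
    for row in reader:
        row[column] = mask  # overwrite the sensitive value, keep other fields
        writer.writerow(row)
    return out.getvalue()
```

Passing `delimiter="\t"` handles tab-delimited blobs the same way. Without a known delimiter, the same bytes could not be split into fields reliably, which is why unstructured formats such as images cannot be masked this way.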

Step 3: Configure Tags and Features

The only metadata relevant to tags and features that Immuta pulls from the FTP back end is the directory structure. Folder names can be used to tag blobs or they can be applied as blob features.

FTP Tags and Features
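
As a rough illustration of the directory structure being the only available metadata, the sketch below derives candidate tag or feature values from a blob's path. It is an assumption about how folder names map to a blob, not Immuta's implementation, and the function name `folder_names` is hypothetical.

```python
from pathlib import PurePosixPath


def folder_names(blob_path, root_path="/"):
    """Return the folder names between root_path and the blob's file name."""
    relative = PurePosixPath(blob_path).relative_to(root_path)
    return list(relative.parent.parts)
```

Under this sketch, a blob at `/sales/2023/q1/report.csv` with a root path of `/` yields the folder names `sales`, `2023`, and `q1`, any of which could serve as a tag or a feature value.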

Step 4: Determine Refresh Interval

If the refresh interval is left blank or set to 0, FTP data is indexed only once, when the data source is created. Otherwise, the FTP data is re-indexed at the selected time interval.

  1. Fill out the Set Time and Set Period fields:

    • Set Time: how often Immuta re-indexes data located on the remote FTP server, counted in the units chosen in Set Period.
    • Set Period: the unit of time for the interval, which can be set to minutes, hours, or days.

    Refresh Interval

  2. Click Apply.
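
The way the two fields combine can be sketched as follows. This is a minimal illustration under the assumption that Set Time is a whole number and Set Period is one of the three listed units; the function name `refresh_interval` is hypothetical and not an Immuta API.

```python
from datetime import timedelta

# Units offered by the Set Period field, per the list above.
PERIODS = {"minutes", "hours", "days"}


def refresh_interval(set_time, set_period="hours"):
    """Return the re-index interval, or None when data is indexed only once."""
    # A blank value is treated the same as 0: index once, never refresh.
    value = int(set_time) if set_time not in (None, "") else 0
    if value == 0:
        return None
    if value < 0:
        raise ValueError("Set Time must not be negative")
    if set_period not in PERIODS:
        raise ValueError(f"unsupported period: {set_period!r}")
    return timedelta(**{set_period: value})
```

For instance, a Set Time of 30 with a Set Period of minutes corresponds to `timedelta(minutes=30)`, while a blank or zero Set Time yields `None`, meaning the data is indexed only at creation.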