Immuta with h2o.ai
Audience: Data Owners and Users
Content Summary: This page illustrates how to connect h2o.ai to Immuta through an example using IPython Notebook (download here) and the NYC TLC data set, which can be found at the NYC Taxi & Limousine Commission website.
Prerequisites and Notes:
The Immuta Virtual Filesystem must be mounted on all workers using a specific user's credentials.
h2o.ai will not load query-backed data sources because their file size is 0.
IPython Notebook Example
!pip install http://h2o-release.s3.amazonaws.com/h2o/rel-tverberg/5/Python/h2o-22.214.171.124-py2.py3-none-any.whl
import h2o h2o.init()
data = h2o.import_file(path='immuta/nytaxi/green_tripdata_2016-01.csv') data['Total_amount'].mean()