Importing to and Exporting Datasets from Your Network Storage – Rhino Federated Computing

This article explains how to import and export datasets on the Rhino FCP to and from the following cloud storage platforms:

Amazon Web Services (AWS) S3
Google Cloud Platform's (GCP) Cloud Storage (CS)
Server Message Block (SMB) network file sharing protocol
Azure Blob Storage

Prerequisite

Before you can complete these instructions, you will need to mount the bucket or directory that contains the data you want to access. To do this, follow the steps in Mounting Storage to Your Rhino Client.

Import Datasets

Import a new dataset and point the data location to your cloud storage data relative to the Client Mount Paths, see more detail in the Example below.

The integration is available at the workgroup level. Each workgroup can set up their own buckets or network share. Those buckets or network shares are not accessible to other workgroups.

Example: Importing a file from AWS S3

Suppose you want to import file that is located in a S3 bucket. Here is how you would do this.

Find your data by navigating to your organization storage (click on the S3 arrow button)
Find the data you want to import, e.g., automapper/ORGDataSet.csv.
Go to Settings > Client Mounted Storage.
In the Client Mounted Storage page, find the Client Mount Path that has the file, in this case /rhino_data/external/s3. Click the blue rectangle next to it to copy it.
In the Datasets page, on the Overview tab, select the Import New Dataset following the instructions in Importing a Remote Dataset
Set the data location as Client Mounted Path/Relative Path to Your Data in S3. Following the example in steps 2 and 4, it'd would look like as follows:
Select Import New Dataset.

Export Datasets

To export an existing dataset, follow the steps described in Exporting a Dataset. The Rhino integration with your network storage should be configured as `Is read only` = `False` to allow your Rhino Client to save the exported files in your network storage. (If you are not sure if `Is read only` = `False` in your configuration, please contact the Rhino support team.)

Datasets will be exported to the file storage path set in the integration. For the AWS import example above, datasets would exported to your AWS S3 bucket named `my_bucket`.