Importing data from Google Cloud Storage (GCS)
Last updated
Last updated
The data stored on Google cloud services can be imported to the platform by creating a connection to Google cloud storage. The connection can only be created with a valid JSON access key generated after creating a service account.
To import data from GCS:
Hover over the menu icon and select Connectors. The Connectors page is displayed showing the total number of connectors.
The Data Connectors screen is displayed.
Click the plus icon on the top. You can also use the +New data connector button on the workspace to create a new connection. The later option appears only when there are no data connectors in the tenant.
Click the Google Cloud services tile.
Click Create Connection. The Data connectors configuration page is displayed.
Specify this information to configure the Google cloud storage Data connector and access files:
Name: The name of the Data connector.
Bucket: The name of the bucket in which folders or files are stored in GCS. The bucket name used must be same as the name with which the bucket is created in the Google cloud storage.
Access key: The valid JSON access key generated after creating a service account in GCS, to authenticate.
Click the Test icon to check if you are able to establish the connection to the Data connector successfully. Once the connection is established, you can see the files imported from the GCS bucket to the platform. The list of files imported are populated in the table format. You can only view the file names, but cannot view or download.
Click Save to save the Data connector.
You can manage files, datasets, and published outputs for this data connector across different tabs:
Files Tab: View the files retrieved from this data connector.
Datasets Tab: See the projects where datasets fetched from this data connector have been used.
Schedulers Tab: View the outputs published to this connector. When creating a scheduler, users can configure an external connector as the destination to publish the generated outputs upon job execution.
To delete the data connector, click the Actions drop-down menu and select Delete
This Data connector gets added to the already existing Data connectors on this tenant. You can use imported files from GCS in the drop-down list of data connectors while uploading a dataset on the canvas.