Importing data from Amazon S3
Last updated
Last updated
You can import data from Amazon S3 cloud storage to the RapidCanvas platform. For this, you must establish a connection with Amazon S3 by providing the bucket name, Access key ID and secret access key. Once the connection is established successfully, it provides access to the bucket from where you can import the data to the platform.
To import data from Amazon S3:
Hover over the menu icon and select Connectors. The Connectors page is displayed showing the total number of connectors.
The Data connectors screen is displayed.
Click the plus icon on the top. You can also use the +New data connector button on the workspace to create a new connection.
Click the Amazon S3 tile.
Click Create Connection. The Data connectors configuration page is displayed.
Specify this information to configure Amazon S3 Data connector and access folders and files stored inside the folder:
Name: The name of the Data connector.
Bucket: The name of the bucket in which folders or files are stored in GCS. The bucket name used must be same as the name with which the bucket is created in the S3.
Access keyid: The access key ID is like username to connect to the S3 bucket.
Access key secret: The access key secret is like password to connect to the S3 bucket.
Click Test to check if you are able to establish the connection to the Data connector successfully. Once the connection is established, you can see the files imported from the S3 bucket to the platform. The list of files imported are populated in the table format.
Click Save to save the Data connector. This Data connector gets added to the already existing Data connectors on this tenant.
You can manage files, datasets, and published outputs for this data connector across different tabs:
Files Tab: View the files retrieved from this data connector.
Datasets Tab: See the projects where datasets fetched from this data connector have been used.
Schedulers Tab: View the outputs published to this connector. When creating a job, users can configure an external connector as the destination to publish the generated outputs upon job execution.
To delete the data connector, click the Actions drop-down menu and select Delete.