Amazon S3
Amazon S3 ETL connector for data replication
Last updated
Was this helpful?
Amazon S3 ETL connector for data replication
Last updated
Was this helpful?
Features
Details
Release Status
Released
Table Selection
No
Column Selection
No
Edit Integration
Yes
Replication Type Selection
No
Replication Key
File Modified Date
Suggested Replication Frequency
1 hr
Signing to Daton
Select Amazon S3 from the list of Integrations
Provide Integration Name, Replication Frequency. Integration name would be used in creating tables for the integration and cannot be changed later
Post successful authentication, you will be prompted to enter a folder path
Select the file type - Daton supports CSV and XLS formats
Enter the row number where the columns names (headers) are present.
Then select required fields for each table.
Overwrite the column names and update any data types as required
Submit the integration
Integrations would be in Pending state initially and will be moved to Active state as soon as the first job loads data successfully on to the configured warehouse
Users would be able to edit/pause/re-activate/delete integration anytime
Users can view job status and process logs from the integration details page by clicking on the integration name from the active list
Enter the access key and secret key in the Daton UI
Navigate to your S3 buckets where files for replication reside
Click on the Copy S3 URI button
S3 URI copied will have the following format - "s3://amazon-rpa/OutputFiles/developer/Sales and Traffic by ASIN/"
Remove the s3:// from the URL and enter the rest of the string the Daton UI.
Select the row number in the file that has the column names.
Daton auto-detects the schema from the folder and extracts the column names and infers the data types.
Override the Daton detected values if necessary to control the schema in the warehouse.
Daton will create the tables in the warehouse shortly and start processing data shortly after.