Become Amazon Certified with updated AWS-DEA-C01 exam questions and correct answers
A mobile app tracks user activity data, which is continuously streamed to Amazon Kinesis Data Streams. The app requires a solution to process this data in real-time and update user profiles stored in Amazon DynamoDB based on the activity data.
What combination of AWS services should be used for real-time processing of the stream and updating the user profiles in DynamoDB?
A company needs to load customer data that comes from a third party into an Amazon Redshift datawarehouse. The company stores order data and product data in the same data warehouse. The company wantsto use the combined dataset to identify potential new customers.A data engineer notices that one of the fields in the source data includes values that are in JSON format.How should the data engineer load the JSON data into the data warehouse with the LEAST effort?
A sales company uses AWS Glue ETL to collect, process, and ingest data into an Amazon S3 bucket. The AWS Glue pipeline creates a new file in the S3 bucket every hour. File sizes vary from 200 KB to 300 KB. The company wants to build a sales prediction model by using data from the previous 5 years. The historic data includes 44,000 files. The company builds a second AWS Glue ETL pipeline by using the smallest worker type. The second pipeline retrieves the historic files from the S3 bucket and processes the files for downstream analysis. The company notices significant performance issues with the second ETL pipeline. The company needs to improve the performance of the second pipeline. Which solution will meet this requirement MOST cost-effectively?
A Data Engineering Team is setting up a new job in AWS Glue ETL. They have configured their data sources and targets, defined the necessary transformations, but when they attempt to run the job, it fails with an authorization error. The team needs to ensure that the AWS Glue job has the necessary permissions to access the required AWS resources for execution.
What should the team do to resolve this issue and successfully run the Glue ETL job?
Files from multiple data sources arrive in an Amazon S3 bucket on a regular basis. A data engineer wants to ingest new files into Amazon Redshift in near real time when the new files arrive in the S3 bucket. Which solution will meet these requirements?
© Copyrights DumpsCertify 2025. All Rights Reserved
We use cookies to ensure your best experience. So we hope you are happy to receive all cookies on the DumpsCertify.