Prepare and pass your Cloud Certified Professional Data Engineer with free Professional-Data-Engineer exam questions.
Which role must be assigned to a service account used by the virtual machines in a Dataproc cluster so they can execute jobs?
You have a data pipeline that writes data to Cloud Bigtable using well-designed row keys. You want to monitor your pipeline to determine when to increase the size of you Cloud Bigtable cluster. Which two actions can you take to accomplish this? Choose 2 answers.
You operate a database that stores stock trades and an application that retrieves average stock price for a given company over an adjustable window of time. The data is stored in Cloud Bigtable where the datetime of the stock trade is the beginning of the row key. Your application has thousands of concurrent users, and you notice that performance is starting to degrade as more stocks are added. What should you do to improve the performance of your application?
You are training a deep learning model for a classification task. The precision and recall of the model is quite low. What could you do to improve the precision and recall scores?
You want to schedule a number of sequential load and transformation jobs Data files will be added to a Cloud
Storage bucket by an upstream process There is no fixed schedule for when the new data arrives Next, a
Dataproc job is triggered to perform some transformations and write the data to BigQuery. You then need to
run additional transformation jobs in BigQuery The transformation jobs are different for every table These
jobs might take hours to complete You need to determine the most efficient and maintainable workflow to
process hundreds of tables and provide the freshest data to your end users. What should you do?
© Copyrights DumpsCertify 2024. All Rights Reserved
We use cookies to ensure your best experience. So we hope you are happy to receive all cookies on the DumpsCertify.