Become Google Certified with updated Professional-Data-Engineer exam questions and correct answers
You have Cloud Functions written in Node.js that pull messages from Cloud Pub/Sub and send the data to BigQuery. You observe that the message processing rate on the Pub/Sub topic is orders of magnitude higher than anticipated, but there is no error logged in Stackdriver Log Viewer. What are the two most likely causes of this problem? Choose 2 answers.
Which role must be assigned to a service account used by the virtual machines in a Dataproc cluster so they can execute jobs?
You are using BigQuery with a regional dataset that includes a table with the daily sales volumes. This table is
updated multiple times per day. You need to protect your sales table in case of regional failures with a
recovery point objective (RPO) of less than 24 hours, while keeping costs to a minimum. What should you do?
Why do you need to split a machine learning dataset into training data and test data?
You are responsible for writing your company’s ETL pipelines to run on an Apache Hadoop cluster. The pipeline will require some checkpointing and splitting pipelines. Which method should you use to write the pipelines?
© Copyrights DumpsCertify 2026. All Rights Reserved
We use cookies to ensure your best experience. So we hope you are happy to receive all cookies on the DumpsCertify.