Databricks-Certified-Professional-Data-Engineer Practice Online

Name: Databricks Certified Professional Data Engineer
Exam Code: Databricks-Certified-Professional-Data-Engineer
Certification: Databricks Certified Professional
Vendor: Databricks
Total Questions: 198
Last Updated: May 14, 2024
Question 1

Which of the following statements regarding the retention policy of Delta Lake Change Data Feed (CDF) is correct?


Answer: A

Question 2

Which of the following describes the minimal permissions a data engineer needs to start and terminate an existing cluster?


Answer: B

Question 3

A data engineer wants to use the Databricks REST API to retrieve the metadata of a job run using its run_id.

Which of the following REST API calls achieves this requirement?


Answer: B
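
As an illustration of the kind of call this question is about, the Databricks Jobs API exposes a runs/get endpoint that takes run_id as a query parameter on a GET request. The sketch below only builds such a request and does not send it. The workspace URL, the token, the run ID, and the helper name build_get_run_request are placeholders invented for this example, and the 2.1 API version is an assumption that may differ in a given workspace.

```python
import urllib.parse
import urllib.request


def build_get_run_request(host: str, token: str, run_id: int) -> urllib.request.Request:
    """Build (but do not send) a GET request for one job run's metadata.

    Hypothetical helper for illustration; not part of any Databricks SDK.
    """
    # run_id travels as a query parameter, not in the request body.
    query = urllib.parse.urlencode({"run_id": run_id})
    url = f"{host.rstrip('/')}/api/2.1/jobs/runs/get?{query}"
    return urllib.request.Request(
        url,
        headers={"Authorization": f"Bearer {token}"},
        method="GET",
    )


# Placeholder values only: not a real workspace, token, or run.
request = build_get_run_request(
    "https://example.cloud.databricks.com", "dapi-EXAMPLE", 12345
)
print(request.get_method(), request.full_url)
```

Passing the returned object to urllib.request.urlopen would perform the call against a real workspace; the response body is JSON describing the run.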

Question 4

Incorporating unit tests into a PySpark application requires upfront attention to the design of your jobs, or a potentially significant refactoring of existing code. Which statement describes a main benefit that offsets this additional effort?


Answer: C

Question 5

An upstream source writes Parquet data as hourly batches to directories named with the current date. A nightly batch job runs the following code to ingest all data from the previous day as indicated by the date variable:

[Code listing not reproduced; original image: Databricks-Certified-Professional-Data-Engineer-page61-image7]

Assume that the fields customer_id and order_id serve as a composite key to uniquely identify each order. If the upstream system is known to occasionally produce duplicate entries for a single order hours apart, which statement is correct?


Answer: B
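
The code listing for this question is not available here, so the following is only a general sketch of the issue the scenario raises, not a reconstruction of the original job. It assumes a job that removes duplicates on the composite key within a single day's batch and then appends to the target table. Duplicates that arrive "hours apart" can straddle two daily batches, and batch-level deduplication alone does not catch those. The function name dedupe_batch, the sample rows, and the hour field are all invented for illustration.

```python
def dedupe_batch(records, key_fields=("customer_id", "order_id")):
    """Keep the first record seen for each composite key within one batch."""
    seen = set()
    unique = []
    for record in records:
        key = tuple(record[field] for field in key_fields)
        if key not in seen:
            seen.add(key)
            unique.append(record)
    return unique


# Rows already appended by the previous night's run (hypothetical data).
target_table = [{"customer_id": 1, "order_id": 10, "hour": 23}]

# Today's batch: one duplicate inside the batch, and one row that
# repeats an order already written yesterday.
batch = [
    {"customer_id": 1, "order_id": 10, "hour": 1},
    {"customer_id": 2, "order_id": 20, "hour": 3},
    {"customer_id": 2, "order_id": 20, "hour": 7},
]

unique_batch = dedupe_batch(batch)
target_table.extend(unique_batch)  # append-only write, no check against the target

# Count how often each composite key now appears in the target.
key_counts = {}
for row in target_table:
    key = (row["customer_id"], row["order_id"])
    key_counts[key] = key_counts.get(key, 0) + 1
print(key_counts)
```

Under these assumptions the batch itself is free of duplicates, yet the target still ends up with a repeated key. Avoiding that would require comparing incoming keys against the existing table, for example with an insert-only merge on the composite key.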
