Which statement regarding stream-static joins and static Delta tables is correct?
A. The checkpoint directory will be used to track updates to the static Delta table.
B. Stream-static joins cannot use static Delta tables because of consistency issues.
C. Each microbatch of a stream-static join will use the most recent version of the static Delta table as of each microbatch.
D. Each microbatch of a stream-static join will use the most recent version of the static Delta table as of the job's initialization.
E. The checkpoint directory will be used to track state information for the unique keys present in the join.
Correct answer: C
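The behavior described in option C can be illustrated with a toy model in plain Python. This is not Spark code: `VersionedTable` and `run_microbatch` are hypothetical names standing in for a Delta table and the streaming engine, under the assumption that the static side is re-read at its latest version when each microbatch starts.

```python
from typing import Dict, List, Tuple


class VersionedTable:
    """Toy stand-in for a static Delta table: every write creates a new version."""

    def __init__(self, rows: Dict[int, str]) -> None:
        self._versions: List[Dict[int, str]] = [dict(rows)]

    def write(self, rows: Dict[int, str]) -> None:
        # Commit a new version that merges the changes into the previous one.
        latest = dict(self._versions[-1])
        latest.update(rows)
        self._versions.append(latest)

    def latest(self) -> Dict[int, str]:
        # Snapshot of the most recent committed version.
        return dict(self._versions[-1])


def run_microbatch(
    events: List[Tuple[int, float]], static: VersionedTable
) -> List[Tuple[str, float]]:
    # The static side is read fresh at the start of each microbatch.
    lookup = static.latest()
    return [(lookup[key], value) for key, value in events if key in lookup]


devices = VersionedTable({1: "sensor-a"})
batch_1 = run_microbatch([(1, 70.0), (2, 65.0)], devices)

# The static table is updated between microbatches.
devices.write({2: "sensor-b"})
batch_2 = run_microbatch([(1, 71.0), (2, 66.0)], devices)

print(batch_1)
print(batch_2)
```

In this model the second microbatch matches key 2 because it sees the version written after the first microbatch ran; nothing about the static table is stored in a checkpoint.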
Question 2:
A DLT pipeline includes the following streaming tables:
raw_iot ingests raw device measurement data from a heart rate tracking device.
bpm_stats incrementally computes user statistics based on BPM measurements from raw_iot.
How can the data engineer configure this pipeline to be able to retain manually deleted or updated records in the raw_iot table while recomputing the downstream table when a pipeline update is run?
A. Set the pipelines.reset.allowed property to false on raw_iot
B. Set the pipelines.reset.allowed property to false on bpm_stats
C. Set the skipChangeCommits flag to true on raw_iot
D. Set the skipChangeCommits flag to true on bpm_stats
Correct answer: A
Question 3:
An upstream source writes Parquet data as hourly batches to directories named with the current date. A nightly batch job runs the following code to ingest all data from the previous day as indicated by the date variable:

Assume that the fields customer_id and order_id serve as a composite key to uniquely identify each order.
If the upstream system is known to occasionally produce duplicate entries for a single order hours apart, which statement is correct?
A. Each write to the orders table will only contain unique records; if existing records with the same key are present in the target table, the operation will fail.
B. Each write to the orders table will only contain unique records; if existing records with the same key are present in the target table, these records will be overwritten.
C. Each write to the orders table will run deduplication over the union of new and existing records, ensuring no duplicate records are present.
D. Each write to the orders table will only contain unique records, but newly written records may have duplicates already present in the target table.
E. Each write to the orders table will only contain unique records, and only those records without duplicates in the target table will be written.
Correct answer: D
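The ingest code is not reproduced in this question, so the following is only a sketch under an assumption: the nightly job deduplicates the day's batch on (customer_id, order_id) and then appends the result to orders. `dedupe_batch` and `nightly_append` are hypothetical helpers in plain Python, not the Spark API.

```python
from typing import Dict, List

Order = Dict[str, int]


def dedupe_batch(batch: List[Order]) -> List[Order]:
    """Keep the first record per (customer_id, order_id) within one batch."""
    seen = set()
    unique: List[Order] = []
    for row in batch:
        key = (row["customer_id"], row["order_id"])
        if key not in seen:
            seen.add(key)
            unique.append(row)
    return unique


def nightly_append(target: List[Order], batch: List[Order]) -> None:
    # Deduplication only sees the incoming batch; the target is never consulted.
    target.extend(dedupe_batch(batch))


orders: List[Order] = []

# Day 1: the same order arrives twice, hours apart, within one day.
nightly_append(orders, [
    {"customer_id": 1, "order_id": 10, "hour": 9},
    {"customer_id": 1, "order_id": 10, "hour": 14},
])

# Day 2: a late duplicate of the same order lands in the next day's directory.
nightly_append(orders, [{"customer_id": 1, "order_id": 10, "hour": 1}])

keys = [(row["customer_id"], row["order_id"]) for row in orders]
print(keys)
```

Each individual write is free of duplicates, yet the target ends up holding the same key twice. Guaranteeing uniqueness across writes would require comparing against existing records, for example with a merge on the composite key.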
Question 4:
A data engineer has configured their Databricks Asset Bundle with multiple targets in databricks.yml and deployed it to the production workspace. Now, to validate the deployment, they need to invoke a job named my_project_job specifically within the prod target context.
Assuming the job is already deployed, they need to trigger its execution while ensuring the target-specific configuration is respected. Which command will trigger the job execution?
A. databricks job run my_project_job --env prod
B. databricks run my_project_job -t prod
C. databricks execute my_project_job -e prod
D. databricks bundle run my_project_job -t prod
Correct answer: D
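When the validation run is scripted, the command in option D can be assembled as an argument list. This is a minimal sketch, assuming the Databricks CLI is installed and authenticated; `bundle_run_command` is a hypothetical helper, and the actual invocation is left commented out.

```python
import shlex
from typing import List


def bundle_run_command(resource_key: str, target: str) -> List[str]:
    """Build the argv for `databricks bundle run <key> -t <target>`."""
    return ["databricks", "bundle", "run", resource_key, "-t", target]


cmd = bundle_run_command("my_project_job", "prod")
print(shlex.join(cmd))

# To execute for real (requires the CLI and a deployed bundle):
# import subprocess
# subprocess.run(cmd, check=True)
```

Passing the target explicitly keeps the run tied to the prod configuration in databricks.yml instead of whichever target is the default.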
Question 5:
A junior data engineer has manually configured a series of jobs using the Databricks Jobs UI.
Upon reviewing their work, the engineer realizes that they are listed as the "Owner" for each job.
They attempt to transfer "Owner" privileges to the "DevOps" group, but cannot successfully accomplish this task.
Which statement explains what is preventing this privilege transfer?
A. Databricks jobs must have exactly one owner; "Owner" privileges cannot be assigned to a group.
B. Only workspace administrators can grant "Owner" privileges to a group.
C. A user can only transfer job ownership to a group if they are also a member of that group.
D. Other than the default "admins" group, only individual users can be granted privileges on jobs.
E. The creator of a Databricks job will always have "Owner" privileges; this configuration cannot be changed.
Correct answer: A
Question 6:
A Data Engineer is building a simple data pipeline using Lakeflow Declarative Pipelines (LDP) in Databricks to ingest customer data. The raw customer data is stored in a cloud storage location in JSON format. The task is to create Lakeflow Declarative Pipelines that read the raw JSON data and write it into a Delta table for further processing. Which code snippet will correctly ingest the raw JSON data and create a Delta table using LDP?
A. import dlt
@dlt.table
def raw_customers():
    return spark.read.json("s3://my-bucket/raw-customers/")
B. import dlt
@dlt.table
def raw_customers():
    return spark.read.format("parquet").load("s3://my-bucket/raw-customers/")
C. import dlt
@dlt.view
def raw_customers():
    return spark.format.json("s3://my-bucket/raw-customers/")
D. import dlt
@dlt.table
def raw_customers():
    return spark.read.format("csv").load("s3://my-bucket/raw-customers/")
Correct answer: A
Question 7:
A junior member of the data engineering team is exploring the language interoperability of Databricks notebooks. The intended outcome of the below code is to register a view of all sales that occurred in countries on the continent of Africa that appear in the geo_lookup table.
Before executing the code, running SHOW TABLES on the current database indicates the database contains only two tables: geo_lookup and sales.

Which statement correctly describes the outcome of executing these command cells in order in an interactive notebook?
A. Both commands will succeed. Executing SHOW TABLES will show that countries_af and sales_af have been registered as views.
B. Both commands will fail. No new variables, tables, or views will be created.
C. Cmd 1 will succeed. Cmd 2 will search all accessible databases for a table or view named countries_af; if this entity exists, Cmd 2 will succeed.
D. Cmd 1 will succeed and Cmd 2 will fail; countries_af will be a Python variable containing a list of strings.
E. Cmd 1 will succeed and Cmd 2 will fail; countries_af will be a Python variable representing a PySpark DataFrame.
Correct answer: D
Question 8:
A departing platform owner currently holds ownership of multiple catalogs and controls storage credentials and external locations. A data engineer has been asked to ensure continuity: transfer catalog ownership to the platform team group, delegate ongoing privilege management, and retain the ability to receive and share data via Delta Sharing. Which role must be in place to perform these actions across the metastore?
A. Workspace Admin, because workspace admins can transfer ownership of any Unity Catalog object.
B. Catalog Owner, because catalog owners can transfer any object in any catalog in the metastore.
C. Metastore Admin, because metastore admins can transfer ownership and manage privileges across all metastore objects, including shares and recipients.
D. Account Admin, because account admins can only create metastores but cannot change ownership of catalogs.
Correct answer: C
Question 9:
A data engineering team is implementing an append-only data pipeline using Delta Lake, and wants to ensure that data is never modified or deleted once written. Which Delta Lake feature should the data engineer enable to prevent modifications to existing data?
A. Delta OPTIMIZE
B. Delta Time Travel
C. Delta APPEND_ONLY
D. Delta VACUUM
Correct answer: C
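Option C most likely refers to the `delta.appendOnly` table property. The sketch below only builds the SQL text in plain Python; the table name `events` is hypothetical, and running the statement needs a Spark session, which is assumed here and not shown.

```python
def append_only_statement(table_name: str) -> str:
    """Return the SQL that marks a Delta table as append-only."""
    return (
        f"ALTER TABLE {table_name} "
        "SET TBLPROPERTIES ('delta.appendOnly' = 'true')"
    )


statement = append_only_statement("events")
print(statement)

# With a Spark session available (assumed): spark.sql(statement)
```

Once the property is set, updates and deletes against existing rows are rejected, while appends continue to work.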
837 customer comments
诸冈** -
Many of the questions in the question set appeared on the exam. Pass4Test's question sets are reliable. I passed the exam. Thank you.