You are preparing data that your machine learning team will use to train a model using BigQueryML. They want to predict the price per square foot of real estate. The training data has a column for the price and a column for the number of square feet. Another feature column called
'feature1' contains null values due to missing data. You want to replace the nulls with zeros to keep more data points. Which query should you use?
A.

B.

C.

D.

正解:B
質問 2:
You are planning to load some of your existing on-premises data into BigQuery on Google Cloud.
You want to either stream or batch-load data, depending on your use case. Additionally, you want to mask some sensitive data before loading into BigQuery. You need to do this in a programmatic way while keeping costs to a minimum. What should you do?
A. Use the BigQuery Data Transfer Service to schedule your migration. After the data is populated in BigQuery, use the connection to the Cloud Data Loss Prevention (Cloud DLP) API to de-identify the necessary data.
B. Set up Datastream to replicate your on-premise data on BigQuery.
C. Use Cloud Data Fusion to design your pipeline, use the Cloud DLP plug-in to de-identify data within your pipeline, and then move the data into BigQuery.
D. Create your pipeline with Dataflow through the Apache Beam SDK for Python, customizing separate options within your code for streaming, batch processing, and Cloud DLP. Select BigQuery as your data sink.
正解:D
質問 3:
Cloud Dataproc is a managed Apache Hadoop and Apache _____ service.
A. Ignite
B. Fire
C. Blaze
D. Spark
正解:D
解説: (Pass4Test メンバーにのみ表示されます)
質問 4:
Your company is selecting a system to centralize data ingestion and delivery. You are considering messaging and data integration systems to address the requirements. The key requirements are:
- The ability to seek to a particular offset in a topic, possibly back
to the start of all data ever captured
- Support for publish/subscribe semantics on hundreds of topics
- Retain per-key ordering
Which system should you choose?
A. Dataflow
B. Firebase Cloud Messaging
C. Cloud Storage
D. Apache Kafka
正解:D
解説: (Pass4Test メンバーにのみ表示されます)
質問 5:
Which of the following is not possible using primitive roles?
A. Give a user access to view all datasets in a project, but not run queries on them.
B. Give UserA owner access and UserB editor access for all datasets in a project.
C. Give GroupA owner access and GroupB editor access for all datasets in a project.
D. Give a user viewer access to BigQuery and owner access to Google Compute Engine instances.
正解:A
解説: (Pass4Test メンバーにのみ表示されます)
質問 6:
You need to connect multiple applications with dynamic public IP addresses to a Cloud SQL instance. You configured users with strong passwords and enforced the SSL connection to your Cloud SQL instance. You want to use Cloud SQL public IP and ensure that you have secured connections. What should you do?
A. Leave the Authorized Network empty. Use Cloud SQL Auth proxy on all applications.
B. Add CIDR 0.0.0.0/0 network to Authorized Network. Use Cloud SQL Auth proxy on all applications.
C. Add CIDR 0.0.0.0/0 network to Authorized Network. Use Identity and Access Management (IAM) to add users.
D. Add all application networks to Authorized Network and regularly update them.
正解:A
質問 7:
You want to use a BigQuery table as a data sink. In which writing mode(s) can you use BigQuery as a sink?
A. Only streaming
B. BigQuery cannot be used as a sink
C. Only batch
D. Both batch and streaming
正解:D
解説: (Pass4Test メンバーにのみ表示されます)
1160 お客様のコメント
クリック」





Suho -
要点をしっかり抑えながら学ぶことができます。より効率良く合格を目指す私のための,必携のProfessional-Data-Engineer試験対策書だと思う