Welcome to ExamTopics
ExamTopics Logo
- Expert Verified, Online, Free.
exam questions

Exam Professional Data Engineer All Questions

View all questions & answers for the Professional Data Engineer exam

Exam Professional Data Engineer topic 1 question 295 discussion

Actual exam question from Google's Professional Data Engineer
Question #: 295
Topic #: 1
[All Professional Data Engineer Questions]

You are designing the architecture to process your data from Cloud Storage to BigQuery by using Dataflow. The network team provided you with the Shared VPC network and subnetwork to be used by your pipelines. You need to enable the deployment of the pipeline on the Shared VPC network. What should you do?

  • A. Assign the compute.networkUser role to the Dataflow service agent.
  • B. Assign the compute.networkUser role to the service account that executes the Dataflow pipeline.
  • C. Assign the dataflow.admin role to the Dataflow service agent.
  • D. Assign the dataflow.admin role to the service account that executes the Dataflow pipeline.
Show Suggested Answer Hide Answer
Suggested Answer: B 🗳️

Comments

Chosen Answer:
This is a voting comment (?) , you can switch to a simple comment.
Switch to a voting comment New
raaad
Highly Voted 10 months, 2 weeks ago
Selected Answer: A
- Dataflow service agent is the one responsible for setting up and managing the network resources that Dataflow requires. - By granting the compute.networkUser role to this service agent, we are enabling it to provision the necessary network resources within the Shared VPC for your Dataflow job.
upvoted 9 times
...
SamuelTsch
Most Recent 3 weeks ago
Selected Answer: A
From https://cloud.google.com/dataflow/docs/guides/specifying-networks, it says "Make sure the Shared VPC subnetwork is shared with the Dataflow service account and has the Compute Network User role assigned on the specified subnet. The Compute Network User role must be assigned to the Dataflow service account in the host project."
upvoted 1 times
ach5
2 weeks, 4 days ago
service account - it's B
upvoted 3 times
...
...
Preetmehta1234
1 month, 4 weeks ago
Selected Answer: B
If you see in the comments, A was answer by people around 8 months ago but recent ones have answered B with the documentation. The GCP documentation evolves with time
upvoted 3 times
...
Preetmehta1234
1 month, 4 weeks ago
Selected Answer: B
service account that executes the Dataflow pipeline It's straight forward
upvoted 2 times
...
Preetmehta1234
1 month, 4 weeks ago
Selected Answer: B
Assign the compute.networkUser role to the service account that executes the Dataflow pipeline
upvoted 2 times
...
Jeyaraj
4 months ago
The correct answer is B. Assign the compute.networkUser role to the service account that executes the Dataflow pipeline. Here's why: Shared VPC and Network Access: When using a Shared VPC, you need to grant specific permissions to service accounts in the service project (where your Dataflow pipeline runs) to access resources in the host project's network. compute.networkUser Role: This role grants the necessary permissions for a service account to use the network resources in the Shared VPC. This includes accessing subnets, creating instances, and communicating with other services within the network. Service Account for Pipeline Execution: The service account that executes your Dataflow pipeline is the one that needs these network permissions. This is because the Dataflow service uses this account to create and manage worker instances within the Shared VPC network.
upvoted 3 times
...
extraego
5 months, 2 weeks ago
Selected Answer: B
Dataflow service agent is a role that is assigned to a service account. So is compute.networkUser. https://cloud.google.com/dataflow/docs/concepts/access-control#example
upvoted 4 times
...
josech
6 months, 1 week ago
Selected Answer: B
Option B https://cloud.google.com/knowledge/kb/dataflow-job-in-shared-vpc-xpn-permissions-000004261
upvoted 4 times
...
chrissamharris
6 months, 3 weeks ago
Selected Answer: B
I believe the answer is B. All authentication documentation points to Service Accounts. https://cloud.google.com/dataflow/docs/concepts/authentication#on-gcp Dataflow service agent typically manages general interactions with the Dataflow service but does not execute the actual jobs.
upvoted 3 times
...
saschak94
9 months, 2 weeks ago
Selected Answer: A
All projects that have used the resource Dataflow Job have a Dataflow Service Account, also known as the Dataflow service agent. Make sure the Shared VPC subnetwork is shared with the Dataflow service account and has the Compute Network User role assigned on the specified subnet.
upvoted 4 times
...
Matt_108
10 months, 2 weeks ago
Selected Answer: A
Option A, I do agree with Raaad, it's the dataflow service agent that needs the networkUser role, because it's the one that provisions the network resources https://cloud.google.com/dataflow/docs/guides/specifying-networks#shared
upvoted 3 times
tibuenoc
9 months, 2 weeks ago
But your link it's explain that "Network User role must be assigned to the Dataflow service account" Make sure the Shared VPC subnetwork is shared with the Dataflow service account and has the Compute Network User role assigned on the specified subnet. The Compute Network User role must be assigned to the Dataflow service account in the host project.
upvoted 1 times
ML6
9 months, 1 week ago
All projects that have used the resource Dataflow Job have a Dataflow Service Account, also known as the Dataflow service agent. Source: https://cloud.google.com/dataflow/docs/concepts/security-and-permissions#df-service-account
upvoted 2 times
...
...
...
task_7
10 months, 2 weeks ago
Selected Answer: B
compute.networkUser to the service account that executes the Dataflow pipeline.
upvoted 4 times
...
BIGQUERY_ALT_ALT
10 months, 2 weeks ago
Selected Answer: B
Option B is Correct. Explanation: You need to give compute networkuser role to service account that is processing the pipeline as it will need to deploy nessesary worker nodes on the shared vpc project. Option A is incorrect as Dataflow Service Agent is Google MGS service account that will not responsible for running or deoplying workers in shared vpc. Option C and D is incorrect as dataflow.admin is elevated privlages to create and manage all of dataflow components not deploying resources in shared vpc.
upvoted 2 times
...
GCP001
10 months, 2 weeks ago
B. Assign the compute.networkUser role to the service account that executes the Dataflow pipeline. See the ref - https://cloud.google.com/dataflow/docs/guides/specifying-networks
upvoted 2 times
raaad
10 months, 2 weeks ago
Option A makes more sense: - Assigning the compute.networkUser role to the pipeline's service account grants it unnecessary and possibly excessive permissions outside its core responsibility of data processing. The question focused specifically on the deployment aspect (i.e., provisioning of network resources like VMs) rather than what the pipeline accesses or processes once it's running.
upvoted 1 times
GCP001
10 months, 1 week ago
Yes , I agree, it should be A. Dataflow service account should be the one having this permission instaed of worker
upvoted 1 times
8284a4c
2 weeks, 6 days ago
The compute.networkUser role needs to be assigned to the specific service account running the Dataflow job, not the Dataflow service agent, as the service agent does not execute the pipeline.
upvoted 1 times
...
...
...
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...