Exam Professional Data Engineer topic 1 question 306 discussion

Actual exam question from Google's Professional Data Engineer
Question #: 306
Topic #: 1

You are preparing an organization-wide dataset. You need to preprocess customer data stored in a restricted bucket in Cloud Storage. The data will be used to create consumer analyses. You need to comply with data privacy requirements.

What should you do?

  • A. Use Dataflow and the Cloud Data Loss Prevention API to mask sensitive data. Write the processed data in BigQuery.
  • B. Use customer-managed encryption keys (CMEK) to directly encrypt the data in Cloud Storage. Use federated queries from BigQuery. Share the encryption key by following the principle of least privilege.
  • C. Use the Cloud Data Loss Prevention API and Dataflow to detect and remove sensitive fields from the data in Cloud Storage. Write the filtered data in BigQuery.
  • D. Use Dataflow and Cloud KMS to encrypt sensitive fields and write the encrypted data in BigQuery. Share the encryption key by following the principle of least privilege.
Suggested Answer: A

Comments

raaad
Highly Voted 9 months, 2 weeks ago
Selected Answer: A
- Prioritizes Data Privacy: It protects sensitive information by masking it, reducing the risk of exposure in case of unauthorized access or accidental leaks.
- Reduces Data Sensitivity: Masking renders sensitive data unusable for attackers, even if they gain access to it.
- Preserves Data Utility: Masked data can still be used for consumer analyses, as patterns and relationships are often preserved, allowing meaningful insights to be derived.
upvoted 13 times
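To make the "preserves data utility" point concrete, here is a minimal sketch of character masking in plain Python. It does not call the Cloud Data Loss Prevention API; the function name `mask_value`, the field names, and the sample record are hypothetical and only illustrate the idea that a masked field hides most of its value while the non-sensitive fields stay usable for analysis.

```python
def mask_value(value: str, mask_char: str = "#", keep_last: int = 4) -> str:
    """Replace all but the last `keep_last` characters with `mask_char`."""
    if keep_last <= 0:
        return mask_char * len(value)
    visible = value[-keep_last:]
    return mask_char * (len(value) - len(visible)) + visible


# Hypothetical customer record; only the phone number is treated as sensitive.
record = {"customer_id": "c-1001", "phone": "555-867-5309", "basket_total": 42.5}

# Mask the sensitive field and leave the analytical fields untouched.
masked = {**record, "phone": mask_value(record["phone"])}

print(masked)
```

In a real pipeline the masking step would presumably run inside a Dataflow transform before the write to BigQuery, as option A describes.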
dungct
8 months, 1 week ago
Why not D?
upvoted 2 times
ML6
8 months ago
Data in Cloud Storage is encrypted by default.
upvoted 2 times
desertlotus1211
Most Recent 2 weeks, 6 days ago
Selected Answer: C
If I had to choose between C and A, I would choose C. A can still leave some sensitive data exposed.
upvoted 1 times
desertlotus1211
2 weeks, 6 days ago
For data privacy, removing data through a DLP (Data Loss Prevention) system is generally considered better than masking: removal permanently eliminates sensitive information, whereas masking only conceals it, potentially leaving traces or vulnerabilities.
upvoted 1 times
AlizCert
4 months, 2 weeks ago
Selected Answer: A
What made me decide on A instead of C was the sentence "The data will be used to create consumer analyses." With all PII completely redacted from the records, we would be unable to distinguish between individual customers.
upvoted 3 times
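The trade-off between A and C can be sketched in plain Python. This is an illustration under assumptions, not the Cloud DLP API: it uses a keyed hash (HMAC-SHA256) as one possible stand-in for "masking" that keeps customers distinguishable, and the key, helper names, and sample orders are all hypothetical. Whether the exam's "mask" implies this kind of transform is itself an assumption.

```python
import hashlib
import hmac

# Hypothetical demo key; a real pipeline would fetch a managed secret instead.
SECRET_KEY = b"demo-key-not-for-production"


def pseudonymize(value: str) -> str:
    """Return a deterministic token: equal inputs give equal tokens."""
    digest = hmac.new(SECRET_KEY, value.encode("utf-8"), hashlib.sha256)
    return digest.hexdigest()[:12]


def redact(record: dict, sensitive_fields: set) -> dict:
    """Option C style: drop the sensitive fields entirely."""
    return {k: v for k, v in record.items() if k not in sensitive_fields}


def tokenize(record: dict, sensitive_fields: set) -> dict:
    """Option A style (as assumed here): replace sensitive values with tokens."""
    return {
        k: pseudonymize(v) if k in sensitive_fields else v
        for k, v in record.items()
    }


# Hypothetical orders; two of them belong to the same customer.
orders = [
    {"email": "ana@example.com", "amount": 20},
    {"email": "bo@example.com", "amount": 35},
    {"email": "ana@example.com", "amount": 15},
]
sensitive = {"email"}

redacted = [redact(o, sensitive) for o in orders]
tokenized = [tokenize(o, sensitive) for o in orders]

# Tokenized rows can still be grouped per customer; redacted rows cannot.
distinct_customers = len({o["email"] for o in tokenized})
print(distinct_customers, redacted)
```

With the email column removed, the three redacted rows carry no customer identity at all, while the tokenized rows still group into two customers without exposing the addresses.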
Matt_108
9 months, 1 week ago
Selected Answer: A
Option A, agree with raaad explanation
upvoted 1 times
scaenruy
9 months, 2 weeks ago
Selected Answer: A
A. Use Dataflow and the Cloud Data Loss Prevention API to mask sensitive data. Write the processed data in BigQuery.
upvoted 2 times
Community vote distribution: A (35%, most voted), C (25%), B (20%), Other