exam questions

Exam Professional Machine Learning Engineer All Questions

View all questions & answers for the Professional Machine Learning Engineer exam

Exam Professional Machine Learning Engineer topic 1 question 300 discussion

Actual exam question from Google's Professional Machine Learning Engineer
Question #: 300
Topic #: 1
[All Professional Machine Learning Engineer Questions]

You work for a bank. You need to train a model by using unstructured data stored in Cloud Storage that predicts whether credit card transactions are fraudulent. The data needs to be converted to a structured format to facilitate analysis in BigQuery. Company policy requires that data containing personally identifiable information (PII) remain in Cloud Storage. You need to implement a scalable solution that preserves the data’s value for analysis. What should you do?

  • A. Use BigQuery’s authorized views and column-level access controls to restrict access to PII within the dataset.
  • B. Use the DLP API to de-identify the sensitive data before loading it into BigQuery.
  • C. Store the unstructured data in a separate PII-compliant BigQuery database.
  • D. Remove the sensitive data from the files manually before loading them into BigQuery.
Show Suggested Answer Hide Answer
Suggested Answer: B 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
tk786786
1 day, 21 hours ago
Selected Answer: B
B. Use the DLP API to de-identify the sensitive data before loading it into BigQuery. Why Option B? Ensures Compliance with Company Policy Company policy requires PII to remain in Cloud Storage. Google Cloud Data Loss Prevention (DLP) API can de-identify (mask, tokenize, or redact) PII while preserving its analytical value. Only de-identified structured data is moved to BigQuery, ensuring compliance. Preserves Data Utility for Analysis DLP API supports format-preserving encryption (FPE) and tokenization, allowing analysis without exposing sensitive details. Fraud detection models can still leverage de-identified transaction patterns without accessing raw PII. Scalable and Automated Solution DLP API can be used in a Dataflow pipeline to process large amounts of unstructured data before ingestion. Avoids manual effort (as required in Option D) and provides consistent security measures.
upvoted 1 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...
exam
Someone Bought Contributor Access for:
SY0-701
London, 1 minute ago