Exam Professional Cloud Architect topic 1 question 174 discussion

Actual exam question from Google's Professional Cloud Architect
Question #: 174
Topic #: 1

You are working with a data warehousing team that performs data analysis. The team needs to process data from external partners, but the data contains personally identifiable information (PII). You need to process and store the data without storing any of the PII data. What should you do?

  • A. Create a Dataflow pipeline to retrieve the data from the external sources. As part of the pipeline, use the Cloud Data Loss Prevention (Cloud DLP) API to remove any PII data. Store the result in BigQuery.
  • B. Create a Dataflow pipeline to retrieve the data from the external sources. As part of the pipeline, store all non-PII data in BigQuery and store all PII data in a Cloud Storage bucket that has a retention policy set.
  • C. Ask the external partners to upload all data on Cloud Storage. Configure Bucket Lock for the bucket. Create a Dataflow pipeline to read the data from the bucket. As part of the pipeline, use the Cloud Data Loss Prevention (Cloud DLP) API to remove any PII data. Store the result in BigQuery.
  • D. Ask the external partners to import all data in your BigQuery dataset. Create a Dataflow pipeline to copy the data into a new table. As part of the Dataflow pipeline, skip all data in columns that have PII data.
Suggested Answer: A 🗳️
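
As a rough illustration of the de-identification step in option A, the following is a minimal sketch, assuming the `google-cloud-dlp` Python client and its `DlpServiceClient.deidentify_content` method. The function names, the project ID, and the choice of info types are hypothetical and not part of the question; the transformation shown (replacing each finding with its info type name) is only one of several that Cloud DLP offers.

```python
from typing import Any, Dict, List


def build_deidentify_request(
    project_id: str, text: str, info_types: List[str]
) -> Dict[str, Any]:
    """Build the request dict for DlpServiceClient.deidentify_content.

    Each finding is replaced with the name of its info type,
    e.g. an email address becomes "[EMAIL_ADDRESS]".
    """
    return {
        "parent": f"projects/{project_id}/locations/global",
        # Which kinds of PII to look for (assumed list, adjust as needed).
        "inspect_config": {"info_types": [{"name": name} for name in info_types]},
        # How to transform every finding before the data is stored.
        "deidentify_config": {
            "info_type_transformations": {
                "transformations": [
                    {"primitive_transformation": {"replace_with_info_type_config": {}}}
                ]
            }
        },
        "item": {"value": text},
    }


def deidentify(project_id: str, text: str, info_types: List[str]) -> str:
    """Send one text value to Cloud DLP and return the de-identified value."""
    # Imported lazily so the request builder can be used without the library.
    from google.cloud import dlp_v2

    client = dlp_v2.DlpServiceClient()
    response = client.deidentify_content(
        request=build_deidentify_request(project_id, text, info_types)
    )
    return response.item.value
```

In a Dataflow (Apache Beam) pipeline, a function like `deidentify` could be applied to each record before the BigQuery write, so that no PII reaches storage; batching and error handling are left out of this sketch.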

Comments

StelSen
Highly Voted 2 years, 6 months ago
Option A is correct. Although Option C sounds good, ultimately we should not store PII data at all, as the question says.
upvoted 50 times
...
edilramos
Highly Voted 2 years, 6 months ago
Selected Answer: A
The correct answer is A. Option C looks plausible, but it has two non-conformities: it stores personal data in GCS, and it retains that data improperly.
upvoted 14 times
...
dija123
Most Recent 2 months, 3 weeks ago
Selected Answer: A
Agree with A
upvoted 1 times
...
ptapia_el
3 months, 3 weeks ago
Selected Answer: A
This is the best option.
upvoted 1 times
...
tamj123
8 months, 2 weeks ago
A is the best answer. C seems to be an extra step, and uploading to a bucket first is a security risk.
upvoted 1 times
...
BiddlyBdoyng
1 year ago
The problem with C is the data is stored in the bucket with the PII data even though the BigQuery data has it removed?
upvoted 3 times
johny_doe
10 months ago
exactly
upvoted 1 times
...
...
AugustoKras011111
1 year, 4 months ago
Selected Answer: A
Option A. The question says not to store the PII data.
upvoted 1 times
...
someCloudUser
1 year, 4 months ago
Selected Answer: A
A is correct.
upvoted 1 times
...
telp
1 year, 4 months ago
Selected Answer: A
Answer A. The question says not to store PII data, so it needs to be removed before storing.
upvoted 1 times
...
rotorclear
1 year, 4 months ago
Selected Answer: A
Answer should be A because the question emphasises processing the data without storing the PII. That rules out C.
upvoted 1 times
...
RVivek
1 year, 4 months ago
Selected Answer: A
C is wrong because the PII data is uploaded and the bucket is locked, which means the data cannot be deleted. B and D are wrong as they do not use Data Loss Prevention to protect the data.
upvoted 2 times
...
dataqueen_3110
1 year, 4 months ago
PII --> Cloud DLP. So that narrows the choices down to A or C. C says "Ask the external partners to upload all data on Cloud Storage" which is not generally a feasible or recommended practice. Also, we cannot store PII anywhere, including in GCS. Answer is A.
upvoted 1 times
...
Wael216
1 year, 6 months ago
Selected Answer: A
A is correct. C sounds good, but storing the data in GCS is already a violation of the PII requirements.
upvoted 1 times
...
omermahgoub
1 year, 6 months ago
I would recommend option A, creating a Dataflow pipeline to retrieve the data from the external sources and using the Cloud Data Loss Prevention (Cloud DLP) API to remove any PII data. Storing the result in BigQuery would allow the data warehousing team to easily perform analysis on the data. Option C, using Bucket Lock to protect the data and using the Cloud DLP API to remove PII data, would protect the data from unauthorized access, but would not allow the data warehousing team to easily perform analysis on the data.
upvoted 5 times
omermahgoub
1 year, 6 months ago
Option B, storing non-PII data in BigQuery and PII data in a Cloud Storage bucket with a retention policy set, would not fully protect the PII data and could potentially lead to data breaches. Option D, copying the data into a new table and skipping columns with PII data, would not fully protect the PII data and could potentially lead to data breaches. It would also require the data warehousing team to manually skip certain columns when performing analysis, which could be time-consuming and error-prone.
upvoted 2 times
...
...
surajkrishnamurthy
1 year, 6 months ago
Selected Answer: A
A Is the Correct Answer
upvoted 1 times
...
ardit
1 year, 6 months ago
Selected Answer: A
A is the right one.
upvoted 1 times
...
jaxclain
1 year, 7 months ago
Selected Answer: A
Of course the correct answer is A, not sure how some people think C is valid, probably trolling trying to confuse some here.
upvoted 2 times
...