exam questions

Exam Professional Cloud Architect All Questions

View all questions & answers for the Professional Cloud Architect exam

Exam Professional Cloud Architect topic 1 question 109 discussion

Actual exam question from Google's Professional Cloud Architect
Question #: 109
Topic #: 1
[All Professional Cloud Architect Questions]

You are working at a sports association whose members range in age from 8 to 30. The association collects a large amount of health data, such as sustained injuries. You are storing this data in BigQuery. Current legislation requires you to delete such information upon request of the subject. You want to design a solution that can accommodate such a request. What should you do?

  • A. Use a unique identifier for each individual. Upon a deletion request, delete all rows from BigQuery with this identifier.
  • B. When ingesting new data in BigQuery, run the data through the Data Loss Prevention (DLP) API to identify any personal information. As part of the DLP scan, save the result to Data Catalog. Upon a deletion request, query Data Catalog to find the column with personal information.
  • C. Create a BigQuery view over the table that contains all data. Upon a deletion request, exclude the rows that affect the subject's data from this view. Use this view instead of the source table for all analysis tasks.
  • D. Use a unique identifier for each individual. Upon a deletion request, overwrite the column with the unique identifier with a salted SHA256 of its value.
Show Suggested Answer Hide Answer
Suggested Answer: A 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
milan74
Highly Voted 3 years, 8 months ago
According to me, the question states "The association collects a large amount of health data, such as sustained injuries." and the nuance on the word such => " Current legislation requires you to delete "SUCH" information upon request of the subject. " So from that point of view the question is not to delete the entire user records but specific data related to personal health data. With DLP you can use InfoTypes and InfoType detectors to specifically scan for those entries and how to act upon them (link https://cloud.google.com/dlp/docs/concepts-infotypes) I would say B.
upvoted 85 times
AmitAr
2 years, 10 months ago
(A) - Primary task is "legislation requires you to delete" .. and B is not deleting. only A is deleting
upvoted 11 times
BeCalm
2 years ago
Deletion is implied in "Upon a deletion request, query Data Catalog to find the column with personal information."
upvoted 1 times
...
...
zanfo
3 years ago
I want to delete all the informations about the user, not only those individuate by DLP. ALL THE INFORMATIONS of the users...B is not correct! the correct is A
upvoted 8 times
...
Ishu_awsguy
2 years, 7 months ago
There is no need of DLP. All the data is sensitive but only upon user request it needs deletion. So A should be the correct answer.
upvoted 12 times
...
Arad
3 years, 4 months ago
as PhilipKoku mentioned below: A) is the correct answer. B) is only masking the data and then when a request is received, it identified the record but it doesn’t delete it. D) Is masking the ID.
upvoted 12 times
mgm7
3 years, 3 months ago
B is not masking the data but identifying where it is to take action on at later date if required
upvoted 6 times
...
...
...
XDevX
Highly Voted 3 years, 9 months ago
IMHO a) is the correct answer because it is easier to operate. The question is not how to mask data and so on but just to delete data on request, so I don't think that we have to use for just the deletion of specific data DLP.
upvoted 36 times
...
halifax
Most Recent 3 weeks, 6 days ago
Selected Answer: B
Option A doesn't Delete all instances of PII, what happens if there is personal information on the descriptions? such as " sustained injuries". Delete just the identifiers is not complete.
upvoted 1 times
...
david_tay
1 month ago
Selected Answer: A
B is pointless as any health data is sensitive data, and also the fact that it did not mention deletion. Hence A is the correct answer.
upvoted 1 times
...
plumbig11
2 months, 3 weeks ago
Selected Answer: A
Use a unique identifier for each individual. Upon a deletion request, delete all rows from BigQuery with this identifier.
upvoted 1 times
...
rrope
3 months ago
Selected Answer: A
A. Use a unique identifier for each individual.
upvoted 1 times
...
andyk87
3 months, 2 weeks ago
Selected Answer: B
Option B is better when the requirement is to delete only the PII health data, not all data related to the individual.
upvoted 1 times
...
Sephethus
9 months, 2 weeks ago
It had better be A, if not then you're not a good organization
upvoted 2 times
...
hitmax87
10 months, 2 weeks ago
Selected Answer: B
Data Loss Prevention must have!
upvoted 1 times
...
Gino17m
10 months, 4 weeks ago
Selected Answer: B
I vote for B. I had some doubts whether A was correct, but: - I'm not convinced by the argument "only A talks about deleting" (it would be too easy if it was about choosing an answer containing the word "delete" ;) - the question says "design a solution that can accommodate such a request" - I'm not very fluent in english, but "accommodate" imho means more "facilitate" than "accomplish" here - I think that the task is about deleting health data not everything related with unique identifier - Data Catalog allows you to manage data, knowing in which datasets and in which tables what data is stored. Answer "A" somehow imposes the data model - each table with data related to a given individual must contain the ID of this individual (in a real data model this does not have to be the case).
upvoted 3 times
...
Djenko
1 year ago
Selected Answer: A
Should be A)
upvoted 2 times
...
mesodan
1 year ago
Selected Answer: A
A is correct. As for option B: While DLP is valuable for identifying sensitive data, it might not be sufficient for this specific case. DLP cannot necessarily determine an individual's right to deletion based solely on data classification. Additionally, relying on Data Catalog to store the results adds unnecessary complexity and potential inconsistencies.
upvoted 3 times
...
Gall
1 year, 1 month ago
Selected Answer: B
B. The A removes all data, not SUCH only.
upvoted 2 times
...
NoCrapEva
1 year, 2 months ago
Selected Answer: A
Ans. B assumes you will delete the Personal Information found in the Catalog... Some people are reading GDPR into this question (we are not told what country and what legislation). The question states you must delete all information (not just personal informarion) on request. Ans B is a red herring ! Answer must = A
upvoted 2 times
...
Roro_Brother
1 year, 3 months ago
Selected Answer: A
(A) - Primary task is "legislation requires you to delete" .. and B is not deleting. only A is deleting
upvoted 5 times
...
Jconnor
1 year, 3 months ago
Well, A would delete all rows with the identifier, I guess including the ones that are not confidential, also what does it mean unique identifier? each user is unique already. Ridiculous. B would identify the columns that contain personal data, but B is prone to errors as changes in legislation of what is consider injury would be excluded and all data would need to be re- ingested. Unfortunately B is closer and less damaging than A.
upvoted 1 times
...
thewalker
1 year, 4 months ago
Selected Answer: B
Either A or B is the answer. A - will delete all the info about the subject, which is not the intension. Only the sensitive data to be deleted. Hence, B.
upvoted 2 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...
exam
Someone Bought Contributor Access for:
SY0-701
London, 1 minute ago