Welcome to ExamTopics
ExamTopics Logo
- Expert Verified, Online, Free.
exam questions

Exam Professional Data Engineer All Questions

View all questions & answers for the Professional Data Engineer exam

Exam Professional Data Engineer topic 1 question 103 discussion

Actual exam question from Google's Professional Data Engineer
Question #: 103
Topic #: 1
[All Professional Data Engineer Questions]

You have a data stored in BigQuery. The data in the BigQuery dataset must be highly available. You need to define a storage, backup, and recovery strategy of this data that minimizes cost. How should you configure the BigQuery table that have a recovery point objective (RPO) of 30 days?

  • A. Set the BigQuery dataset to be regional. In the event of an emergency, use a point-in-time snapshot to recover the data.
  • B. Set the BigQuery dataset to be regional. Create a scheduled query to make copies of the data to tables suffixed with the time of the backup. In the event of an emergency, use the backup copy of the table.
  • C. Set the BigQuery dataset to be multi-regional. In the event of an emergency, use a point-in-time snapshot to recover the data.
  • D. Set the BigQuery dataset to be multi-regional. Create a scheduled query to make copies of the data to tables suffixed with the time of the backup. In the event of an emergency, use the backup copy of the table.
Show Suggested Answer Hide Answer
Suggested Answer: C 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
DeepakVenkatachalam
Highly Voted 1 year, 2 months ago
Answer is B. Timetravel only covers for 7 days and a scheduled query is needed for creating Table snapshots for 30 days. Also table snapshot must remain in the same region as base table (please refer to limitation of table snapshot from below link) https://cloud.google.com/bigquery/docs/table-snapshots-intro
upvoted 9 times
...
desertlotus1211
Highly Voted 1 year, 10 months ago
Answer is C: https://cloud.google.com/bigquery/docs/table-snapshots-intro "Benefits of using table snapshots include the following: Keep a record for longer than seven days. With BigQuery time travel, you can only access a table's data from seven days ago or more recently. With table snapshots, you can preserve a table's data from a specified point in time for as long as you want. Minimize storage cost. BigQuery only stores bytes that are different between a snapshot and its base table, so a table snapshot typically uses less storage than a full copy of the table." But the wording is foolish... It's table snapshot, NOT point in time snapshot! https://cloud.google.com/bigquery/docs/time-travel#restore-a-table this is point in time using time travel window - max is 7 days...
upvoted 5 times
...
Erg_de
Most Recent 3 weeks, 1 day ago
Selected Answer: B
Best choice, minimized cost
upvoted 1 times
...
Gcpteamprep
1 month ago
Selected Answer: B
Minimized Cost with Regional Storage: Regional datasets are less costly than multi-regional datasets in BigQuery. Since there is no requirement here for multi-regional availability, regional storage meets the high availability need while keeping costs lower. RPO Compliance with Scheduled Backups: A scheduled query that periodically creates copies of the data (e.g., daily or weekly, depending on the requirements) allows for recovery within the 30-day RPO, meeting the requirement for data retention and recovery. Point-in-Time Recovery Not Native in BigQuery: Although BigQuery provides a limited "table snapshot" feature, it’s not a true point-in-time recovery option for the last 30 days. Creating periodic backups through scheduled queries gives you control over retention, enabling you to keep backups for 30 days and reducing dependency on more costly or limited snapshot capabilities.
upvoted 2 times
...
Vogangster
1 month, 2 weeks ago
D.Create monthly snapshots of a table by using a service account that runs a scheduled query. Link: https://cloud.google.com/bigquery/docs/table-snapshots-scheduled
upvoted 1 times
...
AlizCert
5 months, 3 weeks ago
Selected Answer: D
HA => multi-region 30-days RPO => manual backups as max time-travel is 7 days
upvoted 3 times
...
Lestrang
6 months, 1 week ago
This is in one of google's training practice questions and the answer for it is C.
upvoted 3 times
NickNtaken
6 months ago
Agreed. Multi-regional datasets offer higher availability by replicating data across multiple regions
upvoted 1 times
...
...
rocky48
11 months, 3 weeks ago
Selected Answer: A
ou should consider option A. Setting the BigQuery dataset to be regional and using a point-in-time snapshot to recover the data in the event of an emergency can help you achieve the desired level of availability and minimize cost. This approach can help you avoid the additional cost of creating and maintaining backup copies of the data, which can be expensive. Setting the BigQuery dataset to be multi-regional (options C and D) can provide additional redundancy and availability. However, this approach can be more expensive than setting the dataset to be regional, especially if you do not require the additional level of redundancy.
upvoted 3 times
...
Nirca
1 year, 1 month ago
Selected Answer: A
I'm going for A: 1. Set the BigQuery dataset to be regional. This will reduce the cost of storage compared to a multi-regional dataset. 2. building Snapshot: bq snapshot --dataset <dataset_id> --table <table_id> <snapshot_id>
upvoted 3 times
ffggrre
1 year, 1 month ago
typically Multi-region cost is equal or less than a region. https://cloud.google.com/bigquery/pricing#storage
upvoted 1 times
...
...
ckanaar
1 year, 2 months ago
I think the answer is A: This option meets the 30-day RPO requirement, assuming that the snapshot is maintained for that long. It offers high availability as data is written synchronously to 2 zones within a region: https://cloud.google.com/blog/topics/developers-practitioners/backup-disaster-recovery-strategies-bigquery/. The cost would be lower than maintaining a multi-regional dataset, but you'll incur the cost of the snapshot.
upvoted 4 times
...
lucaluca1982
1 year, 8 months ago
Why not B? Setting dataset regional or multi does not affect the backup and recovery strategy.
upvoted 3 times
...
midgoo
1 year, 8 months ago
Selected Answer: C
1. HA -> Multi-region 2. DR -> Snapshot
upvoted 4 times
...
kostol
1 year, 9 months ago
Selected Answer: D
https://cloud.google.com/bigquery/docs/table-snapshots-scheduled
upvoted 2 times
...
YatMoh
1 year, 11 months ago
One of the criteria is to minimize cost. While C is an ideal solution but more expensive compared to D. In my opinion the correct answer should be D.
upvoted 2 times
jkhong
1 year, 11 months ago
How is using snapshots more expensive, snapshots store the delta of the state of the table during time of taking snapshot and the current state of the table (https://cloud.google.com/bigquery/docs/table-snapshots-intro#storage_costs) When a snapshot is created, there is literally zero cost... until changes are made to the base table, only then will the delta storage cost be charged.. This certainly seems cheaper than creating a new table
upvoted 2 times
...
...
raghu06raj
1 year, 11 months ago
Even when the dataset/table is regional, google provides high availability with data available in 2 zones. Isn't A correct answer?
upvoted 3 times
...
zellck
1 year, 11 months ago
Selected Answer: C
C is the answer. https://cloud.google.com/bigquery/docs/table-snapshots-intro#table_snapshots A BigQuery table snapshot preserves the contents of a table (called the base table) at a particular time. You can save a snapshot of a current table, or create a snapshot of a table as it was at any time in the past seven days. A table snapshot can have an expiration; when the configured amount of time has passed since the table snapshot was created, BigQuery deletes the table snapshot. You can query a table snapshot as you would a standard table. Table snapshots are read-only, but you can create (restore) a standard table from a table snapshot, and then you can modify the restored table.
upvoted 4 times
...
hauhau
1 year, 12 months ago
Selected Answer: D
point in time snapshots only have data from past 7 days
upvoted 3 times
jkhong
1 year, 11 months ago
We can keep PIT snapshots for as long as we want. I think you confused snapshots with time travel... "With BigQuery time travel, you can only access a table's data from seven days ago or more recently. With table snapshots, you can preserve a table's data from a specified point in time for as long as you want." https://cloud.google.com/bigquery/docs/table-snapshots-intro#table_snapshots
upvoted 1 times
...
desertlotus1211
1 year, 10 months ago
https://cloud.google.com/bigquery/docs/time-travel#configure_the_time_travel_window 'You can set the duration of the time travel window, from a minimum of two days to a maximum of seven days' Not the same as snapshot
upvoted 1 times
...
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...