Exam Professional Data Engineer topic 1 question 309 discussion

Actual exam question from Google's Professional Data Engineer
Question #: 309
Topic #: 1

You work for an airline and you need to store weather data in a BigQuery table. Weather data will be used as input to a machine learning model. The model only uses the last 30 days of weather data. You want to avoid storing unnecessary data and minimize costs. What should you do?

  • A. Create a BigQuery table where each record has an ingestion timestamp. Run a scheduled query to delete all the rows with an ingestion timestamp older than 30 days.
  • B. Create a BigQuery table partitioned by datetime value of the weather date. Set up partition expiration to 30 days.
  • C. Create a BigQuery table partitioned by ingestion time. Set up partition expiration to 30 days.
  • D. Create a BigQuery table with a datetime column for the day the weather data refers to. Run a scheduled query to delete rows with a datetime value older than 30 days.
Suggested Answer: B
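
To make option B concrete, here is a minimal sketch that builds the kind of DDL statement the option describes: a table partitioned by day on the weather datetime, with a 30-day partition expiration. The table ID and the column names (`weather_time`, `station_id`, `temperature_c`) are hypothetical and not from the question; the function only builds the SQL string and does not call BigQuery.

```python
def build_weather_table_ddl(table_id: str, expiration_days: int = 30) -> str:
    """Return a CREATE TABLE statement for a day-partitioned weather table.

    Column names are illustrative assumptions, not part of the exam question.
    """
    return (
        f"CREATE TABLE `{table_id}` (\n"
        "  weather_time DATETIME NOT NULL,\n"  # the time the weather data refers to
        "  station_id STRING,\n"
        "  temperature_c FLOAT64\n"
        ")\n"
        # One partition per calendar day of the weather datetime.
        "PARTITION BY DATETIME_TRUNC(weather_time, DAY)\n"
        # Partitions are dropped automatically once they are older than this.
        f"OPTIONS (partition_expiration_days = {expiration_days})"
    )


if __name__ == "__main__":
    # Hypothetical table ID, for illustration only.
    print(build_weather_table_ddl("my_project.my_dataset.weather"))
```

With this approach no scheduled DELETE query is needed, which is the cost argument usually made against options A and D.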

Comments

AllenChen123
Highly Voted 1 year ago
Selected Answer: B
Partitioned based on weather date, with partition expiration set
upvoted 6 times
...
iooj
Highly Voted 6 months ago
Selected Answer: B
got this one on the exam, aug 2024, passed
upvoted 6 times
...
juliorevk
Most Recent 1 week, 4 days ago
Selected Answer: B
B. BigQuery partitioning with a partition expiration of 30 days keeps only the last 30 days of data; partitions older than 30 days are deleted automatically.
upvoted 1 times
...
d11379b
10 months, 2 weeks ago
https://cloud.google.com/bigquery/docs/partitioned-tables Here it mentions: "For TIMESTAMP and DATETIME columns, the partitions can have either hourly, daily, monthly, or yearly granularity." So you should not calculate the number of partitions at second granularity.
upvoted 3 times
...
chambg
11 months ago
Selected Answer: D
Skeptical about Option B, as the maximum number of partitions in a BQ table is 4000. Since the datetime value is a timestamp, it will have more than 4000 values over 30 days (30*24*60*60 = 2,592,000). So Option D is right imo.
upvoted 1 times
ce9e395
10 months, 4 weeks ago
This is a good point
upvoted 1 times
joao_01
9 months, 3 weeks ago
It's not a good point. The granularity goes to DAYs, not SECONDs. So, the right answer is B.
upvoted 6 times
...
...
...
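
The disagreement in this thread comes down to arithmetic, which can be checked with a short sketch. The 4000-partition limit is the figure quoted in the comment above and is not verified here; "second" granularity is included only to reproduce the commenter's calculation, since the quoted documentation lists hourly, daily, monthly, and yearly granularity.

```python
SECONDS_PER_DAY = 24 * 60 * 60

# Limit as cited in the thread; treat it as an assumption, not a verified quota.
CITED_PARTITION_LIMIT = 4000

# Partitions created per day of data at each granularity.
# "second" is hypothetical: it is not a granularity the quoted docs offer.
PARTITIONS_PER_DAY = {"second": SECONDS_PER_DAY, "hour": 24, "day": 1}


def partitions_needed(days: int, granularity: str) -> int:
    """Number of partitions required to hold `days` days of data."""
    return days * PARTITIONS_PER_DAY[granularity]


if __name__ == "__main__":
    for granularity in ("second", "hour", "day"):
        count = partitions_needed(30, granularity)
        verdict = "over" if count > CITED_PARTITION_LIMIT else "within"
        print(f"{granularity}: {count} partitions ({verdict} the cited limit)")
```

At daily granularity, 30 days of weather data needs 30 partitions, so the cited limit is not a concern for option B.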
JyoGCP
11 months, 2 weeks ago
Selected Answer: B
Option B
upvoted 1 times
...
Sofiia98
1 year ago
Selected Answer: B
We need the last 30 days, we don't care about ingestion time
upvoted 4 times
...
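
The difference between partitioning by weather date (option B) and by ingestion time (option C) shows up when old weather data is loaded late. The sketch below uses a simplified model of partition expiration, assuming a daily partition expires once its date plus the expiration period has passed; the dates are made up for illustration.

```python
from datetime import date, timedelta


def partition_expired(partition_day: date, today: date, expiration_days: int = 30) -> bool:
    """Simplified model: a daily partition expires `expiration_days` after its date."""
    return today >= partition_day + timedelta(days=expiration_days)


if __name__ == "__main__":
    # Hypothetical row: weather for 1 January, loaded two months later.
    weather_day = date(2024, 1, 1)
    ingested_day = date(2024, 3, 1)
    today = date(2024, 3, 5)

    # Option B: the row lives in the partition for its weather date.
    print("expired under weather-date partitioning:",
          partition_expired(weather_day, today))

    # Option C: the row lives in the partition for its ingestion date.
    print("expired under ingestion-time partitioning:",
          partition_expired(ingested_day, today))
```

Under this model the late-loaded row is already expired when partitioned by weather date, but would be kept for another month when partitioned by ingestion time, even though the model no longer needs it.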