Welcome to ExamTopics
ExamTopics Logo
- Expert Verified, Online, Free.
exam questions

Exam Professional Data Engineer All Questions

View all questions & answers for the Professional Data Engineer exam

Exam Professional Data Engineer topic 1 question 309 discussion

Actual exam question from Google's Professional Data Engineer
Question #: 309
Topic #: 1
[All Professional Data Engineer Questions]

You work for an airline and you need to store weather data in a BigQuery table. Weather data will be used as input to a machine learning model. The model only uses the last 30 days of weather data. You want to avoid storing unnecessary data and minimize costs. What should you do?

  • A. Create a BigQuery table where each record has an ingestion timestamp. Run a scheduled query to delete all the rows with an ingestion timestamp older than 30 days.
  • B. Create a BigQuery table partitioned by datetime value of the weather date. Set up partition expiration to 30 days.
  • C. Create a BigQuery table partitioned by ingestion time. Set up partition expiration to 30 days.
  • D. Create a BigQuery table with a datetime column for the day the weather data refers to. Run a scheduled query to delete rows with a datetime value older than 30 days.
Show Suggested Answer Hide Answer
Suggested Answer: B 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
iooj
3 months, 3 weeks ago
Selected Answer: B
got this one on the exam, aug 2024, passed
upvoted 3 times
...
d11379b
8 months ago
https://cloud.google.com/bigquery/docs/partitioned-tables Here it mentions “ For TIMESTAMP and DATETIME columns, the partitions can have either hourly, daily, monthly, or yearly granularity.l So you should not calculate the amount of partitions on second granularity
upvoted 3 times
...
chambg
8 months, 3 weeks ago
Selected Answer: D
Skeptical about Option B as maximum partitions in a BQ table is 4000.Since Datetime value is a timestamp it will have more than 4000 values in a duration for 30 days (30*24*60*60 = 259,200 ). So Option D is right imo
upvoted 1 times
ce9e395
8 months, 2 weeks ago
This is a good point
upvoted 1 times
joao_01
7 months, 2 weeks ago
It's not a good point. The granularity goes to DAYs, not SECONDs. So, the right answer is B.
upvoted 4 times
...
...
...
JyoGCP
9 months, 1 week ago
Selected Answer: B
Option B
upvoted 1 times
...
Sofiia98
9 months, 4 weeks ago
Selected Answer: B
We need the last 30 days, we don't care about ingestion time
upvoted 3 times
...
AllenChen123
10 months ago
Selected Answer: B
Partitioned based on weather date, with partition expiration set
upvoted 4 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...