exam questions

Exam Professional Machine Learning Engineer All Questions

View all questions & answers for the Professional Machine Learning Engineer exam

Exam Professional Machine Learning Engineer topic 1 question 15 discussion

Actual exam question from Google's Professional Machine Learning Engineer
Question #: 15
Topic #: 1
[All Professional Machine Learning Engineer Questions]

You have been asked to develop an input pipeline for an ML training model that processes images from disparate sources at a low latency. You discover that your input data does not fit in memory. How should you create a dataset following Google-recommended best practices?

  • A. Create a tf.data.Dataset.prefetch transformation.
  • B. Convert the images to tf.Tensor objects, and then run Dataset.from_tensor_slices().
  • C. Convert the images to tf.Tensor objects, and then run tf.data.Dataset.from_tensors().
  • D. Convert the images into TFRecords, store the images in Cloud Storage, and then use the tf.data API to read the images for training.
Show Suggested Answer Hide Answer
Suggested Answer: D 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
chohan
Highly Voted 3 years, 4 months ago
Should be D
upvoted 21 times
...
alphard
Highly Voted 2 years, 10 months ago
My option is D. Cite from Google Pag: to construct a Dataset from data in memory, use tf.data.Dataset.from_tensors() or tf.data.Dataset.from_tensor_slices(). When input data is stored in a file (not in memory), the recommended TFRecord format, you can use tf.data.TFRecordDataset(). tf.data.Dataset is for data in memory. tf.data.TFRecordDataset is for data in non-memory storage.
upvoted 16 times
...
RyanTan
Most Recent 1 month, 1 week ago
Selected Answer: A
this is one of the review questions in Chapter 3 in the book "Official Google Cloud Certified Professional Machine Learning Engineer Study Guide". tf.data.Dataset.prefetch transformation decouples the time when data is produced to the time when data is consumed so it can reduce the latency. Also the transformation can reduce the memory usage. By the way, tf.data.Dataset.interleave transformation can also used to reduce the latency and memory usage.
upvoted 1 times
...
PhilipKoku
4 months, 1 week ago
Selected Answer: D
D) Storing images in TFRecords optimises storage for images.
upvoted 2 times
...
pinimichele01
6 months, 1 week ago
Selected Answer: D
tf.data.Dataset is for data in memory. tf.data.TFRecordDataset is for data in non-memory storage.
upvoted 2 times
...
samratashok
7 months, 2 weeks ago
Selected Answer: D
why this website shows wrong option as answer, this is my observation from so many questions?
upvoted 3 times
...
fragkris
10 months, 3 weeks ago
Selected Answer: D
D is correct
upvoted 1 times
...
Sum_Sum
11 months, 1 week ago
Selected Answer: D
D because: tf.data.Dataset is for data in memory. tf.data.TFRecordDataset is for data in non-memory storage.
upvoted 2 times
...
boobyg1
11 months, 3 weeks ago
Selected Answer: D
all "correct" answers are wrong
upvoted 2 times
...
M25
1 year, 5 months ago
Selected Answer: D
Went with D
upvoted 1 times
...
India_willsmith
1 year, 6 months ago
For all questions the given answers and voted answers are different. Which one should be considered for exam?
upvoted 2 times
Alfredo_OSS
1 year, 6 months ago
You should consider the voted ones.
upvoted 2 times
...
...
enghabeth
1 year, 8 months ago
Selected Answer: D
Converting your data into TFRecord has many advantages, such as: More efficient storage: the TFRecord data can take up less space than the original data; it can also be partitioned into multiple files. Fast I/O: the TFRecord format can be read with parallel I/O operations, which is useful for TPUs or multiple hosts
upvoted 1 times
...
enghabeth
1 year, 8 months ago
Selected Answer: D
my option is D
upvoted 1 times
...
Omi_04040
1 year, 9 months ago
Ans: D
upvoted 1 times
...
wish0035
1 year, 10 months ago
Selected Answer: D
ans: D
upvoted 1 times
...
EFIGO
1 year, 10 months ago
Selected Answer: D
For data in memory use tf.data.Dataset, for data in non-memory storage use tf.data.TFRecordDataset. Since data don't fit in memory, go with option D.
upvoted 1 times
...
GCP72
2 years, 2 months ago
Selected Answer: D
Correct answer is "D"
upvoted 1 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...
exam
Someone Bought Contributor Access for:
SY0-701
London, 1 minute ago