Exam Professional Machine Learning Engineer All Questions

View all questions & answers for the Professional Machine Learning Engineer exam

Exam Professional Machine Learning Engineer topic 1 question 15 discussion

Actual exam question from Google's Professional Machine Learning Engineer

Question #: 15
Topic #: 1

[All Professional Machine Learning Engineer Questions]

You have been asked to develop an input pipeline for an ML training model that processes images from disparate sources at a low latency. You discover that your input data does not fit in memory. How should you create a dataset following Google-recommended best practices?

A. Create a tf.data.Dataset.prefetch transformation.
B. Convert the images to tf.Tensor objects, and then run Dataset.from_tensor_slices().
C. Convert the images to tf.Tensor objects, and then run tf.data.Dataset.from_tensors().
D. Convert the images into TFRecords, store the images in Cloud Storage, and then use the tf.data API to read the images for training.

Show Suggested Answer

Suggested Answer: D 🗳️

by inder0007 at June 5, 2021, 9:01 p.m.

Comments

Submit Cancel

chohan

Highly Voted 3 years, 4 months ago

Should be D

upvoted 21 times

...

alphard

Highly Voted 2 years, 10 months ago

My option is D. Cite from Google Pag: to construct a Dataset from data in memory, use tf.data.Dataset.from_tensors() or tf.data.Dataset.from_tensor_slices(). When input data is stored in a file (not in memory), the recommended TFRecord format, you can use tf.data.TFRecordDataset(). tf.data.Dataset is for data in memory. tf.data.TFRecordDataset is for data in non-memory storage.

upvoted 16 times

...

RyanTan

Most Recent 1 month, 1 week ago

Selected Answer: A

this is one of the review questions in Chapter 3 in the book "Official Google Cloud Certified Professional Machine Learning Engineer Study Guide". tf.data.Dataset.prefetch transformation decouples the time when data is produced to the time when data is consumed so it can reduce the latency. Also the transformation can reduce the memory usage. By the way, tf.data.Dataset.interleave transformation can also used to reduce the latency and memory usage.

upvoted 1 times

...

PhilipKoku

4 months, 1 week ago

Selected Answer: D

D) Storing images in TFRecords optimises storage for images.

upvoted 2 times

...

pinimichele01

6 months, 1 week ago

Selected Answer: D

tf.data.Dataset is for data in memory. tf.data.TFRecordDataset is for data in non-memory storage.

upvoted 2 times

...

samratashok

7 months, 2 weeks ago

Selected Answer: D

why this website shows wrong option as answer, this is my observation from so many questions?

upvoted 3 times

...

fragkris

10 months, 3 weeks ago

Selected Answer: D

D is correct

upvoted 1 times

...

Sum_Sum

11 months, 1 week ago

Selected Answer: D

D because: tf.data.Dataset is for data in memory. tf.data.TFRecordDataset is for data in non-memory storage.

upvoted 2 times

...

boobyg1

11 months, 3 weeks ago

Selected Answer: D

all "correct" answers are wrong

upvoted 2 times

...

M25

1 year, 5 months ago

Selected Answer: D

Went with D

upvoted 1 times

...

India_willsmith

1 year, 6 months ago

For all questions the given answers and voted answers are different. Which one should be considered for exam?

upvoted 2 times

Alfredo_OSS

1 year, 6 months ago

You should consider the voted ones.

upvoted 2 times

...

enghabeth

1 year, 8 months ago

Selected Answer: D

Converting your data into TFRecord has many advantages, such as: More efficient storage: the TFRecord data can take up less space than the original data; it can also be partitioned into multiple files. Fast I/O: the TFRecord format can be read with parallel I/O operations, which is useful for TPUs or multiple hosts

upvoted 1 times

...

enghabeth

1 year, 8 months ago

Selected Answer: D

my option is D

upvoted 1 times

...

Omi_04040

1 year, 9 months ago

Ans: D

upvoted 1 times

...

wish0035

1 year, 10 months ago

Selected Answer: D

ans: D

upvoted 1 times

...

EFIGO

1 year, 10 months ago

Selected Answer: D

For data in memory use tf.data.Dataset, for data in non-memory storage use tf.data.TFRecordDataset. Since data don't fit in memory, go with option D.

upvoted 1 times

...

GCP72

2 years, 2 months ago

Selected Answer: D

Correct answer is "D"

upvoted 1 times

...

Load full discussion...

Exam Professional Machine Learning Engineer All Questions

View all questions & answers for the Professional Machine Learning Engineer exam

Exam Professional Machine Learning Engineer topic 1 question 15 discussion

Comments

chohan

alphard

RyanTan

PhilipKoku

pinimichele01

samratashok

fragkris

Sum_Sum

boobyg1

M25

India_willsmith

Alfredo_OSS

enghabeth

enghabeth

Omi_04040

wish0035

EFIGO

GCP72

SY0-701