Welcome to ExamTopics
ExamTopics Logo
- Expert Verified, Online, Free.
exam questions

Exam Certified Machine Learning Professional All Questions

View all questions & answers for the Certified Machine Learning Professional exam

Exam Certified Machine Learning Professional topic 1 question 15 discussion

Actual exam question from Databricks's Certified Machine Learning Professional
Question #: 15
Topic #: 1
[All Certified Machine Learning Professional Questions]

A data scientist has computed updated feature values for all primary key values stored in the Feature Store table features. In addition, feature values for some new primary key values have also been computed. The updated feature values are stored in the DataFrame features_df. They want to replace all data in features with the newly computed data.
Which of the following code blocks can they use to perform this task using the Feature Store Client fs?

  • A.
  • B.
  • C.
  • D.
  • E.
Show Suggested Answer Hide Answer
Suggested Answer: D 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
hugodscarvalho
10 months ago
Selected Answer: D
The data scientist already has the table created, so the method should be "write_table". Since he wants to replace all data in features with the newly computed data the "mode" overwrite should be used. Doc: https://docs.databricks.com/en/machine-learning/feature-store/workspace-feature-store/feature-tables.html#create-a-feature-table-in-databricks-feature-store
upvoted 4 times
...
mozuca
11 months ago
Selected Answer: D
Alternatively, you can create_table with schema only (without df), and populate data to the feature table with fs.write_table, fs.write_table has both overwrite and merge mode. Example: fs.create_table( name=table_name, primary_keys=["index"], schema=airbnb_df.schema, description="Original Airbnb data" ) fs.write_table( name=table_name, df=airbnb_df, mode="overwrite" ) Is this case the answer is D
upvoted 2 times
...
BokNinja
11 months, 1 week ago
Answer is D. The mode='overwrite' argument ensures that the existing data in the feature table is replaced with the new data from features_df1.
upvoted 2 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...