exam questions

Exam DP-700 All Questions

View all questions & answers for the DP-700 exam

Exam DP-700 topic 2 question 34 discussion

Actual exam question from Microsoft's DP-700
Question #: 34
Topic #: 2
[All DP-700 Questions]

You have a Fabric warehouse named DW1 that contains a Type 2 slowly changing dimension (SCD) dimension table named DimCustomer. DimCustomer contains 100 columns and 20 million rows. The columns are of various data types, including int, varchar, date, and varbinary.

You need to identify incoming changes to the table and update the records when there is a change. The solution must minimize resource consumption.

What should you use to identify changes to attributes?

  • A. a hash function to compare the attributes in the source table.
  • B. a direct attributes comparison across the attributes in the DimCustomer table.
  • C. a direct attributes comparison for the attributes in the source table.
  • D. a hash function to compare the attributes in the DimCustomer table.
Show Suggested Answer Hide Answer
Suggested Answer: A 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
13d2a97
2 weeks, 1 day ago
Selected Answer: A
A. a hash function to compare the attributes in the source table.
upvoted 1 times
...
12a2ecc
3 weeks ago
Selected Answer: A
The correct answer is: A. a hash function to compare the attributes in the source table. Here's why: Context: You are dealing with a Type 2 SCD table with 100 columns and 20 million rows. You need to detect changes to update the dimension appropriately, while minimizing resource consumption. Option A: A. a hash function to compare the attributes in the source table. This is the most efficient and scalable approach. You compute a hash (e.g., MD5, SHA-256) over a concatenation of all relevant (non-SCD metadata) columns in both the source and existing dimension table. By comparing the hashes, you avoid comparing each column directly. This minimizes CPU and memory usage, especially with many columns and large datasets. Commonly used in modern ETL/ELT tools for change detection in large datasets.
upvoted 3 times
...
zxc01
3 weeks, 4 days ago
Selected Answer: A
why don't use hash function on source side? the question lost many details. I just can guess they try to use incremental loading method. Normally we need keep Hask key values in target table column and use hash function to get changed imported rows from source side if their hash values cannot match with target hash value column.
upvoted 1 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...
exam
Someone Bought Contributor Access for:
SY0-701
London, 1 minute ago