Exam SnowPro Core All Questions

View all questions & answers for the SnowPro Core exam

Go to Exam

Exam SnowPro Core topic 1 question 1095 discussion

Actual exam question from Snowflake's SnowPro Core

Question #: 1095
Topic #: 1

[All SnowPro Core Questions]

How should clustering be used to optimize the performance of queries that run on a very large table?

A. Manually re-cluster the table regularly.
B. Choose one high cardinality column as the clustering key.
C. Use the column that is most-frequently used in query select clauses as the clustering key.
D. Assess the average table depth to identify how clustering is impacting the query.

Show Suggested Answer

Suggested Answer: D 🗳️

by simus90 at April 2, 2024, 1:44 p.m.

Comments

Submit Cancel

surya610

2 months, 1 week ago

Selected Answer: C

C, having select columns as clustering key may help directly improve a query D- But only accessing the information cant help optimize. Hence C.

upvoted 1 times

...

Snowflake provides the SYSTEM$CLUSTERING_INFORMATION function to help you assess the effectiveness of clustering by evaluating the average depth of micro-partitions. The average depth indicates how well the data is clustered for the specified clustering key. If the depth is high, it suggests that data is not well-clustered, and you may need to refine the clustering key or re-cluster the table using the RECLUSTER operation. https://docs.snowflake.com/en/user-guide/tables-clustering-keys#what-is-a-clustering-key https://docs.snowflake.com/en/user-guide/tables-clustering-keys#calculating-the-clustering-information-for-a-table

upvoted 1 times

...

MatthieuDN

5 months, 3 weeks ago

Selected Answer: D

D is correct, it would've been C if the answer was WHERE clauses instead of SELECT

upvoted 1 times

...

d22770a

9 months ago

Selected Answer: D

D is correct

upvoted 2 times

...

joshguy40

10 months, 3 weeks ago

Selected Answer: D

its D select clause is the column you choose to select. We dont care about that. We care about the columns being filtered in the WHERE clause.

upvoted 4 times

...

08c95eb

1 year, 1 month ago

Selected Answer: D

selective filters is different than select clause

upvoted 4 times

...

Jacobr5000

1 year, 2 months ago

Selected Answer: C

"Snowflake recommends prioritizing keys in the order below: Cluster columns that are most actively used in selective filters."

upvoted 2 times

...

Lematthew31

1 year, 2 months ago

Selected Answer: C

It's C : https://docs.snowflake.com/en/user-guide/tables-clustering-keys#strategies-for-selecting-clustering-keys "Selecting the right columns/expressions for a clustering key can dramatically impact query performance. Analysis of your workload will usually yield good clustering key candidates. Snowflake recommends prioritizing keys in the order below: Cluster columns that are most actively used in selective filters"

upvoted 3 times

d22770a

9 months ago

SELECTIVE FILTER means WHERE clause, Option D talks SELECT column. So that is wrong

upvoted 1 times

...

yaho5

1 year, 3 months ago

Selected Answer: C

C Snowflake recommends prioritizing keys in the order below: Cluster columns that are most actively used in selective filters. For many fact tables involved in date-based queries (for example “WHERE invoice_date > x AND invoice date <= y”), choosing the date column is a good idea. For event tables, event type might be a good choice, if there are a large number of different event types. (If your table has only a small number of different event types, then see the comments on cardinality below before choosing an event column as a clustering key.) If there is room for additional cluster keys, then consider columns frequently used in join predicates, for example “FROM table1 JOIN table2 ON table2.column_A = table1.column_B”.

upvoted 3 times

...

NachoPrendes

1 year, 3 months ago

Selected Answer: C

C https://docs.snowflake.com/en/user-guide/tables-clustering-keys#:~:text=Cluster%20columns%20that%20are%20most%20actively%20used%20in%20selective%20filters

upvoted 3 times

induna

1 year, 3 months ago

I think it is D, per the doc you listed: The number of distinct values (i.e. cardinality) in a column/expression is a critical aspect of selecting it as a clustering key. It is important to choose a clustering key that has: A large enough number of distinct values to enable effective pruning on the table. A small enough number of distinct values to allow Snowflake to effectively group rows in the same micro-partitions.

upvoted 2 times

...

simus90

1 year, 3 months ago

Selected Answer: D

it s D

upvoted 4 times

...

Exam SnowPro Core All Questions

View all questions & answers for the SnowPro Core exam

Exam SnowPro Core topic 1 question 1095 discussion

Comments

surya610

bor4un

MatthieuDN

d22770a

joshguy40

08c95eb

Jacobr5000

Lematthew31

d22770a

yaho5

NachoPrendes

induna

simus90