exam questions

Exam DP-203 All Questions

View all questions & answers for the DP-203 exam

Exam DP-203 topic 1 question 32 discussion

Actual exam question from Microsoft's DP-203
Question #: 32
Topic #: 1
[All DP-203 Questions]

You plan to ingest streaming social media data by using Azure Stream Analytics. The data will be stored in files in Azure Data Lake Storage, and then consumed by using Azure Databricks and PolyBase in Azure Synapse Analytics.
You need to recommend a Stream Analytics data output format to ensure that the queries from Databricks and PolyBase against the files encounter the fewest possible errors. The solution must ensure that the files can be queried quickly and that the data type information is retained.
What should you recommend?

  • A. JSON
  • B. Parquet
  • C. CSV
  • D. Avro
Show Suggested Answer Hide Answer
Suggested Answer: B 🗳️

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
demirsamuel
Highly Voted 2 years, 10 months ago
Selected Answer: B
Avro schema definitions are JSON records. Polybase does not support JSON so why supporting Avro then. A CSV does not contain the schema as it is everything marked as string. so only parquet is left to choose.
upvoted 34 times
...
hrastogi7
Highly Voted 3 years, 4 months ago
Parquet can be quickly retrieved and maintain metadata in itself. Hence Parquet is correct answer.
upvoted 23 times
...
EmnCours
Most Recent 4 months, 3 weeks ago
Selected Answer: B
Selected Answer: B
upvoted 1 times
...
kkk5566
1 year, 7 months ago
Selected Answer: B
should be correct
upvoted 1 times
...
akhil5432
1 year, 8 months ago
Selected Answer: B
Parquet
upvoted 1 times
...
Deeksha1234
2 years, 9 months ago
Parquet is correct
upvoted 3 times
...
Rrk07
2 years, 11 months ago
Parquet is correct
upvoted 2 times
...
Muishkin
2 years, 12 months ago
Isnt JSON good for batch processing/streaming?
upvoted 1 times
RehanRajput
2 years, 11 months ago
Indeed. However, we also want to query the data using PolyBase. Polybase doesn't support Avro. https://docs.microsoft.com/en-us/azure/synapse-analytics/sql/load-data-overview#polybase-external-file-formats
upvoted 6 times
...
...
AhmedDaffaie
3 years ago
I am confused! Avro has self-describing schema and good for quick loading (patching), why parquet?
upvoted 5 times
Boompiee
2 years, 11 months ago
Apparently, the deciding factor is the fact that PolyBase doesn't support AVRO, but it does support Parquet.
upvoted 7 times
matiandal
1 year, 5 months ago
"Polybase currently supports only delimeted text, rcfile, orc and parquet formats." R: https://msdn.microsoft.com/en-us/library/dn935025.aspx
upvoted 1 times
...
...
...
PallaviPatel
3 years, 2 months ago
Selected Answer: B
correct.
upvoted 1 times
...
EmmettBrown
3 years, 3 months ago
Selected Answer: B
Parquet is the correct answer
upvoted 1 times
...
alexleonvalencia
3 years, 4 months ago
Respuesta correcta PARQUET
upvoted 1 times
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...
exam
Someone Bought Contributor Access for:
SY0-701
London, 1 minute ago