exam questions

Exam DP-201 All Questions

View all questions & answers for the DP-201 exam

Exam DP-201 topic 7 question 2 discussion

Actual exam question from Microsoft's DP-201
Question #: 2
Topic #: 7
[All DP-201 Questions]

HOTSPOT -
Which Azure Data Factory components should you recommend using together to import the customer data from Salesforce to Data Lake Storage? To answer, select the appropriate options in the answer area.
NOTE: Each correct selection is worth one point.
Hot Area:

Show Suggested Answer Hide Answer
Suggested Answer:
Box 1: Self-hosted integration runtime
A self-hosted IR is capable of nunning copy activity between a cloud data stores and a data store in private network.

Box 2: Schedule trigger -

Schedule every 8 hours -

Box 3: Copy activity -
Scenario:
✑ Customer data, including name, contact information, and loyalty number, comes from Salesforce and can be imported into Azure once every eight hours. Row modified dates are not trusted in the source table.
✑ Product data, including product ID, name, and category, comes from Salesforce and can be imported into Azure once every eight hours. Row modified dates are not trusted in the source table.

Comments

Chosen Answer:
This is a voting comment (?). It is better to Upvote an existing comment if you don't have anything to add.
Switch to a voting comment New
Leonido
Highly Voted 4 years, 12 months ago
Azure IR is perfectly capable of running Salesforce connection. No need for VM or SSIS runtime. Tumbling trigger will serve better here. Only the copy activity is the right answer.
upvoted 17 times
vistran
4 years, 11 months ago
why cant it be scheduled trigger ?
upvoted 5 times
anamaster
4 years ago
it is scheduled trigger since modified dates are not reliable
upvoted 4 times
...
...
dinu517
4 years, 9 months ago
Agree with Azure IR as this is cloud to cloud. Scheduled trigger makes more sense as we need to get data every 8 hours
upvoted 13 times
...
knightkkd
4 years, 6 months ago
As modified dates are not reliable, tumbling windows should not be used, Scheduled trigger is the correct option
upvoted 9 times
lgtiza
3 years, 8 months ago
Modified dates do not play any role here. In a tumbling window you just set up a starting date/time and set up a frequency and that's it. It will work exactly the same as with a scheduled trigger with recurrence every 8 hours. But tumbling windows are a bit safer because you don't need to set up the first date/time in the future.
upvoted 1 times
...
...
Devendra00023
4 years ago
https://docs.microsoft.com/en-us/azure/data-factory/connector-salesforce
upvoted 2 times
...
...
BungyTex
Highly Voted 4 years, 4 months ago
I am literally working a project right now where we ingest data from salesforce to data lake. Its the Azure IR.
upvoted 9 times
...
Larrave
Most Recent 3 years, 4 months ago
As suggested it seems Azure IR is also possible and this is also the only solution, because it is mentioned to use PaaS and no VMs managed by Litware. Self-hosted IR requires managing VMs, therefore => Azure IR
upvoted 1 times
...
dbdev
3 years, 10 months ago
Based on reviewed information all the answers provided are correct.
upvoted 1 times
...
maynard13x8
4 years ago
I agree is azure IR because salesforce SaaS, but I don’t know what trigger is better. Schedule with frequency 8h fit perfectly but, at the same time Tumbling with 8h window do the same. I think the first one always trigger at these hours independently if the process was cancelled or not, and tumbling may introduce delays if process fails and has to be relaunched because next window will be 8h later than this second try. Am I right?
upvoted 2 times
Psycho
3 years, 11 months ago
Schedule trigger: A trigger that invokes a pipeline on a wall-clock schedule. Tumbling window trigger: A trigger that operates on a periodic interval, while also retaining state. Event-based trigger: A trigger that responds to an event.
upvoted 1 times
...
...
mohowzeh
4 years, 3 months ago
See https://docs.microsoft.com/en-us/azure/data-factory/copy-activity-overview#supported-data-stores-and-formats: "When you're copying data between two data stores that are publicly accessible through the internet from any IP, you can use the Azure integration runtime for the copy activity. This integration runtime is secure, reliable, scalable, and globally available. When you're copying data to and from data stores that are located on-premises or in a network with access control (for example, an Azure virtual network), you need to set up a self-hosted integration runtime." Further down on that page, Salesforce is listed as supported by Azure IR.
upvoted 8 times
...
syu31svc
4 years, 4 months ago
https://docs.microsoft.com/en-us/azure/data-factory/connector-salesforce-service-cloud: "When you copy data into Salesforce Service Cloud, the default Azure Integration Runtime can't be used to execute copy. In other words, if your source linked service doesn't have a specified integration runtime, explicitly create an Azure Integration Runtime with a location near your Salesforce Service Cloud instance." Integration runtime is Azure Trigger as schedule and activity as copy are correct
upvoted 1 times
...
master28aug
4 years, 7 months ago
we will need Self-hosted since the salesforce is an on-premise Source. ref- https://docs.microsoft.com/en-us/azure/data-factory/connector-salesforce
upvoted 2 times
...
kittykat
4 years, 8 months ago
the answer should be tumbling window trigger isn't it?
upvoted 2 times
AJMorgan591
4 years, 7 months ago
"Row modified dates are not trusted in the source table."
upvoted 1 times
essdeecee
4 years, 6 months ago
You keep saying this but its not clear why this makes a difference. Scheduled is time of day where as tumbling relates to every n hours. How does differing triggers for 800/1400/2000 hrs eg have any bearing on the row modified date? I'm not saying its wrong just you need to justify it better. I believe tumbling is a better approach because you have 1 not 3 triggers.
upvoted 3 times
M0e
4 years, 6 months ago
Tumbling Window in ADF depends on a time field of the source data to determine if it should process it or not. In the case of Scheduled trigger, it keeps track of the processed date and time for each row externally. In case of the scheduled trigger, you only need one trigger, not 3.
upvoted 3 times
essdeecee
4 years, 6 months ago
I'm pretty sure that's not true. I'm looking at it now and cannot see that dependency. Also there is not mention of that requirement here:https://docs.microsoft.com/en-us/azure/data-factory/how-to-create-tumbling-window-trigger
upvoted 2 times
...
...
...
...
...
krisspark
4 years, 8 months ago
As it's mentioned every 8 hrs why not tumbling window trigger rather scheduled.. ?
upvoted 4 times
AJMorgan591
4 years, 7 months ago
"Row modified dates are not trusted in the source table."
upvoted 1 times
...
...
peppele
4 years, 9 months ago
Important When you copy data into Salesforce, the default Azure Integration Runtime can't be used to execute copy. In other words, if your source linked service doesn't have a specified integration runtime, explicitly create an Azure Integration Runtime with a location near your Salesforce instance. Associate the Salesforce linked service as in the following example. So the given answer is correct.
upvoted 5 times
Anilpanda10
4 years, 9 months ago
In the question its mentioned to copy data from salesforce not into salesforce. I feel Azure IR should be correct here.
upvoted 9 times
...
...
proca
4 years, 9 months ago
"When you copy data into Salesforce, the default Azure Integration Runtime can't be used to execute copy. In other words, if your source linked service doesn't have a specified integration runtime, explicitly create an Azure Integration Runtime with a location near your Salesforce instance. Associate the Salesforce linked service as in the following example." https://docs.microsoft.com/en-us/azure/data-factory/connector-salesforce
upvoted 1 times
...
Tommy65
4 years, 9 months ago
For box 1 both Azure IR and self hosted IR are correct as the data is in Salesforce and Salesforce supports both: https://docs.microsoft.com/en-us/azure/data-factory/copy-activity-overview#supported-data-stores-and-formats
upvoted 2 times
...
pravinDataSpecialist
4 years, 10 months ago
IR to be used is Azure IR as mentioned here --> https://docs.microsoft.com/en-us/azure/data-factory/connector-salesforce "The integration runtime to be used to connect to the data store. If not specified, it uses the default Azure Integration Runtime"
upvoted 1 times
...
Yuri1101
5 years ago
Salesforce is a cloud data source even though there is no clear explanation from the question. Azure IR and Self-Hosted IR both will work via different approaches. Would prefer Azure IR as the answer due to simplicity. https://docs.microsoft.com/en-us/azure/data-factory/data-migration-guidance-s3-azure-storage
upvoted 3 times
Tombarc
5 years ago
Azure IR is recommended for Azure services only
upvoted 10 times
spiitr
4 years, 2 months ago
Any third party cloud or in fact Azure VM (or Azure IaaS) requires self-hosted IR
upvoted 1 times
...
...
AusAv
4 years, 7 months ago
I just completed the exam, the question is along the lines of: "comes from Salesforce, a SaaS application" if that helps
upvoted 7 times
M0e
4 years, 6 months ago
If they say Salesforce SaaS, the answer should be: Azure IR (https://docs.microsoft.com/en-gb/azure/data-factory/copy-activity-overview - [Salesforce -> Supported by Azure IR -> ✓])
upvoted 7 times
...
...
HCL1991
4 years, 12 months ago
I agree on this. If you want to determine which IR to use you can read https://docs.microsoft.com/en-us/azure/data-factory/concepts-integration-runtime#determining-which-ir-to-use. You need to look at the source and sink environment where you run the ADF pipeline. In case of salesForce, ADF has a SalesForce connector which does not have a source IR (https://docs.microsoft.com/en-gb/azure/data-factory/connector-salesforce). Instead it requires you to specify the IR of the sink. In this case the sink lies within Azure. Since the sink lies within Azure, the recommended IR is the Azure IR.
upvoted 13 times
...
...
Community vote distribution
A (35%)
C (25%)
B (20%)
Other
Most Voted
A voting comment increases the vote count for the chosen answer by one.

Upvoting a comment with a selected answer will also increase the vote count towards that answer by one. So if you see a comment that you already agree with, you can upvote it instead of posting a new comment.

SaveCancel
Loading ...
exam
Someone Bought Contributor Access for:
SY0-701
London, 1 minute ago