I agree that the correct answer is B and not C. Here is why:
Pronunciation groups values that sound alike, so it might not group variations like "John_smith" and "John, Smith", which differ mainly in delimiters and format.
Spelling focuses on minor differences in spelling but might not handle different delimiters (like underscores, commas, spaces, and hyphens) effectively.
Manual Selection would require you to manually select and group each variation, which isn't automatic.
The Common Characters option compares the letters and numbers each value contains after removing punctuation and whitespace, making it effective at grouping variations such as "John, Smith", "John_smith", "John Smith", and "John-smith" automatically.
Answer is B
Common Characters: Find and group values that have letters or numbers in common. This option uses the ngram fingerprint algorithm that indexes words by their unique characters after removing punctuation, duplicates, and whitespace. This algorithm works for any supported language. This option isn't available for data roles.
For example, this algorithm would match names that are represented as "John Smith" and "Smith, John" because they both generate the key "hijmnost". Since this algorithm doesn't consider pronunciation, the value "Tom Jhinois" would have the same key "hijmnost" and would also be included in the group.
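To make the key generation concrete, here is a rough Python sketch of a fingerprint built from unique characters, as described in the quoted text. It is only an illustration, not Tableau Prep's actual implementation: the function names are hypothetical, and lowercasing the input and using single characters (n-gram size 1) are assumptions chosen to reproduce the "hijmnost" key from the example.

```python
from collections import defaultdict


def fingerprint_key(value: str) -> str:
    """Build a key from the unique letters/digits of a value.

    Assumptions (not confirmed by the source): input is lowercased and
    the n-gram size is 1, i.e. single characters.
    """
    # Keep only letters and digits; this drops punctuation, underscores,
    # hyphens, and whitespace. The set removes duplicates.
    chars = {ch for ch in value.lower() if ch.isalnum()}
    # Sort so the key does not depend on the original character order.
    return "".join(sorted(chars))


def group_by_key(values):
    """Group values that share the same fingerprint key."""
    groups = defaultdict(list)
    for v in values:
        groups[fingerprint_key(v)].append(v)
    return dict(groups)


names = [
    "John, Smith",
    "John_smith",
    "John Smith",
    "John-smith",
    "Smith, John",
    "Tom Jhinois",
]
groups = group_by_key(names)
for key, members in groups.items():
    print(key, members)
```

Under these assumptions all six values share one key, which shows both why the delimiter variations are grouped and why an unrelated name like "Tom Jhinois" can end up in the same group.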