8 Tips To Find and Merge More Duplicates

You know you have duplicates in your database. But after you merge them in Insycle, some duplicate records remain floating in your CRM, causing problems with your processes.

In this article, we’ll cover some simple tips that you can use to expand your deduplication templates and catch the maximum number of duplicates in your database.

1. Create Multiple Templates for Different Matching Criteria

When it comes to deduplicating your data, there may not be a single template that can identify and merge all duplicate records. Different fields may require different matching criteria, and a one-size-fits-all approach can often overlook edge cases or specific scenarios.

As a best practice, you should break your duplication issues into smaller problems and address them individually. Avoid trying to solve everything at once. Deduplication will typically require multiple passes and iterations.

When creating multiple templates, start with the easier, more straightforward fields first, such as names or email addresses. These fields are more likely to surface duplicate records. When setting up your templates, it's generally best to start with Exact Match. Once you've addressed the low-hanging fruit, you can then iterate through more complex fields or edge cases, where the Similar Match option might be helpful.

For example, if working with contact records, you could have several templates that all use the name, but each uses various additional fields and parameters. You could have templates to deduplicate by:

Similar name, same email
Same name, same related company or account
Similar name, same IP address
Same name, same business phone or mobile phone number

When you have a set of templates that address your duplication issues for a record type, you can bundle them into a Recipe and run them together.

2. Use Fields that Ensure Uniqueness

Every merge operation relies on matching fields to identify duplicate records in your database.

You want to use fields in Step 1 that contain unique information that is unlikely to be shared by any other record unless it is a duplicate. For example, a contact record that shares the same first name, last name, and phone number as another record is highly likely to be a duplicate. For companies, things like company names and website addresses are good unique identifier fields.

merge-duplicates-salesforce-contacts-first-last-name-phone-step-1-simple-tab.png

Reliable, Often Used Matching Fields:

First and last name
Company name
Email
Domain or URL
Phone number
Mailing address
ID number
External system ID

Tip: Avoid Overly Broad Matching Criteria

To avoid identifying non-duplicate records as duplicates, don't create templates that are too broad.

For example, if you used only "First Name" as your matching field, you could accidentally merge every person with a matching first name together, even though they work at different companies and are not the same person.

Make sure your match fields are a unique identifier.

Unique identifiers are data unlikely to be shared by any other record unless they represent the same underlying entity. Common fields used for deduplication include phone numbers, email addresses, mailing addresses, and ID numbers.

3. Use Similar Match to Find Slight Variations

The Comparison Rule lets you define what kind of likeness to look for when deciding if field values should be considered a match.

Using Similar Match instead of Exact Match can be a great way to broaden the search to identify records with a one-character difference, such as a typo, an extra character, or a missing character. This search behaves like when Google shows results for a slightly different term or says, “Did you mean...”

For example, if a Company Name of “Acme” is found, it could match records with Company Name values such as “Akme, acm, Acma,” etc.

merge-duplicates-hubspot-companies-step-1-similar-name-domain-exact-phone.png

However, it is very important that you consider the field you're using it on. Similar Match uses looser criteria that cast a wider net for what constitutes a duplicate, so it's not appropriate for every field. For example, you wouldn't want to use Similar Match on a Phone Number field because people with similar (but different) phone numbers may be identified as duplicate records.

If using ID fields to identify duplicates, note that they will only work with Exact Match, not Similar Match.

Learn more about Setting the Criteria for Finding Duplicate Matches.

4. Ignore Elements of The Field

merge-duplicates-salesforce-accounts-step-1-simple-tab-ignored-arrow-644w.png

Insycle also lets you ignore elements in your fields, so only relevant portions of the values are analyzed.

Ignore Symbols and Whitespace when comparing phone numbers.
Ignoring subdomain (www., app.), top-level domain (.com, co.uk), or URL path (/us/western-region) when comparing websites or email domains is a great way to catch more advanced duplicates.
Insycle comes preloaded with terms to ignore. If you select Common Terms, click the Terms button to view and edit this list on the Common Terms tab.
If you select Text (substrings), click the Terms button, then the Ignored Text tab, and enter text to be ignored. Separate multiple substrings (or phrases) with a new line.

*If you’ve set up Ignored terms or strings, don’t forget to also enable them. Select the Ignored Common Terms or Text (substrings) checkbox.

merge-duplicates-salesforce-accounts-step-1-simple-tab-ignored-checkboxes-w-arrow-646w.png

Learn more about Setting the Criteria for Finding Duplicate Matches.

5. Limit How Much of a Value Insycle Looks At

Setting Match Parts in Step 1 allows you to hone in on specific portions of field values. If the values' beginning or ending portions are all unique, you can limit the comparison to that part.

For example, you can instruct Insycle to only look at:

First X Words
First X Characters
Last X Words
Last X Characters

merge-duplicates-salesforce-accounts-step-1-simple-tab-match-parts-w-arrow-646w.png

Learn more about Setting the Criteria for Finding Duplicate Matches.

6. Match Using Related Fields

Sometimes you might want to match duplicates based on data in two separate fields. For example, you might want to compare your Phone Number field to a Mobile Phone Number field to identify duplicates.

Using the Related Fields feature, you can use two different fields (that contain similar data) as matching fields to catch more duplicates.

You can set up Related Fields in the Advanced tab.

merge-duplicates-step-1-advanced-related-phone-field.png

Common Examples of Related Field Matching

Matching Field	Related Fields
Phone Number	Mobile Phone Number, Company Phone
Email Domain	Website, Company Domain
Email Address	Additional Email Addresses
Mailing Address	Company Address

Learn more about Setting the Criteria for Finding Duplicate Matches.

7. Refine Duplicate Detection with Granular Conditions and Rules

The Conditions tab provides rules that one or more of the records in a duplicate group must meet. These options let you choose fields that are required, can be empty, or must include specific values.

The Conditions tab provides rules that one or more records in a duplicate group must meet.

Value Required in All Records - Each record must contain a value in this field to be considered a duplicate.
Empty Allowed in Any Record - A record can still be considered a duplicate if this field is blank. Allowing empty values requires using two or more fields to identify duplicates.
At Least One Record With Non-Empty - At least one record in the duplicate group must contain a value.
At Least One Record Match - At least one record in the duplicate group must match the specified value. If none of the records have the specified value, the duplicate group will not be merged.
Only One Record Match - If more than one record in a duplicate group contains the specified field value, the duplicate group is skipped (not merged).
Within Timeframe - Set a time parameter that can find duplicates created or modified within a specific timeframe, such as the last 20 minutes.
Values Don't Match - All records in the duplicate group must have a value, but the value doesn't match the other records in the group.

merge-duplicates-hubspot-contacts-step-1-conditions-all-6.png

Learn more about Setting the Criteria for Finding Duplicate Matches.

8. Organize Your Templates Into Recipes

Insycle Recipes allow you to organize multiple templates into a multi-step data maintenance process for automation, training, and organization.

A Recipe is a collection of templates ordered into numbered steps that are run in sequence. You can add Insycle's built-in or your own custom templates to a Recipe.

Recipes can also be automated to run monthly, weekly, or daily. Then, you ensure that your duplicates are continuously identified and merged, hands-free.

Explore additional approaches to merging duplicates in our Common Scenarios articles.

Additional Resources

Related Help Articles

Related Blog Posts

1. Create Multiple Templates for Different Matching Criteria

2. Use Fields that Ensure Uniqueness

Reliable, Often Used Matching Fields:

Tip: Avoid Overly Broad Matching Criteria

3. Use Similar Match to Find Slight Variations

4. Ignore Elements of The Field

5. Limit How Much of a Value Insycle Looks At

6. Match Using Related Fields

Common Examples of Related Field Matching

7. Refine Duplicate Detection with Granular Conditions and Rules

8. Organize Your Templates Into Recipes

Additional Resources

Related articles