Duplicate data in HubSpot CRM poses serious problems for companies of any size.
Duplicate records inhibit your marketing team from effectively segmenting and personalizing your communications. Sales teams step on each other's toes and lack vital context in conversations. Support teams miss important information, and analysis and reporting are skewed.
Insycle helps you to merge duplicate contacts, companies, deals, and custom objects—flexibly and powerfully with the Merge Duplicates module.
Use Cases
- Deduplicate HubSpot Contacts, Companies, and Deals in Bulk
- Bulk Merge Duplicate People, Companies
- Deduplicate HubSpot Companies and Salesforce Accounts
How It Works
Insycle analyzes your HubSpot data and identifies duplicates with flexible matching rules, using any field in your database, to help you identify and merge more duplicates.
Once Insycle identifies the duplicate records, you set rules for determining the master record that other duplicates will be merged into—such as the first record created, record with the most email opens, or any other attribute that would be relevant. You can also set merging logic on a field-by-field basis.
You can merge duplicates in bulk, and Insycle provides a complete report of what was identified as a duplicate, what was merged, and what the outcome was in your master record. Then you can automate the process using templates to keep HubSpot CRM free from duplicates at all times.
To learn more, see Deduplicate HubSpot Contacts, Companies, and Deals in Bulk.
HubSpot Record Types Supported
Insycle supports the following HubSpot record types:
- Contacts
- Companies
- Deals
- Tickets
- Line Items
- Custom Objects
You can select the record type that you would like to import at the top of the Merge Duplicates module screen.
Deduplicate HubSpot Companies When Salesforce Sync is Active
Fixing duplicate HubSpot companies and Salesforce accounts while syncing has several nuanced issues that need to be accounted for. There are specific data issues that can break the sync and require you to merge records manually. You also need to determine the appropriate “master record” to use across both HubSpot and Salesforce.
Then you have to consider the merging process. If two records are merged on Salesforce, are they merged on HubSpot as well? What are the mechanics behind how that works? Often, your company's settings in each platform impact how the merge takes place, which can be confusing.
Insycle allows you to merge duplicate Hubspot companies and Salesforce accounts while keeping your sync intact, simply.
To learn more, see Deduplicate HubSpot Companies and Salesforce Accounts.
Deduplicate HubSpot and Salesforce Simultaneously
Deduplicating HubSpot and Salesforce, while the two platforms are syncing is difficult.
Both HubSpot and Salesforce have unique ways of handling duplicate records that need to be accounted for during the merging process. Failing to account for this could mean leaving many duplicate contacts floating in your system, unaccounted for, despite the fact they share a lot of data in other fields.
You also need to determine the appropriate “master record” to use. But that is often easier said than done.
Insycle allows you to deduplicate HubSpot and Salesforce at the same time while keeping your sync intact.
To learn more, see How to Merge Duplicates in HubSpot and Salesforce and Keep Them Syncing.
HubSpot Merge Logic
Here is what happens when you bulk merge HubSpot duplicates in Insycle:
Contacts
- Email: The email address from the master record becomes the primary, the duplicate email addresses are added as additional email addresses.
- Activities (notes, emails, tasks, etc.): Reassigned from the duplicates to the master.
- Deals: Reassigned from the duplicates to the master.
- Attachments: Reassigned from the duplicates to the master. (Note that there may be a short delay before the attachment appears in the merged record.)
- Fields: Use the Field tab under Step 4: Master Selection to determine what data is retained in the master record on a field-by-field basis. By default, the most recently updated value becomes the present value, all other values are available in the history. See HubSpot's merge contacts help article to learn about HubSpot's default contact merging behavior.
Companies, Deals, Tickets, Custom Objects, and Line Items
- Contacts: Reassigned from the duplicates to the master.
- Deals: Reassigned from the duplicates to the master.
- Activities (notes, emails, tasks, etc): Reassigned from the duplicates to the master
- Domains (applies only to Companies): Copied from the duplicates into the master and appended as secondary domains to avoid future duplicates with the same domain.
- Attachments: Reassigned from the duplicates to the master. (Note that there may be a short delay before the attachment appears in the merged record.)
- Fields: Use the Field tab under Step 4: Master Selection to determine what data is retained in the master record on a field-by-field basis. By default, the value is retained from the master. When a value is empty in the master it picks a non-empty value from the most recently updated duplicate.
When in doubt about conflicting field values, include those fields in the CSV report by adding them to the Master Selection section and their values would show on the audit trail.
Granular Control for Picking Duplicate Records
For situations where there are no common rules you can apply for identifying duplicates for all or some of the records, you may need more granular control for picking records to include or exclude from the process. You can customize bulk deduplication using exclusions and pre-defined masters via a CSV file.
Preview Changes Before They Go Live
You can preview the changes that you are making to your data before they are pushed to your live database. That way, you can check to ensure your deduplication operation is working as expected.
Automation
You can schedule your Merge Duplicates templates for HubSpot to run on an automated, set schedule.
To schedule them, click the Review button at the bottom of the module page. Then, you go through a three-step process to run the operation. In the third step, you can choose the "Automate" tab, and schedule your template.
You can also schedule deduplication automation using Recipes, which are a collection of templates run together. You can view all scheduled automations on the “Automations” page on your dashboard.
Learn More:
Deduping Child/Parent Companies While Retaining Associations
When deduplicating child/parent companies in HubSpot, Insycle is able to detect even the most complex company hierarchy associations, ensuring that the correct child company master records are associated with the correct parent company master records after the companies are merged.
Troubleshooting
If you're not seeing the results you expect when merging duplicates, consider these issues:
You have duplicate records that have been identified by Insycle but not all of them are merging into the master. Check to see how many duplicates are in the affected duplicate groups. If you have duplicate groups that contain more than five records, you may want to change the value in Skip duplicate groups with more than 5 records per group to make sure you can get them all.
If the Result column of the CSV report displays this error:
Cannot determine master record because multiple records (#) satisfy the master selection rules. In ‘Master Selection’, change/add/reorder the rules such that only one record satisfies them (if cannot determine master based on field values, use ‘ID is lowest’ as the last rule).
This error means that based on the master rules you set, Insycle could not figure out which would be the master.
Check Step 4 to ensure that you have Priority Match selected and not Absolute Match.
With Priority Match, the rules configured in the Records tab of Step 4 are processed in order and your master record only has to match one rule. Using Absolute Match, your master record would have to meet all of the rule criteria. The majority of the time it is best to select Priority Match.
If Priority Match was used, then none of the records meet any of the criteria on the list more than the others. In this case, you'll need to experiment with Step 4, reordering or adding additional rules for fields likely to have unique values.
There are a couple of things to look at that may be misidentifying records as duplicates.
First, you may need a better unique identifier. Under Step 1, if you only use fields that could correctly contain the same values in multiple records, these aren't unique identifiers. In this case, you are likely to identify unrelated records as duplicates and may accidentally merge them.
Unique identifiers are data that is unlikely to be shared by any other record unless it represents the same underlying entity. Fields that are commonly used in deduplication include phone numbers, email, mailing addresses, or ID numbers.
Second, this may indicate the Comparison Rule under Step 1 is too broad. Try using the Exact Match comparison rule instead of Similar Match. Similar Match looks for values that may be close but with a one-character difference (maybe a typo) which broadens the search.
Remember, always run your deduplication in Preview Mode to confirm things are working as expected before running them in Update Mode and applying the changes to your HubSpot records.
Most of the time when Insycle can't find duplicates, it is due to your matching rules in Step 1. To better understand how to set up your rules, it is important to analyze the underlying data. A useful exercise can be to set up your matching filters to look for exact matches of just First Name and Last Name.
When you click the Find button, these rules can show you a broad overview of what duplicates are potentially in your database, and what fields might be useful to include in your matching fields. These settings are just for discovery and should not be used for a final merge operation; many people can have the same first and last names and are not duplicates.
To get further context, click the gear button on the right side of the Record Viewer pane. Here, you can add any field in your database as a column to the Record Viewer to better understand the data inside of these records.
If the Result column of the CSV displays an error, read the error text for help figuring out how to resolve the problem.
The most common error is:
Cannot determine master record because multiple records (#) satisfy the master selection rules. In ‘Master Selection’, change/add/reorder the rules such that only one record satisfies them (if cannot determine master based on field values, use ‘ID is lowest’ as the last rule).
This means that based on all the rules, Insycle could not figure out which would be the master. None of the records meet more of the rules than others. In this case, you'll need to experiment with reordering or adding additional fields likely to have unique values under Step 4.
It can take a while for Insycle to find and match duplicates if the fields being used to identify them have very long values. The longer the values, the longer it takes Insycle to process the data and generate the results. This might come up when looking for matches based on long ID numbers, LinkedIn bio links, or other URLs with long strings attached (ex, https://www.linkedin.com/in/svadin%C3%ADr-n%C4%9Bmec-1234b31a3/).
You can speed this up by limiting how much of the value Insycle looks at.
If the beginning or ending portion of the values are all unique, you can limit the comparison to the first or last several characters using the Match Parts parameter under Step 1.
Or use the Ignore Text (Substrings) parameter, then click the Terms button.
On the Ignored Text tab of the popup, add the common portion of the URL or text string.
For more help troubleshooting issues with Insycle, refer to our Troubleshooting Issues article.
Frequently Asked Questions
Yes. When using two or more fields to identify duplicates, records can still be considered matches even if one of the field values is blank. You just need to specify which field(s) allow a blank value.
Under Step 1, configure your matching rules in the Simple tab, then click the Conditions tab.
All the matching fields you included will automatically appear with the Value Required in All Records condition selected. Change the condition to Empty Allowed in Any Record to allow empty values for certain fields. You can also use the At Least One Record with Non-Empty condition to help you determine which is the master record. Make sure at least one field remains required and is a reliable unique identifier to ensure the records are really duplicates.
For example, on the Simple tab, you may have the matching fields: First Name, Last Name, and Phone Number. But on some of your records, the Phone Number field may be empty. Using the Empty Allowed in Any Record or At Least One Record with Non-Empty, all records with the same name, same phone number, and no phone number will be considered duplicates.
Yes. This can be done, for example, if you want to look at both the Phone Number field values and Mobile Phone Number field values as a single pool of values to compare between records and identify duplicates.
Using the Related Fields feature, you can use two different fields (that contain similar data) as matching fields to catch more duplicates. You can set up Related Fields in the Advanced tab.
Currently, there are two ways to make sure that the records that you are merging are indeed duplicate records.
First, always run your deduplication templates in Preview Mode before running them in Update Mode. This produces a CSV that shows you how your records would have been merged. Then you can ensure that your Merge Duplicates template is working as expected and not merging non-duplicate records together.
Additionally, you can reduce the risk when merging duplicates by narrowing your duplicate matching settings in Step 1. Try the Exact Match Comparison Rule instead of Similar Match. Then make sure that you are using actual uniquely identifying fields—first name, last name, email, and phone number are popular choices. The more tightly defined your filter is, the less likely you are to merge non-duplicate records.
If the Result column of the CSV report displays this error:
Cannot determine master record because multiple records (#) satisfy the master selection rules. In ‘Master Selection’, change/add/reorder the rules such that only one record satisfies them (if cannot determine master based on field values, use ‘ID is lowest’ as the last rule).
This error means that based on the master rules you set, Insycle could not figure out which would be the master.
Check Step 4 to ensure that you have Priority Match selected and not Absolute Match.
With Priority Match, the rules configured in the Records tab of Step 4 are processed in order and your master record only has to match one rule. Using Absolute Match, your master record would have to meet all of the rule criteria. The majority of the time it is best to select Priority Match.
If Priority Match was used, then none of the records meet any of the criteria on the list more than the others. In this case, you'll need to experiment with Step 4, reordering or adding additional rules for fields likely to have unique values.
Yes. When two contacts are merged in HubSpot, by default, workflows will not enroll merged contacts. However, merged contacts can enroll in the future if they meet the enrollment triggers again and re-enrollment is enabled.
In contact-based workflows, you can manage the enrollment of merged contacts, remove contacts that no longer meet enrollment criteria, and prevent enrollment of contacts in specific lists. To learn more, see HubSpot's workflow documentation.
Yes. You can use a customized list of duplicates and use the Magical Import module to tag duplicates in your CRM, then use the Merge Duplicates module to deduplicate in bulk. Include ID numbers from your CRM in your CSV.
Yes, Insycle allows you to select which field data is retained in the master record using the Fields tab under Step 4. See the Bulk Merge Duplicate People, Companies article for more detail.
Yes. You can exclude records from deduplication by including a "Deduplication Exclude" field in your CSV, as detailed in the Customize Bulk Deduplication Using Exclusions and Pre-Defined Masters article.
Yes, if your HubSpot objects have attachments, the attachment will be merged into the master record. Note though that there may be a short delay before the attachment appears in the merged record.
When merging HubSpot contact records using the “From master record (even empty)” data retention rule, the property history in HubSpot shows that Insycle set the value to “empty.” This is a nuance of how HubSpot manages the history of empty values. You can verify that the master record value before the merge was indeed empty by reviewing the Activity Tracker report in Insycle.
Yes, there are several ways to share details and get approval before merging duplicates.
You can manually approve master records and mark them in a CSV, then use Insycle to bulk deduplicate down to those master records. Consult with this Customize Bulk Deduplication Using Exclusions and Pre-Defined Masters article to learn more.
Or, you can run the Merge Duplicates module in Preview Mode, then deliver the preview CSV that Insycle generates. The CSV report that Insycle generates includes your entire merge operation down to individual duplicate groups but does not update your live data. Then your team can approve the merge based on this report, before running Merge Duplicates in Update Mode.
Additionally, team members can review duplicates and manually select the master for each record under Step 4. Review the Manually Merge Duplicates article for more detail.
No, your field data does not need to match exactly. The Similar Match found in Step 1 looks for values that may be close but with a one-character difference (maybe a typo) and broadens the search.
This search behaves like when Google shows results for a slightly different term, or says “Did you mean...” For example, if an Email of, “huey@coahulldu.co” is found, it could include records with the values “hueyy@coahulldu.co," or "hue.y@coahulldu.co,” as a match.
Do pay close attention when using Similar Match as the looser criteria can incorrectly identify non-duplicates as duplicates.
Review the Similar Matching best practices for more detail.
Yes, Insycle can analyze leads and contacts together and deduplicate across those record types. See the Deduplicate Across Salesforce Leads and Contacts article to learn more.
Yes, Insycle solves numerous deduplication relates issues when Salesforce and HubSpot are syncing. See the Deduplicate Salesforce and HubSpot While Keeping the Sync Active article to learn more.
Insycle shows 50 records on the module screen as a preview, this isn't the entire list of records. Include All records when you view the Preview CSV report to see everything.
Insycle can process thousands of duplicate groups in one operation. Potentially, you could deduplicate your entire database in one operation.
You can merge up to 100 duplicate records into a single master record.
If you have duplicate groups that contain more than five records, you may want to change the value in Skip duplicate groups with more than 5 records per group under Step 3 to make sure you can get them all.
This is a precaution to ensure that if you use a duplicate matching filter that is too broad in Step 1, you do not accidentally merge many non-duplicate records together. If you are going to set this number at a high level, it is a good idea to run Preview Mode first to make sure your deduplication template is operating as you intend.
All plans include unlimited usage, unlimited users, and unlimited operations. See the pricing page for more details. During the free trial, there is a cap of 500 records updated, cleansed, or merged.
Additional Resources
Related Help Articles
- Deduplicate HubSpot Companies and Salesforce Accounts
- Deduplication Best Practices
- Bulk Merge Duplicate People, Companies
- Deduplicate HubSpot and Salesforce While Keeping the Sync Active
Related Blog Posts
- Why HubSpot Duplicate Contacts are Hurting Your Marketing Team and Straining Your Budget
- Data Duplication and HubSpot: Dealing With Duplicates and the Impact They Have on Your Business
- Hidden Duplicates: 11 Advanced Ways to Identify & Deduplicate Customer Data
- How to Merge Duplicates in HubSpot and Salesforce and Keep them Syncing