How to Analyze and Merge Duplicates Manually for Granular Deduplication Control
You have duplicate records in your CRM, but you need a controlled, careful process to merge these records, so bulk merging isn't an option.
With the Merge Duplicates module, you can surface duplicates, analyze the records to determine which have the relevant data, and merge them manually, one at a time.
Process Summary
- Identify duplicates.
- Review and analyze the duplicates.
- Select the records that need merging.
- Choose the master record.
- Individually select or modify values to be retained.
- Merge duplicate records.
Step-by-Step Instructions
Configure Rules to Identify Duplicates
Navigate to the Merge Duplicates module, pick the record type, and explore the default templates for a pre-built solution.
To find duplicates, you need to define how to match records. Step 1 looks through the records in your database, examining the fields that you specify for matches. Each row is for a field you want to look at for duplicates.
For example, to find duplicate Contacts, you may use the "First Name," "Last Name," and "Email Domain" fields. Contacts with the same first name AND last name AND email domain will show as possible duplicates.
Choose fields that, in combination, give a high degree of certainty that the matched records are duplicate records.
See the Merge Duplicates Module Overview for complete details on selecting fields to use and narrowing your results with the filter.
When finished, click the Find button, and Insycle will generate a list of duplicates for you to review.
Expand Criteria for Matching Duplicates
If you'd like to look at the data in two different fields (that contain similar data) as if it were one, you can set up Related Fields under the Advanced tab. For example, you might want to look at both the Email and Additional Email fields for duplicate values.
The Conditions tab provides rules one or more of the records in a duplicate group will need to meet.
- Value Required in All Records - Each record must contain a value in this field to be considered a duplicate.
- Empty Allowed in Any Record - A record can still be considered a duplicate if this field is blank. Allowing empty values requires using two or more fields to identify duplicates.
- At Least One Record With Non-Empty - At least one record in the duplicate group must contain a value.
- At Least One Record Match - At least one record in the duplicate group must match the specified value. If none of the records have the specified value, the duplicate group will not be merged.
- Only One Record Match - If more than one record in a duplicate group contains the specified field value, the duplicate group is skipped (not merged).
- Within Timeframe - Set a time parameter that can find duplicates created or modified within a specific timeframe, such as the last 30 minutes.
- Values Don't Match - All records in the duplicate group must have a value, and it doesn't match the other records within the group.
Analyze the Identified Duplicates
Records that have the same values in the fields specified in Step 1 are considered matches. When two or more records represent the same entity (person, company, or other), they are clustered together into duplicate groups. Each duplicate group shows the total number of records that were identified as duplicates. For example, if you had four records for the same person, it would count as one duplicate group with four records.
Click the chevrons at the right end of a duplicate group rows to expand them and examine the individual records within that group. The group you select here will be the one you work with when you manually merge.
Deselect the checkboxes next to the individual records you don't want to merge. The records that are selected will be merged, while the unchecked ones will not be merged.
Choose Manual Mode and Select the Records to Merge
Under Step 3. Merge Logic, select Manual mode to have complete control over which records are merged. You'll work with the single duplicate group you've chosen throughout the entire merge process.
Select the Master Record
The master is the record that will remain after the duplicates are merged. If you select three records and merge them, the other two will no longer exist. By default, data from your chosen master will be retained, and any blank values will be automatically filled from the other records. If you'd like more control over the data saved in the master record, you can do so later, in Step 4.
Under Step 3, select the master record into which the other records will be merged. It is usually best to select the record with the email address you want to keep.
Choose or Edit Values to Merge into the Master, then Merge
If you want to control which values are kept, you can choose or edit specific fields under Step 4. This is an optional step—if you don’t pick specific values, the default merging logic will be followed. Learn more about what happens to field data that isn't in the master record in the FAQs below.
Under Step 4, only five fields are initially displayed. To see all the fields, change the number of rows shown per page.
There are three tabs showing different field sets:
- The Conflicts tab shows only the fields that have differences in the values, making it easy to focus only on fields that need attention
- The Read Only tab lists only the non-writable fields from your database
- The Full tab shows all of the record fields, even those without data
Use the search to find specific fields.
On a field-by-field basis, select which values to keep and merge into the master. For example, you could choose the Industry from one and City from another.
To edit the individual field values one by one, hover over a value you want to update and click the pencil icon.
In the Edit popup, update the value and click Save.
Apply the Merge to Your CRM
When you have all critical fields chosen, click the Merge button at the bottom to merge your selected duplicates.
The Merge Result popup provides a summary of the merge operation results.
Emailed CSV Report, Audit Trail, and History
Once you've run the merge operation, Insycle will generate a CSV report and send it to your email. You can use this to review the changes made through Insycle.
With the Activity Tracker, you can download a CSV report at any time that shows all the changes made in a given operation run.
Navigate to Operations Activity Tracker, search by module, app, or template name. The operations run in Manual Mode are indicated by the Manual checkbox. Click the Run ID for the operation to download the CSV report.
When To Use Manual Merging
Manual merge is great when you have only a handful of duplicates to address, need to merge records carefully, want to employ a manual review process when merging, or just want to explore a few duplicates to understand what you have and how best to merge them.
In most cases, large datasets are a better candidate for bulk deduplication.
For situations where you have a large number of records to fix, but there are no common rules you can apply for identifying duplicates, you can create a CSV and use the Magical Import module.
In these cases, you can use CSV files to customize your bulk merging, designate master records, and exclude records from deduplication. Then you can import the CSV from the Magical Import, and use the Merge Duplicates module for complete control over the final merge operation. Learn how to customize merging Duplicates in bulk using a CSV.
Explore other advanced approaches to merging duplicates in our Deduplication Scenarios article.
Troubleshooting
Insycle isn't finding any duplicates
Most of the time, when Insycle can't find duplicates, it is due to your matching rules in Step 1. It is important to analyze the underlying data to better understand how to set up your rules. A helpful exercise for contacts is to set up your matching filters to find exact matches on only First Name and Last Name. For companies, you can use Company Name.
When you click Find, these rules can show you a broad overview of what duplicates are potentially in your database and what fields might be useful to include in your matching fields. These settings are just for discovery and should not be used for a final merge operation; many people can have the same first and last names and are not duplicates.
To get further context, on Step 2, click the layout gear button on the right side of the title bar. Here, you can add any field from your database as a column in the duplicate group review to better understand the data in these records. These will also appear in your CSV report.
It's taking a long time for Insycle to find duplicates
It can take Insycle a while to find and match duplicates when the fields used to identify them have very long values. The longer the values, the longer it takes Insycle to process the data and generate the results. This might come up when looking for matches based on long ID numbers, LinkedIn bio links, or other URLs with long strings attached (ex, https://www.linkedin.com/in/svadin%C3%ADr-n%C4%9Bmec-1234b31a3/).
You can speed this up by limiting how much of the value Insycle looks at.
If the beginning or ending portion of the values are all unique, you can limit the comparison to the first or last several characters using the Match Parts parameter under Step 1.
Or use the Ignore Text (Substrings) parameter, then click the Terms button.
On the Ignored Text tab of the popup, add the common portion of the URL or text string.
For a complete guide to troubleshooting issues with Insycle, please refer to our article on Troubleshooting Issues.
Frequently Asked Questions
What happens to field data that isn't in the master record? Will I lose data?
When manually merging duplicates, Insycle will, by default, keep the master record values. If a master record has empty fields, Insycle copies values from the most recently updated record in the duplicate group where that data exists. Any data that is not in the master or not copied to the master is removed.
For example, if the master record's Industry field is empty but another record in the duplicate group has an Industry value, that value will automatically be copied to the master record.
Additionally, you can manage which values are saved in the master record after the merge, regardless of the default merge behavior.
Select each field value you wish to retain under Step 4 and click the Full tab. The Full tab shows all of the record fields, even those without data. On a field-by-field basis, select which values to keep and merge into the master.
For example, you could choose the Primary Associated Company ID from one and Company Name from another.
Can I merge specific duplicates within a duplicate group?
Yes. You can select individual records within a duplicate group for manual merging. Under Step 2, select the duplicate group. Then, select the records that you would like to merge.
What is the Merge API in Step 3?
The Merge API options are unique features for HubSpot and Salesforce accounts. The Native merge uses your CRM's built-in merge functionality, while the Synthetic merge provides Insycle's unique merging capabilities. Learn more about the Merge API for HubSpot or the Merge API for Salesforce.
Can I view more fields to help me analyze my duplicates?
To help you analyze and determine which records to merge, you can change the fields that appear in the preview under Step 2 by clicking the gear button to adjust the layout.
Additional Resources
Related Help Articles
- Module Overview: Merge Duplicates
- Deduplication Best Practices
- Customize Bulk Deduplication Using Exclusions and Pre-defined Masters
- Deduplicate Salesforce Contacts, Leads, Accounts, and Other Objects in Bulk
- Deduplicate HubSpot Contacts, Companies, and Deals in Bulk
Related Blog Posts