How to find similar duplicates
Tutor 5 (112 Reviews)
Excel Tutor
Still stuck with a Excel question
Ask this expertAnswer
To find similar duplicates in Excel, you can use multiple methods depending on whether you want exact duplicates or near matches. Excel offers built-in tools and formulas that simplify the process.
Method 1: Using Conditional Formatting
Conditional Formatting highlights cells that contain duplicates.
Select the range of cells you want to check for duplicates.
Go to the Home tab.
Click Conditional Formatting → Highlight Cells Rules → Duplicate Values.
In the pop-up window, select the formatting style (e.g., red fill with dark red text).
Click OK.
The duplicates will be highlighted in the selected style.
Variation for Mac: The steps are the same, but the Conditional Formatting menu may appear under Format → Conditional Formatting in older versions.
Method 2: Using the COUNTIF Formula
The COUNTIF function can detect duplicates by counting occurrences of each value in a range.
Suppose your data is in column A (
A2:A100). In cell B2, enter:=COUNTIF($A$2:$A$100, A2)
Press Enter.
Drag the formula down through column B.
Any value greater than 1 indicates a duplicate.
You can filter or sort column B to see duplicates quickly.
Example: If A2 = Apple and it appears three times in the range, B2 will show 3.
Method 3: Using TEXTJOIN and FIND for Similar (Near-Match) Duplicates
To find values that are similar but not the same (like "Apple Inc." and "Apple Incorporated"), you can use formulas combining TEXTJOIN, SEARCH, or FIND.
Suppose your data is in column A.
Use a helper column B to identify similarity. For example:
=IF(SUMPRODUCT(--ISNUMBER(SEARCH("Apple", A$2:A$100)))>1,"Similar","")
This flags cells containing the word "Apple" more than once.
Replace
"Apple"with any key string you want to check for similarity.
This method is useful for partial matches or entries with minor differences.
Method 4: Using Excel’s Remove Duplicates Tool (for Exact Matches)
Select the range of data.
Go to the Data tab.
Click Remove Duplicates.
Select the columns to check.
Click OK.
Excel removes duplicates, leaving unique entries. You can use this to create a clean list and compare it with the original to find duplicates.
Method 5: Using Power Query for Advanced Duplicate Detection
Power Query is useful for detecting fuzzy duplicates or near matches.
Go to Data → Get & Transform → From Table/Range.
Load the range into Power Query.
Select the column to analyze.
Go to Home → Remove Rows → Remove Duplicates (for exact duplicates) or Transform → Fuzzy Merge (for similar duplicates).
Adjust the similarity threshold for near matches (e.g., 80%).
This method identifies both exact and approximate duplicates efficiently in large datasets.
Tips for Handling Duplicates
Always create a backup of your data before removing duplicates.
Use helper columns with
COUNTIForIFstatements to avoid accidental data loss.For text variations (e.g., spacing, capitalization), use
TRIMandUPPERfunctions to standardize before checking for duplicates:=UPPER(TRIM(A2))
These methods allow you to identify both exact and similar duplicates effectively, whether your dataset is small or large.
. Was this Helpful?Get Online Tutoring or Questions answered by Experts.
You can post a question for a tutor or set up a tutoring session
Answers · 1
Why is drop down list not showing in excel
Answers · 0
Why is the drop down list not showing all options
Answers · 1
Why can't I delete drop down list excel
Answers · 1
How to create an Excel drop-down list from a lookup table
Answers · 1