If you work with data in Excel, you're likely to find yourself dealing with the issue of duplicate records or data points.
With small data sets, you can manually scan and try and remove duplicates. But if you have large data sets, doing this manually would take a lot of your time.
The good news is that Excel has an inbuilt feature that allows you to remove duplicate with a few clicks.
In this tutorial, I will show you the steps you need to follow to remove duplicates from your data set in Excel.
Suppose you have a transaction records as shown below:
As you can see, the second and third record are the same.
While you can visually see this duplicate data point in this example, if you have hundreds or thousands of records, doing this manually would be time consuming.
Here are the steps to remove duplicates from the data set shown above:
- Select the entire data set.
- Go to Data and with the Data Tools group, click on the Remove Duplicates icon.
- In the Remove Duplicates dialog box, make sure all the columns are selected (which is the default setting).
- Click OK.
This will instantly remove all the duplicate records from the data set.
Note that this technique alters your data set. If you want to keep the original data set as is, make a copy of the data set and then perform these steps on it.
Hope you find this technique useful and it saves you some time.
If you want to learn more about this technique and how to use it, here is a tutorial that covers this topic in detail.
You may also like the following tutorials: