Duplicate Content Finder

With the help of the Duplicate Content Finder you can search your website for texts which are used on several sites ("Duplicate Content") to avoid that some contents and subpages are evaluated more negatively or are not indexed by Google. Make sure that every subpage has its own, unique content.

Note: It is, of course, not possible to avoid all kinds of overlapping in one website project. Even the use of the same always used navigation bar can lead to a certain grade of concordance. Therefore, you should concentrate on the text contents and avoid sites which are 90 or 100% identical in regard to content.

If you want to analyze your project URL and its subpages, click Start Analysis. Otherwise, click the lower half of the Start Analysis button to do a custom scan. In the Custom Scan dialog, you can select a Scan Depth to determine to which level Hello Engines! is going to follow links on your site and increase/decrease the number of Threads to increase/decrease the query speed. In addition, you can skip certain pages, folders and session IDs.

After the scan finishes, you will be shown a matrix of all the pages in the main window which allows you to compare every site in your domain with every other site. You get a percentage for every pair of pages, which will show the level of similarity.

You should look for pairs of websites which show a higher number of similarity, for example 70 or 90%. These fields will be marked in colors (yellow and orange). If you click such a field, you can compare the Original Page to the Similar Page below.

You can export the results of the Duplicate Content Finder to .csv or an Excel spreadsheet by clicking Export in the toolbar. In addition, you can create a Report on the according tab which will list all similar web pages.