When Your Blog is Getting Scraped
It’s upsetting to discover that nefarious individuals have scraped your RSS feed to make money off your hard work.
Many bloggers are also concerned that it will affect their rankings in the search engines. However, according to Google, they are quite good at identifying original content, and the result is no negative effects for the originating site.
According to Google:
Generally, we can differentiate between two major scenarios for issues related to duplicate content:
- Within-your-domain-duplicate-content, i.e. identical content which (often unintentionally) appears in more than one place on your site
- Cross-domain-duplicate-content, i.e. identical content of your site which appears (again, often unintentionally) on different external sites
Search engines identify duplicate results and will then filter out all but one. They try to figure out which is the original by checking which version was published first and which has the most links pointing to it. They also look at which site is the more authoritative one.
Usually, duplicate content will not affect your site negatively, but instead will get filtered out.
All that said, it’s still infuriating to find your own content on a scraped blog. There’s at least one blog that has been scraping the Diva for some time now. Contacting site owners that scrape usually does no good. One plugin that’s helpful is WP-Ban. You can block the IP address of the scraping site.
One thing I’ve been considering is putting an absolute link within or at the end of my posts, so I at least have a link to the originating site (mine), like this:









Pingback: Use Tracer to Track Blog Scrapers | SEO Diva
Pingback: Use Tracer to Track Blog Scrapers | SEO Blog