Is scraped content a pain for you? Is your site being scraped of its content on a daily basis? The internet is a vast and magnificent place for knowledge. It has been a place where you can find almost anything. It has been said, if you can't find it on the internet, it does not exist. That being said, the internet is also a sewer of fraud, theft and corruption. Many original content sites have had their content scraped or stolen, then posted on some other site without their express consent or any credit given. For many years there has been debate as to how to stop this, if it can be stopped.
But Google's Matt Cutts says that content scraping might not be that bad of a problem, in fact it might actually benefit you.
Whether you own a
blog or another type of
website for any length of time, then there is a good chance that your content has been scraped at sometime or other. Perhaps even on a continuous basis. This is frustrating and many people believe that it can adversely affect your site or blog. But not so, according to Matt Cutts from
Google.
Matt Cutts says that you actually may be able to slightly benefit from having your content scraped. The trick though, according to Matt Cutts, is to make sure the pages on your site have links to you in them, the scrapers may leave the links in and end up linking to you. He says these links can "help you along."
What is a Content Scraper?
In short a content scraper is basically a copy ‘n paste of content from one site directly to another site. This can be done manually or automatically with the use of content scraping software. A scraper site is a website that copies all of its content from other websites. No part of a scraper site is original.
Two big issues with this in the past have been:
-
It is regarded as plain theft and plagiarism. Many times these content scraped sites do not give credit or any link back to the original content or article.
-
Many times it has been said that these content scraper sites have ranked higher in Google Search Engine Results Pages (SERP) than the original site. Basically stealing valuable traffic and rankings and even revenue from you. Thereby being indirectly penalised by the likes of Google.
How can we benefit from scraped content?
There has been much debate over this issue. Closely related is the issue of duplicate content. Answering a user question, Matt Cutts, from Google says that it might actually benefit you.
"There are some people who really hate scrapers and try to crack down on them and try to get every single one deleted or kicked off their web host," says Cutts. "I tend to be the sort of person who doesn't really worry about it, because the vast, vast, vast majority of the time, it's going to be you that come up tops, not the scraper. If the guy is scraping and scrapes the content that has a link to you, he's linking to you, so worst case, it won't hurt, but in some weird cases, it might actually help a little bit."
So it would seem that the important part of any content is to link well. Link well within your own site and blog, link well to other sites and blogs. We all know the importance of linking, so getting into the habit; if not for content scraping, will always be beneficial.
The Debate is Still Hot
Many would totally disagree with Matt Cutts, even going so far as to start calling him names and referring to Google as evil. In this debate many have experience the opposite to what Matt Cutts is presenting. Their sites have come off second best to these scrapper sites. Loosing valuable traffic, ranking and even revenue.
“Matt cutts is full of it! smaller sites, new sites , site with low page rank are easily beat in the se results by scrapers. I've seen it happen many times to my smaller clients.”
“Google sucks! Fight Google!”
“I fail to see where this guy has a Clue”
I can see why some have a particular issue with Matt Cutts’ statement. I myself have not yet experienced any negative behaviour from my content being scraped. But then one has to think of an intelligent scraper, who, even if you do have back links, could easily insert the “rel=nofollow” attribute into your link. See my post on To follow or nofollow. There goes the theory of benefiting from the back links. Does this happen? I have no clue.
However so do believe that you can benefit from being scraped.
“Yes, this happens to me frequently but I also borrow content from others. The key is, borrow content, give credit back to the source via a back link, and add to the conversation. Republished content can actually help spread the message. And that's a good thing as long as it's above board.”
“ I mean I am getting links back from this scraper and my site always shows up before their site in search results, so what the hay... I'll just leave them be”
Some do in fact see and have experienced some benefit from scraper sites. Which goes directly to what Matt cuts says. But I do think that this issue can be site specific. It is clear that not all sites experience the same effect. Some might even experience different levels of benefit or loss.
Matt Cutts does warn that if you see a scraper ranking higher than you, you can consider doing a Digital Millennium Copyright Act request (DMCA), or if it's a true spammer (gibberish, etc.) you can go ahead and do a spam report on them.
Conclusion
So where do we go from here? I wouldn’t completely discount Matt Cutts. After all, he does carry an enormous amount of weight in the Search Industry. He sets Google trends and has great influence in their search algorithm and the way results are displayed as well as how pages are ranked. Google is King whether you like it or not. They might not be perfect, but then who is?
But I also believe that one needs to be prudent in this matter. Consider your own site and situation, monitor your articles. Keep tabs on your content. Then you can decide for yourself if it benefits you. But for the moment, all things considered, we might actually benefit from scrapers.
Related Reading:
What are your thoughts?
New here, or perhaps you've been here a few times? Like this post? Why not subscribeto this blog and get the most up to date posts as soon as they are published.