Stephan Spencer's Scatterings

The Scattered Wisdom of a scientist turned web marketing virtuoso

October 2008
S M T W T F S
 << <   > >>
      1 2 3 4
5 6 7 8 9 10 11
12 13 14 15 16 17 18
19 20 21 22 23 24 25
26 27 28 29 30 31  

Scrapers stealing your content for SEO

Content is king on the web. A site without content is doomed to lousy search engine rankings. Search engine spammers can't be bothered writing good content. Especially when they can easily steal it from other web sites. How do they do it? They use "scrapers" — spiders that trawl web pages and/or RSS feeds and siphon off the content. They then stick your content on their own site and slap their own ads and affiliate links onto it.

The spammers especially want you to use relative links across your web site. That way they can lift your entire website and they don't even have to go to the trouble of rejigging your internal links to make them point back to the scraped site. Granted, as far as bandwidth conservation, relative links are better than absolute links (also known as "hard links"). But let's not make the spammer's job any easier.

So use absolute links throughout your site.

As a side benefit, if your site responds to multiple domains and you use absolute links, you'll also be helping the search engines reduce the potential for duplicate content by definitively identifying the full, canonical URL.

Also, to check if your site has been scraped, use Copyscape.

Posted by Stephan Spencer on 12/15/2005 | Permalink

Comments (4)| Comments RSS | Filed under: Search Engines, Content            

3 comments, 1 pingback

  1. [...] Now, while most people have been taught to just use relative links, it is actually not the best for both search engines and preventing people from scraping your site content. Some people tend to use relative links because the shorter code can decrease a page’s download time and decrease the amount of typing they do. However, a millisecond is about all that is saved this was as far as load time and the majority of users these days are not on a 14K dial-up anymore. [...]

    Pingback by Web Design Blog » Proper Internal Site Linking [Visitor] — 12/19/06 @ 16:21


  2. It makes sense. I was trying to think if there are negative effects of having links from a scraped website to the original. Could you be penalized from the bad reputation of the website scraping your content linking back to you? Or would it make it easier for Google to decide that it is content duplication?

    Comment by Jean Moniatte [Visitor] Email · http://www.ugal.com/ — 10/22/07 @ 18:51


  3. I've just discovered that my own uniquely written content for a whole travel website has been completely copied or scraped by a couple of competing travel sites (they copied the whole site - I used copyscape)! Recently this site experienced a big drop out of google rankings and having made no changes ourselves in this time, I am wondering the effects of this duplicate content from the copiers (our site is only 1 year old). Could this be why? Any advice appreciated...

    Comment by orcn [Visitor] Email — 11/21/07 @ 16:34


  4. put content scraping to good use..
    i like scrapers coming to visit my site!!

    just put backlinks to your own page in your rss feeds and voila, 100's of backlinks :-)

    Comment by kristof george [Visitor] Email · http://bloggtag.com — 01/23/08 @ 01:17


Leave a comment


Your email address will not be revealed on this site.

Your URL will be displayed.
(Line breaks become <br />)
(Name, email & website)
(Allow users to contact you through a message form (your email will not be revealed.)