Stephan Spencer's Scatterings

The Scattered Wisdom of a scientist turned web marketing virtuoso

August 2008
S M T W T F S
 << <   > >>
          1 2
3 4 5 6 7 8 9
10 11 12 13 14 15 16
17 18 19 20 21 22 23
24 25 26 27 28 29 30
31            

Tricks for viewing cloaked content

There are two types of cloaking: user-agent based and IP based (also known by the euphamism "IP delivery"). Cloakers try to cover their tracks by making it difficult to examine the version meant only for spiders. They do this with a "noarchive" command embedded within the meta tags. Googlebot will obey that directive and not archive the page, which then causes the "Cached" link in that page's search listing to disappear.

So getting a view behind the curtain to see what is being served to the spider can be a bit tricky. If the type of cloaking is solely user-agent based, you can use the User Agent Switcher extension for Firefox. Just create the following user-agent under Tools > User Agent Switcher > Options > Options > User Agents:

Description: Googlebot
User Agent: Googlebot/2.1 (+http://www.googlebot.com/bot.html)

Then switch to that user agent by selecting Googlebot under Tools > User Agent Switcher.

But that won't work if the cloaker is doing IP delivery. If there's no "Cached" link in the SERPs, you might think you're out of luck. But you may not be!

A lot of times, Google's "Translate This Page" functionality can be used to view the cloaked content, because many cloakers don't bother to differentiate between the bot coming in for the purpose of translating or coming in for the purpose of crawling. Either way, it uses the same range of Google IP addresses. Thus, when a cloaker is doing IP delivery they tend to serve up the Googlebot-only version of the page to the Translate tool. This loophole can be plugged, but many cloakers miss this.

And I bet you didn't know that you can actually set the Translation language to English even if the source document is in English! You simply set it in the URL, like so:

http://translate.google.com/translate?hl=en&sl=en&u=URL&sa=X&oi=translate&resnum=9&ct=result

(Above, replace URL with the actual URL of the page you want to view)

That way, when you are reviewing someone's cloaked page, you can see the page in English instead of having to see the page in a foreign language. 

You can also sometimes use this trick to view paid content. i.e. if you're too cheap to pay for content from sites like WebmasterWorld where that content has been placed behind a registration wall and removed from Google's cache.

Example

Do pay for WebmasterWorld, though. Do right by Brett.

Posted by Stephan Spencer on 02/07/2007 | Permalink

Comments (4)| Comments RSS | Filed under: Search Engines cloaking, ip delivery            

3 comments, 1 pingback

  1. I like the translate english to english trick - I knew the others, but this one was new to me. Thanks Stephan!

    Ian

    Comment by Ian McAnerin [Visitor] Email · http://mcanerin.blogspot.com — 02/07/07 @ 23:53


  2. [...] Stephan Spencer: Tricks for viewing cloaked content [...]

    Pingback by February ‘07: Best Search/Marketing Posts » Small Business SEM [Visitor] — 03/01/07 @ 19:05


  3. hey..google bot doesnt work...

    even on webmasterworld..

    im using firefox 2.0

    Comment by kurt [Visitor] Email — 08/03/07 @ 13:14


  4. That will only grab bottom-end cloakers. Real ones, there's only a few ways to get around it.
    ...which I'm not looking to repeat here.

    Comment by SlightlyShadySEO [Visitor] Email · http://www.slightlyshadyseo.com — 01/12/08 @ 18:42


Leave a comment


Your email address will not be revealed on this site.

Your URL will be displayed.
(Line breaks become <br />)
(Name, email & website)
(Allow users to contact you through a message form (your email will not be revealed.)