Stephan Spencer's Scatterings

The Scattered Wisdom of a scientist turned web marketing virtuoso

July 2010
S M T W T F S
 << <   > >>
        1 2 3
4 5 6 7 8 9 10
11 12 13 14 15 16 17
18 19 20 21 22 23 24
25 26 27 28 29 30 31

Google bug reveals favored web sites

A couple months ago I shared one of my Google secrets, since that secret no longer worked. ;-) Specifically, it was how to obtain a list of the most important web sites according to Google.

Now, surprisingly, this little trick appears to work again (it stopped working in 2003), thanks to a bug introduced into Google's algorithm. Two months ago, a search for http would have revealed results like HTTP - Hypertext Transfer Protocol Overview and Welcome! - The Apache HTTP Server Project. Today, these sites appear nowhere near the top of the results. Instead, the top results are occupied by a "who's who" list of highly important web sites — sites that don't include the word http anywhere in the text of the page.

As already noted by blogger Nathan Weinberg, this same phenonemon occurs when you search for www.

One thing I found curious is that http and www Google queries return different results. Now these results are NOT in order of PageRank score, at least not the PageRank scores as revealed by the Google Toolbar. You can verify this to be the case yourself simply by using SEO Chat's PageRank Search tool. Indeed, it's a well-known fact within the SEO community that the PageRank scores served up by the Google Toolbar servers are not the actual PageRanks used by Google in the ranking algorithm. PageRank debate aside, perhaps this list offers us a (now) rare glimpse at some of Google's Chosen Ones — the most important sites on the Internet according to Google.

What makes me say this is due to a bug in Google? For one thing, these results are NOT relevant to the search query. Secondly, I've uncovered another bug newly introduced into Google's algorithm, namely that the inurl: query operator does not work properly, and I think these two bugs might be related. For an example of this second bug in action, search Google for site:blogs.msdn.com scoble inurl:msnsearch and the top search result is currently blogs.msdn.com/mikehall/archive/2004/11/10/255417.aspx. Note there's no msnsearch in that URL!

I've compiled a list the top 1000 results for each of the two queries for your convenience. You'll see, they do vary quite dramatically:

Follow up:


#Google results for webGoogle results for http
1www.yahoo.com/www.microsoft.com/
2www.microsoft.com/www.altavista.com/
3www.altavista.com/www.yahoo.com/
4www.cnn.com/www.w3.org/
5www.amazon.com/exec/
obidos/subst/home/home.html
www.cnn.com/
6www.lycos.com/www.excite.com/
7www.adobe.com/www.amazon.com/exec/
obidos/subst/home/home.html
8www.adobe.com/products/
acrobat/readstep2.html
www.lycos.com/
9www.excite.com/www.adobe.com/
10www.google.com/www.adobe.com/products/
acrobat/readstep2.html
11www.mapquest.com/www.mapquest.com/
12www.alltheweb.com/www.nytimes.com/
13www.nytimes.com/www.webcrawler.com/
14www.hotbot.com/www.hotbot.com/
15www.w3.org/www.netscape.com/
16www.real.com/www.real.com/
17www.webcrawler.com/www.apache.org/
18www.mozilla.org/www.who.int/en/
19www.dogpile.com/www.mozilla.org/
20home.netscape.com/www.winzip.com/
21www.hotmail.com/www.php.net/
22www.winzip.com/www.dogpile.com/
23www.who.int/en/www.apple.com/
24www.apache.org/www.alltheweb.com/
25www.apple.com/www.hotmail.com/
26www.ibm.com/go.com/
27www.macromedia.com/www.ibm.com/
28www.hp.com/www.macromedia.com/
29www.imdb.com/www.britannica.com/
30www.php.net/www.hp.com/
31www.ebay.com/groups.google.com/
32www.un.org/www.un.org/
33go.com/www.washingtonpost.com/
34www.tucows.com/www.mysql.com/
35www.washingtonpost.com/www.weather.com/
36www.weather.com/www.ebay.com/
37www.worldbank.org/www.usatoday.com/
38www.britannica.com/www.worldbank.org/
39www.oecd.org/home/slashdot.org/
40www.usatoday.com/www.gnu.org/home.html
41slashdot.org/www.gnu.org/
42www.mysql.com/www.sun.com/
43www.sun.com/www.msn.com/
44www.wto.org/www.tucows.com/
45www.msn.com/www.northernlight.com/
46www.cisco.com/www.winamp.com/
47www.gnu.org/home.htmlwww.nationalgeographic.com/
48www.gnu.org/www.howstuffworks.com/
49www.winamp.com/www.redhat.com/
50www.cancer.org/www.linux.org/
51www.barnesandnoble.com/www.opera.com/
52www.northernlight.com/www.barnesandnoble.com/
53www.linux.org/www.oecd.org/home/
54www.redcross.org/www.askjeeves.com/
55www.paypal.com/www.m-w.com/
56www.symantec.com/www.symantec.com/
57www.intel.com/www.intel.com/
58www.opera.com/freshmeat.net/
59www.askjeeves.com/www.cisco.com/
60www.redhat.com/www.aol.com/
61www.howstuffworks.com/www.paypal.com/
62www.aol.com/www.reuters.com/
63www.americanheart.org/www.icq.com/
64www.nationalgeographic.com/www.redcross.org/
65www.collegeboard.com/www.monster.com/
66www.monster.com/searchenginewatch.com/
67www.about.com/www.cancer.org/
68www.reuters.com/www.download.com/
69www.looksmart.com/www.ask.com/
70www.ipl.org/www.wto.org/
71www.download.com/us.mcafee.com/root/
campaign.asp?cid=10550
72www.ask.com/www.eff.org/
73www.wsj.com/www.ipl.org/
74www.unicef.org/www.dictionary.com/
75www.openoffice.org/www.looksmart.com/
76www.mamma.com/www.wsj.com/
77www.unesco.org/www.openoffice.org/
78www.pbs.org/www.debian.org/
79www.finaid.org/www.wired.com/
80www.m-w.com/www.shareware.com/
81www.wired.com/www.facstaff.bucknell.edu/
rbeard/diction.html
82www.cdc.gov/www.blogger.com/
83www.whitehouse.gov/www.amnesty.org/
84www.amnesty.org/www.whitehouse.gov/
85www.greenpeace.org/www.latimes.com/
86www.eff.org/www.americanheart.org/
87www.debian.org/www.greenpeace.org/
88www.dictionary.com/sourceforge.net/
89www.careerbuilder.com/www.unicef.org/
90www.oracle.com/www.collegeboard.com/
91www.economist.com/www.about.com/
92searchenginewatch.com/www.fao.org/
93www.imf.org/www.pbs.org/
94www.ft.com/www.unesco.org/
95www.metacrawler.com/www.economist.com/
96www.icq.com/www.ieee.org/
97www.mcafee.com/us/www.topica.com/
98www.shareware.com/www.oracle.com/
99www.refdesk.com/www.findlaw.com/
100www.archive.org/www.cdc.gov/
101freshmeat.net/www.careerbuilder.com/
102www.findlaw.com/www.gimp.org/
103www.latimes.com/www.imf.org/
104www.fao.org/www.epa.gov/
105www.blogger.com/www.si.edu/
106www.ticketmaster.com/www.discovery.com/
107www.epa.gov/www.onelook.com/
108www.facstaff.bucknell.edu/
rbeard/diction.html
www.ft.com/
109www.lonelyplanet.com/www.infoplease.com/
110www.gimp.org/www.nasa.gov/
111www.discovery.com/web.mit.edu/
112www.nasa.gov/www.foxnews.com/
113www.bravenet.com/www.refdesk.com/
114www.amtrak.com/www.yourdictionary.com/
115www.usps.com/www.archive.org/
116www.ieee.org/www.perl.com/
117en.wikipedia.org/wiki/Main_Pagewww.python.org/
118www.si.edu/www.dell.com/
119www.nba.com/www.finaid.org/
120www.dell.com/www.freebsd.org/
121www.npr.org/www.webopedia.com/
122www.topica.com/www.fastweb.com/
123www.yourdictionary.com/www.metacrawler.com/
124www.nature.com/www.lonelyplanet.com/
125www.usps.gov/www.switchboard.com/
126www.sciencemag.org/www.encyclopedia.com/
127www.bloomberg.com/www.mamma.com/
128www.loc.gov/www.npr.org/
129www.bartleby.com/www.nba.com/
130www.exploratorium.edu/www.nature.com/
131www.medscape.com/www.bartleby.com/
132www.ups.com/en.wikipedia.org/wiki/Main_Page
133www.mot.com/www.sciencemag.org/
134www.foxnews.com/www.loc.gov/
135www.petersons.com/www.bloomberg.com/
136www.motorola.com/www.sciam.com/
137www.sony.com/www.census.gov/
138www.perl.com/www.biography.com/
139www.infoplease.com/www.whowhere.com/
140www.switchboard.com/www.medscape.com/
141www.census.gov/www.w3.org/Protocols/
142www.zonelabs.com/www.cert.org/
143www.ilo.org/www.businessweek.com/
144www.freebsd.org/www.metmuseum.org/
145www.digits.com/www.sony.com/
146www.onelook.com/www.digits.com/
147www.verisign.com/www.apa.org/
148www.businessweek.com/www.teoma.com/
149www.metmuseum.org/www.rnc.org/
150sourceforge.net/www.imdb.com/
151www.hoovers.com/free/www.newscientist.com/
152www.time.com/www.infospace.com/
153www.apa.org/www.usps.com/
154www.cert.org/www.petersons.com/
155www.sciam.com/www.amtrak.com/
156www.python.org/www.opensource.org/
157www.iso.org/www.aclu.org/
158www.networksolutions.com/www.salon.com/
159www.encyclopedia.com/www.ilo.org/
160www.newscientist.com/www.ajkids.com/
161www.biography.com/www.time.com/
162www.aarp.org/www.motorola.com/
163www.teoma.com/www.usps.gov/
164www.amd.com/lii.org/
165www.undp.org/www.funbrain.com/
166www.whowhere.com/www.useit.com/
167www.fda.gov/www.mot.com/
168www.aclu.org/www.networksolutions.com/
169www.travelocity.com/www.ups.com/
170www.cnet.com/www.diabetes.org/
171www.senate.gov/www.moma.org/
172www.useit.com/www.zonelabs.com/
173www.apple.com/quicktime/www.bigfoot.com/
174www.opensource.org/www.amd.com/
175www.salon.com/www.exploratorium.edu/
176www.ajkids.com/www.drudgereport.com/
177www.kernel.org/www.kernel.org/
178www.moma.org/www.ietf.org/
179www.house.gov/www.forbes.com/
180www.webopedia.com/www.iso.org/
181www.bbb.org/www.senate.gov/
182www.thehungersite.com/www.apple.com/quicktime/
183web.mit.edu/www.fda.gov/
184www.expedia.com/www.undp.org/
185www.goto.com/www.mckinley.com/
186www.infospace.com/www.house.gov/
187www.ansi.org/www.hrw.org/
188www.nih.gov/www.nih.gov/
189www.eatright.org/www.ansi.org/
190www.forbes.com/www.travelocity.com/
191www.nasdaq.com/www.webmd.com/
192marriott.com/default.miwww.verisign.com/
193www.irs.gov/www.freetranslation.com/
194www.ietf.org/www.stpt.com/
195www.unep.org/vivisimo.com/
196www.review.com/www.nasdaq.com/
197www.bigfoot.com/www.jasc.com/
198www.webmd.com/www.goto.com/
199whatis.techtarget.com/www.ala.org/
200www.ala.org/www.bbb.org/
201www.aap.org/www.ticketmaster.com/
202www.jasc.com/www.thehungersite.com/
203www.novell.com/linux/suse/www.surfwatch.com/
204www.novell.com/www.novell.com/linux/suse/
205www.freetranslation.com/www.novell.com/
206www.movabletype.org/www.unep.org/
207www.aa.com/www.expedia.com/
208www.3com.com/www.napster.com/
209www.mirc.com/www.oreilly.com/
210www.nokia.com/www.xe.com/ucc/
211www.historychannel.com/www.cbs.com/
212www.lungusa.org/www.mayoclinic.com/
213www.apple.com/
quicktime/download/
www.aarp.org/
214www.icann.org/www.kde.org/
215www.compaq.com/www.chicagotribune.com/
216www.ets.org/toefl/www.stanford.edu/
217www.mp3.com/www.tomshardware.com/
218www.hilton.com/www.securityfocus.com/
219www.cbs.com/www.profusion.com/
220www.kodak.com/www.wunderground.com/
221www.ed.gov/www.historychannel.com/
222www.olympic.org/www.snopes.com/
223www.xe.com/ucc/fdncenter.org/
224www.mckinley.com/www.icann.org/
225www.nyse.com/www.ixquick.com/
226www.nbc.com/www.cnet.com/
227www.stanford.edu/europa.eu.int/
228lii.org/www.apple.com/
quicktime/download/
229www.securityfocus.com/www.nyse.com/
230www.profusion.com/www.postgresql.org/
231www.kde.org/www.nbc.com/
232www.wunderground.com/whatis.techtarget.com/
233www.drudgereport.com/www.dynamicdrive.com/
234www.collegenet.com/www.harvard.edu/
235www.chicagotribune.com/www.mp3.com/
236www.oreilly.com/www.aap.org/
237www.ual.com/www.compaq.com/
238www.kbb.com/www.ed.gov/
239www.ca.com/www.mirc.com/
240www.starwars.com/www.studyweb.com/
241www.irfanview.com/www.3com.com/
242www.idealist.org/www.familysearch.org/
243www.eudora.com/www.starwars.com/
244www.postgresql.org/www.gnome.org/
245www.act.org/www.kodak.com/
246www.sgi.com/www.irfanview.com/
247www.nvidia.com/page/homewww.acm.org/
248www.unaids.org/www.eudora.com/
249www.sec.gov/www.realaudio.com/
250www.studyweb.com/www.epic.org/
251www.harvard.edu/www.eatright.org/
252www.ftc.gov/www.kbb.com/
253www.intellicast.com/www.ca.com/
254www.ixquick.com/www.nokia.com/
255www.zdnet.com/www.michaelmoore.com/
256www.walmart.com/www.gutenberg.org/
257www.berkeley.edu/content.nejm.org/
258www.corel.com/www.csmonitor.com/
259www.wipo.int/www.aa.com/
260www.dynamicdrive.com/www.thesaurus.com/
261www.microsoft.com/windows/ie/www.olympic.org/
262www.gnome.org/www.review.com/
263www.thelancet.com/www.irs.gov/
264www.epic.org/www.search.com/
265www.acm.org/vlib.org/
266www.fifa.com/www.artcyclopedia.com/
267www.csmonitor.com/www.thegateway.org/
268vivisimo.com/www.lungusa.org/
269www.ti.com/www.lavasoftusa.com/
270www.uspto.gov/www.jumbo.com/
271www.nfl.com/mathforum.org/
272www.moveon.org/www.wipo.int/
273www.gutenberg.org/www.htmlgoodies.com/
274www.samsung.com/www.microsoft.com/windows/ie/
275www.att.com/www.perl.org/
276www.realaudio.com/www.iht.com/
277www.mapblast.com/www.eb.com/
278www.search.com/creativecommons.org/
279www.iht.com/www.webelements.com/
280www.ssa.gov/www.berkeley.edu/
281www.netnanny.com/www.netnanny.com/
282www.mozilla.org/products/firefox/www.uspto.gov/
283www.alexa.com/www.nvidia.com/page/home
284www.thesaurus.com/www.thelancet.com/
285www.htmlgoodies.com/www.acronymfinder.com/
286www.sciencedirect.com/www.ap.org/
287www.eb.com/www.ets.org/toefl/
288www.ryanair.com/www.sciencedirect.com/
289www.arthritis.org/www.mozilla.org/
products/firefox/
290www.unitedmedia.com/
comics/dilbert/
www.corel.com/
291www.johnkerry.com/www.collegenet.com/
292www.scholastic.com/www.ipswitch.com/
293www.sco.com/www.w3schools.com/
294www.ipswitch.com/www.fifa.com/
295www.w3schools.com/www.libraryspot.com/
296www.fbi.gov/www.idealist.org/
297vlib.org/www.alexa.com/
298www.sba.gov/www.moveon.org/
299www.enchantedlearning.com/
Home.html
www.ual.com/
300www.fool.com/www.att.com/
301www.space.com/www.openssl.org/
302www.ap.org/www.pgpi.org/
303www.divx.com/www.movabletype.org/
304www.avis.com/www.kidshealth.org/
305www.perl.org/www.fbi.gov/
306www.pwcglobal.com/www.space.com/
307www.usnews.com/usnews/home.htmwww.firstgov.gov/
308www.pgpi.org/www.mapblast.com/
309www.kidshealth.org/www.zdnet.com/
310www.experian.com/www.skype.com/
311www.palm.com/www.walmart.com/
312www.fema.gov/www.ti.com/
313www.webelements.com/www.alz.org/
314www.acronymfinder.com/www.unitedmedia.com/comics/dilbert/
315www.astm.org/www.sco.com/
316www.sierraclub.org/www.fool.com/
317www.bestwestern.com/www.gnupg.org/
318www.fortune.com/www.enchantedlearning.com/Home.html
319www.thegateway.org/www.linux.com/
320www.nea.org/www.clearinghouse.net/
321www.lucent.com/www.unicode.org/
322www.linux.com/www.scholastic.com/
323www.esri.com/www.sec.gov/
324www.lordoftherings.net/www.cygwin.com/
325www.openssl.org/www.ushmm.org/
326www.hertz.com/www.bankofamerica.com/
327www.libraryspot.com/www.lordoftherings.net/
328www.delta.com/www.nea.org/
329www.firstgov.gov/www.georgewbush.com/
330www.wellsfargo.com/www.umich.edu/
331www.cuteftp.com/www.cuteftp.com/
332www.sbc.com/gen/landing-pages?pid=3308www.sierraclub.org/
333www.cnn.com/si/?cnn=yeswww.democrats.org/
334www.cnn.com/money/?cnn=yeswww.atlapedia.com/
335www.cygwin.com/www.esri.com/
336www.osha.gov/www.ftc.gov/
337www.ama-assn.org/www.hwg.org/
338www.java.com/www.fortune.com/
339www.ushmm.org/www.vim.org/
340www.healthfinder.gov/www.samsung.com/
341www.usda.gov/www.nsf.gov/
342www.democrats.org/www.ssa.gov/
343www.panda.org/www.lucent.com/
344www.factmonster.com/www.usda.gov/
345www.siemens.com/www.sba.gov/
346www.unicode.org/www.envirolink.org/
347www.acs.org/www.astm.org/
348www.british-airways.com/www.scirus.com/
349www.nortelnetworks.com/www.nhl.com/
350www.overture.com/www.ama-assn.org/
351www.gnupg.org/www.fema.gov/
352www.forrester.com/www.factmonster.com/
353www.umich.edu/www.wisenut.com/
354www.scirus.com/www.spamcop.net/
355www.bmj.com/www.3m.com/
356www.epicurious.com/www.panda.org/
357www.sfgate.com/marriott.com/default.mi
358www.ada.org/www.aaas.org/
359www.mayohealth.org/www.xfree86.org/
360www.nhl.com/www.globeandmail.com/
361www.clearinghouse.net/www.palm.com/
362www.pgp.com/www.fsf.org/
363www.salliemae.com/www.bluemountain.com/
364mathforum.org/www.kartoo.com/
365www.ets.org/www.sendmail.org/
366www.rsa.com/www.ingenta.com/
367www.nsf.gov/www.overture.com/
368www.abebooks.com/www.pgp.com/
369www.globeandmail.com/www.ucla.edu/
370www.merck.com/www.healthfinder.gov/
371www.aaas.org/www.psu.edu/
372www.os.dhhs.gov/www.osha.gov/
373www.prnewswire.com/www.livejournal.com/
374www.target.com/www.noaa.gov/
375www.ge.com/en/www.prnewswire.com/
376www.nationalacademies.org/www.health.org/
377www.intelihealth.com/www.arthritis.org/
378www.boston.com/www.rsa.com/
379www.noaa.gov/www.openbsd.org/
380www.boeing.com/www.xml.com/
381creativecommons.org/www.washington.edu/
382www.wisenut.com/www.techweb.com/
383www.ingenta.com/www.sbc.com/gen/
landing-pages?pid=3308
384www.ucla.edu/www.virtualtourist.com/vt/
385www.verizon.com/home.netscape.com/
386www.vim.org/www.columbia.edu/
387www.wiley.com/WileyCDA/www.gentoo.org/
388www.livejournal.com/www.johnkerry.com/
389www.vatican.va/www.nationalacademies.org/
390www.fsf.org/www.utexas.edu/
391www.ford.com/www.isoc.org/
392www.vh.org/www.abebooks.com/
393www.sap.com/javascript.internet.com/
394www.jpost.com/www.developer.com/java/
395www.athens2004.com/www.acs.org/
396www.xfree86.org/www.lego.com/
397www.internet.com/www.activestate.com/
398www.xerox.com/www.yale.edu/
399www.thomasregister.com/www.safesurf.com/
400www.education-world.com/www.microsoft.com/ie/
401www.hwg.org/www.cornell.edu/
402www.columbia.edu/www.slackware.com/
403www.ea.com/www.edmunds.com/
404www.adbusters.org/www.merck.com/
405www.oanda.com/www.act.org/
406www.virtualtourist.com/vt/www.os.dhhs.gov/
407www.psu.edu/chronicle.com/
408www.techweb.com/www.hilton.com/
409www.toshiba.com/www.edweek.org/
410www.borland.com/www.boeing.com/
411www.utexas.edu/www.anywho.com/
412www.activestate.com/www.mayohealth.org/
413www.sendmail.org/www.forrester.com/
414www.state.gov/www.thomasregister.com/
415www.isoc.org/www.ada.org/
416www.thenation.com/www.usnews.com/
usnews/home.htm
417home.americanexpress.com/
home/mt_personal.shtml
www.enc.org/
418www.nec.com/www.education-world.com/
419www.fodors.com/www.fodors.com/
420www.cornell.edu/https://www.cvshome.org/
421www.usgs.gov/www.sgi.com/
422www.openbsd.org/www.directhit.com/
423www.edweek.org/www.cbsnews.com/
424www.nwa.com/www.eclipse.org/
425www.anywho.com/www.jpost.com/
426www.pcworld.com/www.sfgate.com/
427www.washington.edu/phpnuke.org/
428www.eclipse.org/www.pwcglobal.com/
429www.slackware.com/www.siemens.com/
430www.autodesk.com/www.wellsfargo.com/
431www.fcc.gov/www.vatican.va/
432www.nap.edu/www.borland.com/
433www.seagate.com/www.ets.org/
434www.microsoft.com/ie/www.webreference.com/
435www.cbsnews.com/www.intelihealth.com/
436www.accuweather.com/www.cnn.com/si/?cnn=yes
437www.rnc.org/www.brainpop.com/
438groups.google.com/altavista.digital.com/
439www.safesurf.com/www.rotary.org/
440www.gentoo.org/www.uiuc.edu/
441www.dol.gov/www.cauce.org/
442www.xml.com/www.epicurious.com/
443news.netcraft.com/www.ea.com/
444www.kidsdomain.com/pbskids.org/
445www.enc.org/www.omg.org/
446www.continental.com/www.alistapart.com/
447www.cauce.org/www.nec.com/
448www.familyeducation.com/home/www.wiley.com/WileyCDA/
449www.omg.org/www.adbusters.org/
450www.hotscripts.com/www.oanda.com/
451www.bluetooth.com/promo.net/pg/
452www.animationfactory.com/www.state.gov/
453www.hud.gov/www.cdt.org/
454dir.webring.com/rwwww.fark.com/
455www.panasonic.com/www.onlinenewspapers.com/
456www.ams.org/www.boston.com/
457www.pcmag.com/www.blizzard.com/
458www.uiuc.edu/www.phpbb.com/
459https://www.cvshome.org/www.fcc.gov/
460www.elsevier.com/wps/find/www.animationfactory.com/
461www.guardian.co.uk/www.computer.org/
462www.bls.gov/home.htmwww.usgs.gov/
463www.guggenheim.org/www.avis.com/
464www.usdoj.gov/www.archives.gov/
465www.webreference.com/news.netcraft.com/
466www.ifrc.org/www.toshiba.com/
467www.yale.edu/www.copernic.com/
468www.findarticles.com/www.fair.org/
469www.thawte.com/www.ryanair.com/
470www.missingkids.com/www.pcmag.com/
471www.multimap.com/www.nctm.org/
472www.visitbritain.com/www.parentsoup.com/
473www.pfizer.com/www.thawte.com/
474www.fark.com/www.iaf.net/
475www.gre.org/www.xerox.com/
476www.nps.gov/www.biblegateway.com/
477www.adobe.com/
products/acrobat/main.html
www.kidsdomain.com/
478www.opentext.com/www.thenation.com/
479www.archives.gov/www.samba.org/
480www.computerworld.com/www.ge.com/en/
481www.iomega.com/global/index.jspwww.accuweather.com/
482www.wwf.org/www.healthatoz.com/
483www.gm.com/www.nap.edu/
484www.nctm.org/www.nfpa.org/
485www.asme.org/www.ford.com/
486www.netlibrary.com/www.w3.org/WAI/
487spamcop.net/www.systransoft.com/
488www.netscape.com/www.guardian.co.uk/
489www.cdt.org/www.elsevier.com/wps/find/
490www.nist.gov/www.priceline.com/
491www.w3.org/WAI/www.scriptarchive.com/
492www.alistapart.com/www.ajb.org/
493www.apple.com/itunes/www.autodesk.com/
494www2.ncaa.org/www.usdoj.gov/
495www.blackboard.com/www.guggenheim.org/
496www.brainpop.com/www.webstandards.org/
497www.itu.int/home/www.nps.gov/
498www.bmn.com/www.infoworld.com/
499www.rand.org/www.netbsd.org/
500www.uefa.com/www.dol.gov/

501-1000 >>

Posted by Stephan Spencer on 01/09/2005 | Permalink

Comments (3)| Comments RSS | Filed under: Search Engines google, pagerank, seo            

3 comments

  1. It's not a bug per se -- actually, Google just treats the word "http" like any other word. And a lot of people link to another site using "http://www.example.com", which in the eyes of Google is the same as the link text "http www example com", so all these words are "attached" to that page. So naturally the same "top 100" list happens when you enter "www" or "com" into Google. You just get very popular pages because the "http://www. ..." link type is so common across all fields and web sites it basically works as a "neutral" term.

    Comment by Philipp Lenssen [Visitor] Email · http://blog.outer-court.com — 01/09/05 @ 09:36


  2. Ah, but why then do the SERPs for inanchor:http vary from the SERPs for http? If it weren't a bug, the SERPs for the 2 queries should be near identical. Unfortunately I don't have a screenshot of a http search from last year, but believe me it was markedly different from what it is now. Two months ago, I could search for http and get relevant results back in the top 10 — relevant to the hypertext transfer protocol. Currently that's no longer the case. Now I would need to search for intitle:http for relevant results.

    Comment by Stephan Spencer [Visitor] Email · http://www.stephanspencer.com — 01/10/05 @ 05:24


  3. "but why then do the SERPs for inanchor:http vary from the SERPs for http? If it weren’t a bug, the SERPs for the 2 queries should be near identical."

    They are in fact near identical.

    I still can't see a bug. You may prefer last year's results, possibly they were different because now Google favors link text even stronger than before. In other words it would be incredibly hard to target the keywords "http" and "www" with any meaningful strategy, which would be a bad side-effect but not really a bug...

    Comment by Philipp Lenssen [Visitor] Email · http://blog.outer-court.com — 01/11/05 @ 07:35


Leave a comment


Your email address will not be revealed on this site.

Your URL will be displayed.
(Line breaks become <br />)
(Name, email & website)
(Allow users to contact you through a message form (your email will not be revealed.)