<?xml version="1.0" encoding="utf-8"?><rss
version="2.0"
xmlns:content="http://purl.org/rss/1.0/modules/content/"
xmlns:dc="http://purl.org/dc/elements/1.1/"
xmlns:atom="http://www.w3.org/2005/Atom"
xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
xmlns:media="http://search.yahoo.com/mrss/"
> <channel><title>Comments on: Google Webmaster Tools Content Analysis shows Google breaks the rules.</title> <atom:link href="http://yoast.com/google-webmaster-tools-content-analysis-shows-google-breaks-the-rules/feed/" rel="self" type="application/rss+xml" /><link>http://yoast.com/google-webmaster-tools-content-analysis-shows-google-breaks-the-rules/#utm_source=rss&amp;utm_medium=rss&amp;utm_campaign=google-webmaster-tools-content-analysis-shows-google-breaks-the-rules</link> <description>Tweaking Websites</description> <lastBuildDate>Fri, 19 Mar 2010 08:37:34 +0000</lastBuildDate> <generator>http://wordpress.org/?v=</generator> <sy:updatePeriod>hourly</sy:updatePeriod> <sy:updateFrequency>1</sy:updateFrequency> <xhtml:meta xmlns:xhtml="http://www.w3.org/1999/xhtml" name="robots" content="noindex" /> <item><title>By: gissit</title><link>http://yoast.com/google-webmaster-tools-content-analysis-shows-google-breaks-the-rules/#comment-27822</link> <dc:creator>gissit</dc:creator> <pubDate>Mon, 07 Jan 2008 20:12:55 +0000</pubDate> <guid
isPermaLink="false">http://www.joostdevalk.nl/google-webmaster-tools-content-analysis-shows-google-breaks-the-rules/#comment-27822</guid> <description>Joost
I can&#039;t really argue with someone with a homepage pr like yours :) but experience has shown me that less is very often more in terms of raw number of pages Vs SERP positions.
If I want to create discovery links I do it in a html sitemap in a page formatted for users.
Using this type of philosophy I have much better results than most competitiors with far fewer inbound links and visible (toolbar) PR.I know that a lot of talk has gone into the whole nofollow thing but I have also seen the effect of using it as intended. A bit of link management work on a friends forum has increased Google traffic more than 20 fold in less than 9 months.Anyway, I think this may all be a bit off-topic for this thread but if you don&#039;t mind I will bookmark your site and return when I have more time as it looks interesting here.</description> <content:encoded><![CDATA[<p>Joost<br
/> I can't really argue with someone with a homepage pr like yours :) but experience has shown me that less is very often more in terms of raw number of pages Vs SERP positions.<br
/> If I want to create discovery links I do it in a html sitemap in a page formatted for users.<br
/> Using this type of philosophy I have much better results than most competitiors with far fewer inbound links and visible (toolbar) PR.</p><p>I know that a lot of talk has gone into the whole nofollow thing but I have also seen the effect of using it as intended. A bit of link management work on a friends forum has increased Google traffic more than 20 fold in less than 9 months.</p><p>Anyway, I think this may all be a bit off-topic for this thread but if you don't mind I will bookmark your site and return when I have more time as it looks interesting here.</p> ]]></content:encoded> </item> <item><title>By: Joost de Valk</title><link>http://yoast.com/google-webmaster-tools-content-analysis-shows-google-breaks-the-rules/#comment-27812</link> <dc:creator>Joost de Valk</dc:creator> <pubDate>Mon, 07 Jan 2008 19:22:53 +0000</pubDate> <guid
isPermaLink="false">http://www.joostdevalk.nl/google-webmaster-tools-content-analysis-shows-google-breaks-the-rules/#comment-27812</guid> <description>@gissit: First of all, what you call &quot;current wisdom&quot; does not apply here, I&#039;ve removed nofollows. Second: the pages are just there for discovery, just like XML sitemaps are, there&#039;s nothing wrong with that.</description> <content:encoded><![CDATA[<p>@gissit: First of all, what you call "current wisdom" does not apply here, I've removed nofollows. Second: the pages are just there for discovery, just like XML sitemaps are, there's nothing wrong with that.</p> ]]></content:encoded> </item> <item><title>By: gissit</title><link>http://yoast.com/google-webmaster-tools-content-analysis-shows-google-breaks-the-rules/#comment-27811</link> <dc:creator>gissit</dc:creator> <pubDate>Mon, 07 Jan 2008 19:17:09 +0000</pubDate> <guid
isPermaLink="false">http://www.joostdevalk.nl/google-webmaster-tools-content-analysis-shows-google-breaks-the-rules/#comment-27811</guid> <description>I read the article (I was searching for something else and this caught my eye). The behaviour of the Googlebot is exactly as I would expect, it was not prohibited from viewing the page in robots.txt, it has not indexed the page as directed by the meta tag and &#039;maybe&#039; it followed the links as requested.
As this is an SEO blog my comment was more related to the wisdom of having such pages in todays link spam climate. Google has specifically made a point of saying on many occaisions that pages should be for visitors and not for search engines. If I were an algorithm writer this would certainly raise some alarms with me.
If the pages are not for visitors to see then what else are they for? If it is to lead a search engine to other pages then they are indeed doorway pages and may suffer the wrath of Google.
As for posting just to get a link? I really do not need a link from a blog to help my seo. If this page is written in accordance with current wisdom the link will either have a rel=&quot;nofollow&quot; or will be wrapped in javascript. If it is not then it would have little or no value anyway as this is not related to my site topic.</description> <content:encoded><![CDATA[<p>I read the article (I was searching for something else and this caught my eye). The behaviour of the Googlebot is exactly as I would expect, it was not prohibited from viewing the page in robots.txt, it has not indexed the page as directed by the meta tag and 'maybe' it followed the links as requested.<br
/> As this is an SEO blog my comment was more related to the wisdom of having such pages in todays link spam climate. Google has specifically made a point of saying on many occaisions that pages should be for visitors and not for search engines. If I were an algorithm writer this would certainly raise some alarms with me.<br
/> If the pages are not for visitors to see then what else are they for? If it is to lead a search engine to other pages then they are indeed doorway pages and may suffer the wrath of Google.<br
/> As for posting just to get a link? I really do not need a link from a blog to help my seo. If this page is written in accordance with current wisdom the link will either have a rel="nofollow" or will be wrapped in javascript. If it is not then it would have little or no value anyway as this is not related to my site topic.</p> ]]></content:encoded> </item> <item><title>By: tom</title><link>http://yoast.com/google-webmaster-tools-content-analysis-shows-google-breaks-the-rules/#comment-27805</link> <dc:creator>tom</dc:creator> <pubDate>Mon, 07 Jan 2008 15:32:50 +0000</pubDate> <guid
isPermaLink="false">http://www.joostdevalk.nl/google-webmaster-tools-content-analysis-shows-google-breaks-the-rules/#comment-27805</guid> <description>Did you read the article gissit? or do want a link to your site...</description> <content:encoded><![CDATA[<p>Did you read the article gissit? or do want a link to your site...</p> ]]></content:encoded> </item> <item><title>By: gissit</title><link>http://yoast.com/google-webmaster-tools-content-analysis-shows-google-breaks-the-rules/#comment-27804</link> <dc:creator>gissit</dc:creator> <pubDate>Mon, 07 Jan 2008 15:25:55 +0000</pubDate> <guid
isPermaLink="false">http://www.joostdevalk.nl/google-webmaster-tools-content-analysis-shows-google-breaks-the-rules/#comment-27804</guid> <description>So you have pages that you want google to follow links from but do not want it to send visitors there? So it is a doorway page of some sort?</description> <content:encoded><![CDATA[<p>So you have pages that you want google to follow links from but do not want it to send visitors there? So it is a doorway page of some sort?</p> ]]></content:encoded> </item> <item><title>By: ed</title><link>http://yoast.com/google-webmaster-tools-content-analysis-shows-google-breaks-the-rules/#comment-27450</link> <dc:creator>ed</dc:creator> <pubDate>Wed, 19 Dec 2007 09:46:31 +0000</pubDate> <guid
isPermaLink="false">http://www.joostdevalk.nl/google-webmaster-tools-content-analysis-shows-google-breaks-the-rules/#comment-27450</guid> <description>Thanks, got confused by the term analysis, which is not analytics...</description> <content:encoded><![CDATA[<p>Thanks, got confused by the term analysis, which is not analytics...</p> ]]></content:encoded> </item> <item><title>By: Joost de Valk</title><link>http://yoast.com/google-webmaster-tools-content-analysis-shows-google-breaks-the-rules/#comment-27433</link> <dc:creator>Joost de Valk</dc:creator> <pubDate>Tue, 18 Dec 2007 19:31:25 +0000</pubDate> <guid
isPermaLink="false">http://www.joostdevalk.nl/google-webmaster-tools-content-analysis-shows-google-breaks-the-rules/#comment-27433</guid> <description>@ed: this is in Google Webmaster Tools, not analytics.</description> <content:encoded><![CDATA[<p>@ed: this is in Google Webmaster Tools, not analytics.</p> ]]></content:encoded> </item> <item><title>By: ed</title><link>http://yoast.com/google-webmaster-tools-content-analysis-shows-google-breaks-the-rules/#comment-27424</link> <dc:creator>ed</dc:creator> <pubDate>Tue, 18 Dec 2007 14:51:31 +0000</pubDate> <guid
isPermaLink="false">http://www.joostdevalk.nl/google-webmaster-tools-content-analysis-shows-google-breaks-the-rules/#comment-27424</guid> <description>Where can I find this duplicate title tag section in google analytics?</description> <content:encoded><![CDATA[<p>Where can I find this duplicate title tag section in google analytics?</p> ]]></content:encoded> </item> <item><title>By: &#187; Tracking URLs that pass link juice Dixon Jones: Echoes from a quiet space</title><link>http://yoast.com/google-webmaster-tools-content-analysis-shows-google-breaks-the-rules/#comment-27385</link> <dc:creator>&#187; Tracking URLs that pass link juice Dixon Jones: Echoes from a quiet space</dc:creator> <pubDate>Mon, 17 Dec 2007 13:09:46 +0000</pubDate> <guid
isPermaLink="false">http://www.joostdevalk.nl/google-webmaster-tools-content-analysis-shows-google-breaks-the-rules/#comment-27385</guid> <description>[...] but by colleague Andy has some issues. Let&#8217;s be honest Google - you are ruining standards. Joost gave an example of ignoring standards they&#8217;ve agreed by indexing &#8220;no index&#8221; [...]</description> <content:encoded><![CDATA[<p>[...] but by colleague Andy has some issues. Let&#8217;s be honest Google - you are ruining standards. Joost gave an example of ignoring standards they&#8217;ve agreed by indexing &#8220;no index&#8221; [...]</p> ]]></content:encoded> </item> <item><title>By: Joost de Valk</title><link>http://yoast.com/google-webmaster-tools-content-analysis-shows-google-breaks-the-rules/#comment-27380</link> <dc:creator>Joost de Valk</dc:creator> <pubDate>Mon, 17 Dec 2007 09:13:47 +0000</pubDate> <guid
isPermaLink="false">http://www.joostdevalk.nl/google-webmaster-tools-content-analysis-shows-google-breaks-the-rules/#comment-27380</guid> <description>@tingeltangeltill: noindex without a nofollow impicitly means, noindex, follow. Hence they need to spider those pages to know which links are on there, that&#039;s logical. They won&#039;t show them in their index, yet they DO give me some sort of advice on them. I&#039;d rather have them not do that, but they don&#039;t &quot;put them in their index&quot;, they spider them because they need to follow the links.</description> <content:encoded><![CDATA[<p>@tingeltangeltill: noindex without a nofollow impicitly means, noindex, follow. Hence they need to spider those pages to know which links are on there, that's logical. They won't show them in their index, yet they DO give me some sort of advice on them. I'd rather have them not do that, but they don't "put them in their index", they spider them because they need to follow the links.</p> ]]></content:encoded> </item> <item><title>By: tingeltangeltill</title><link>http://yoast.com/google-webmaster-tools-content-analysis-shows-google-breaks-the-rules/#comment-27379</link> <dc:creator>tingeltangeltill</dc:creator> <pubDate>Mon, 17 Dec 2007 09:03:32 +0000</pubDate> <guid
isPermaLink="false">http://www.joostdevalk.nl/google-webmaster-tools-content-analysis-shows-google-breaks-the-rules/#comment-27379</guid> <description>If someone is linking to your &quot;noindex&quot; sites, google will put them in their index, as Matt said in a Google Video</description> <content:encoded><![CDATA[<p>If someone is linking to your "noindex" sites, google will put them in their index, as Matt said in a Google Video</p> ]]></content:encoded> </item> <item><title>By: Sint</title><link>http://yoast.com/google-webmaster-tools-content-analysis-shows-google-breaks-the-rules/#comment-27369</link> <dc:creator>Sint</dc:creator> <pubDate>Sun, 16 Dec 2007 13:06:19 +0000</pubDate> <guid
isPermaLink="false">http://www.joostdevalk.nl/google-webmaster-tools-content-analysis-shows-google-breaks-the-rules/#comment-27369</guid> <description>@Sergey: I hope the reason for this is not that you called your file really tobots.txt ;-)
Do you have a noindex-tag in these files? Or are they all non-HTML so this isn&#039;t possible?Alternatively you could also block all traffic coming from Googlebot using .htaccess on these folders.</description> <content:encoded><![CDATA[<p>@Sergey: I hope the reason for this is not that you called your file really tobots.txt ;-)<br
/> Do you have a noindex-tag in these files? Or are they all non-HTML so this isn't possible?</p><p>Alternatively you could also block all traffic coming from Googlebot using .htaccess on these folders.</p> ]]></content:encoded> </item> <item><title>By: Sergey Rusak</title><link>http://yoast.com/google-webmaster-tools-content-analysis-shows-google-breaks-the-rules/#comment-27363</link> <dc:creator>Sergey Rusak</dc:creator> <pubDate>Sun, 16 Dec 2007 11:08:22 +0000</pubDate> <guid
isPermaLink="false">http://www.joostdevalk.nl/google-webmaster-tools-content-analysis-shows-google-breaks-the-rules/#comment-27363</guid> <description>For my blog I block all /trackback/ and /feed/ pages through tobots.txt but for some reason Google still index them every time new page appear. Only one good thing, it appear in index only for 1-3 days and later dissapear. Wierd.</description> <content:encoded><![CDATA[<p>For my blog I block all /trackback/ and /feed/ pages through tobots.txt but for some reason Google still index them every time new page appear. Only one good thing, it appear in index only for 1-3 days and later dissapear. Wierd.</p> ]]></content:encoded> </item> <item><title>By: nam</title><link>http://yoast.com/google-webmaster-tools-content-analysis-shows-google-breaks-the-rules/#comment-27328</link> <dc:creator>nam</dc:creator> <pubDate>Sat, 15 Dec 2007 19:20:05 +0000</pubDate> <guid
isPermaLink="false">http://www.joostdevalk.nl/google-webmaster-tools-content-analysis-shows-google-breaks-the-rules/#comment-27328</guid> <description>Content Analysis is a excellent tool to determine a duplicated content, I&#039;ve used my file robots.txt to  to restrict access to Google on these pages, and it&#039;s working well for me :
For exemple :
Disallow: /*&amp;view=getnewpost$
Disallow: /*&amp;view=getlastpost$
Disallow: /*&amp;view=old$
Disallow: /*&amp;view=new$</description> <content:encoded><![CDATA[<p>Content Analysis is a excellent tool to determine a duplicated content, I've used my file robots.txt to  to restrict access to Google on these pages, and it's working well for me :<br
/> For exemple :<br
/> Disallow: /*&amp;view=getnewpost$<br
/> Disallow: /*&amp;view=getlastpost$<br
/> Disallow: /*&amp;view=old$<br
/> Disallow: /*&amp;view=new$</p> ]]></content:encoded> </item> <item><title>By: Sint</title><link>http://yoast.com/google-webmaster-tools-content-analysis-shows-google-breaks-the-rules/#comment-27327</link> <dc:creator>Sint</dc:creator> <pubDate>Sat, 15 Dec 2007 19:09:59 +0000</pubDate> <guid
isPermaLink="false">http://www.joostdevalk.nl/google-webmaster-tools-content-analysis-shows-google-breaks-the-rules/#comment-27327</guid> <description>I agree with some people here that the Google Webmaster Tool can used for more things than just SEO, so providing this information from pages that should be no-indexed can still be useful. But it also raises questions about how Google is interpreting the noindex-metatag and what the status of this tag is related to non-SERP-functionalities of search engines. Maybe it would be wise if Google would include some information on this issue in the documentation/help section of WM-Tools.</description> <content:encoded><![CDATA[<p>I agree with some people here that the Google Webmaster Tool can used for more things than just SEO, so providing this information from pages that should be no-indexed can still be useful. But it also raises questions about how Google is interpreting the noindex-metatag and what the status of this tag is related to non-SERP-functionalities of search engines. Maybe it would be wise if Google would include some information on this issue in the documentation/help section of WM-Tools.</p> ]]></content:encoded> </item> <item><title>By: Sebastian</title><link>http://yoast.com/google-webmaster-tools-content-analysis-shows-google-breaks-the-rules/#comment-27315</link> <dc:creator>Sebastian</dc:creator> <pubDate>Sat, 15 Dec 2007 10:18:30 +0000</pubDate> <guid
isPermaLink="false">http://www.joostdevalk.nl/google-webmaster-tools-content-analysis-shows-google-breaks-the-rules/#comment-27315</guid> <description>From Google&#039;s POV that&#039;s usability advice, not crawling, indexing or ranking advice. I see where you&#039;re coming from but I don&#039;t think that&#039;s really &quot;breaking rules&quot;. :)</description> <content:encoded><![CDATA[<p>From Google's POV that's usability advice, not crawling, indexing or ranking advice. I see where you're coming from but I don't think that's really "breaking rules". :)</p> ]]></content:encoded> </item> <item><title>By: Joost de Valk</title><link>http://yoast.com/google-webmaster-tools-content-analysis-shows-google-breaks-the-rules/#comment-27308</link> <dc:creator>Joost de Valk</dc:creator> <pubDate>Sat, 15 Dec 2007 07:59:14 +0000</pubDate> <guid
isPermaLink="false">http://www.joostdevalk.nl/google-webmaster-tools-content-analysis-shows-google-breaks-the-rules/#comment-27308</guid> <description>Sebastian: that&#039;s not new to me either, believe me :) but it&#039;s just that since I don&#039;t want these pages to appear in the SERPs, have made that VERY clear to Google, and YET they still give me that stupid advice :)</description> <content:encoded><![CDATA[<p>Sebastian: that's not new to me either, believe me :) but it's just that since I don't want these pages to appear in the SERPs, have made that VERY clear to Google, and YET they still give me that stupid advice :)</p> ]]></content:encoded> </item> <item><title>By: Sebastian</title><link>http://yoast.com/google-webmaster-tools-content-analysis-shows-google-breaks-the-rules/#comment-27295</link> <dc:creator>Sebastian</dc:creator> <pubDate>Sat, 15 Dec 2007 00:53:49 +0000</pubDate> <guid
isPermaLink="false">http://www.joostdevalk.nl/google-webmaster-tools-content-analysis-shows-google-breaks-the-rules/#comment-27295</guid> <description>Oops ... Disallow: /*?=s  that is</description> <content:encoded><![CDATA[<p>Oops ... Disallow: /*?=s  that is</p> ]]></content:encoded> </item> <item><title>By: Sebastian</title><link>http://yoast.com/google-webmaster-tools-content-analysis-shows-google-breaks-the-rules/#comment-27294</link> <dc:creator>Sebastian</dc:creator> <pubDate>Sat, 15 Dec 2007 00:52:22 +0000</pubDate> <guid
isPermaLink="false">http://www.joostdevalk.nl/google-webmaster-tools-content-analysis-shows-google-breaks-the-rules/#comment-27294</guid> <description>Noindex means &quot;crawl but don&#039;t display it on SERPs&quot;. Since those URLs aren&#039;t indexed Google obeys the REP tag. To follow the links Google needs to keep a copy, so why shouldn&#039;t they provide that info in your GWC acct where nobody else can view it? It&#039;s extracted from the crawl cache, not the visible cache. You could complain when you add a Disallow: /*?p= to your robots.txt&#039;s Googlebot section ;)</description> <content:encoded><![CDATA[<p>Noindex means "crawl but don't display it on SERPs". Since those URLs aren't indexed Google obeys the REP tag. To follow the links Google needs to keep a copy, so why shouldn't they provide that info in your GWC acct where nobody else can view it? It's extracted from the crawl cache, not the visible cache. You could complain when you add a Disallow: /*?p= to your robots.txt's Googlebot section ;)</p> ]]></content:encoded> </item> <item><title>By: André Scholten</title><link>http://yoast.com/google-webmaster-tools-content-analysis-shows-google-breaks-the-rules/#comment-27291</link> <dc:creator>André Scholten</dc:creator> <pubDate>Fri, 14 Dec 2007 23:00:17 +0000</pubDate> <guid
isPermaLink="false">http://www.joostdevalk.nl/google-webmaster-tools-content-analysis-shows-google-breaks-the-rules/#comment-27291</guid> <description>I someone creates a link to that second page, a visitor that clicks on that link immediately sees he&#039;s on page 2. &quot;sense of place&quot;.But I agree with you: nobody will invest time in those little details. You have to be a idealist to do that.</description> <content:encoded><![CDATA[<p>I someone creates a link to that second page, a visitor that clicks on that link immediately sees he's on page 2. "sense of place".</p><p>But I agree with you: nobody will invest time in those little details. You have to be a idealist to do that.</p> ]]></content:encoded> </item> </channel> </rss>
<!-- Performance optimized by W3 Total Cache. Learn more: http://www.w3-edge.com/wordpress-plugins/

Minified using apc
Page Caching using apc
Database Caching 1/6 queries in 0.008 seconds using apc
Content Delivery Network via netdna.yoast.com

Served from: yoast.com @ 2010-03-19 09:44:02 -->