<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0"><channel><title>dmiessler.com | grep understanding - Latest Comments in The Whitehouse.gov Website&amp;#8217;s Robots.txt File Has 1839 Lines In It</title><link>http://danielrm26.disqus.com/</link><description>dmiessler.com/about/</description><language>en</language><lastBuildDate>Thu, 08 Feb 2007 08:38:06 -0000</lastBuildDate><item><title>Re: The Whitehouse.gov Website&amp;#8217;s Robots.txt File Has 1839 Lines In It</title><link>http://dmiessler.com/blog/the-whitehousegov-websites-robotstxt-file-has-1839-lines-in-it#comment-4353125</link><description>Yup, even I noted it sometime back as an excellent sitemap. ;-)</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Deepak</dc:creator><pubDate>Thu, 08 Feb 2007 08:38:06 -0000</pubDate></item><item><title>Re: The Whitehouse.gov Website&amp;#8217;s Robots.txt File Has 1839 Lines In It</title><link>http://dmiessler.com/blog/the-whitehousegov-websites-robotstxt-file-has-1839-lines-in-it#comment-4353126</link><description>Ooooh, /secret/ directories. *nods head*</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">ghost16825</dc:creator><pubDate>Fri, 26 Jan 2007 05:03:46 -0000</pubDate></item><item><title>Re: The Whitehouse.gov Website&amp;#8217;s Robots.txt File Has 1839 Lines In It</title><link>http://dmiessler.com/blog/the-whitehousegov-websites-robotstxt-file-has-1839-lines-in-it#comment-4353123</link><description>Search in Google for 'robots.txt' shows whitehouse.gov at position 5</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">sergei</dc:creator><pubDate>Thu, 25 Jan 2007 13:11:44 -0000</pubDate></item><item><title>Re: The Whitehouse.gov Website&amp;#8217;s Robots.txt File Has 1839 Lines In It</title><link>http://dmiessler.com/blog/the-whitehousegov-websites-robotstxt-file-has-1839-lines-in-it#comment-4353124</link><description>Looking at most of those entries, it looks like they're excluding pages which look to be designed for text only browsers/screen readers.. nearly every directory ends in /text&lt;br&gt;&lt;br&gt;Disallow:	/asia/2005/photoessay/china/text&lt;br&gt;Disallow:	/asia/2005/photoessay/japan/text&lt;br&gt;Disallow:	/asia/2005/photoessay/korea/text&lt;br&gt;Disallow:	/asia/2005/photoessay/mongolia/text&lt;br&gt;Disallow:	/asia/2005/photoessay/mrsbush1/text&lt;br&gt;Disallow:	/asia/2005/photoessay/mrsbush2/text&lt;br&gt;&lt;br&gt;&lt;br&gt;and if you browse up one directory, you get the same story with pictures..&lt;br&gt;&lt;br&gt;I'd say it looks like they are doing it to work around for a poor file structure or possibly to keep search engines from finding duplicate text (although without pictures)&lt;br&gt;&lt;br&gt;*shrugs* I'm all for pointing out when the administration does something crooked, but I can't see fault in this one.. (granted, I've only checked out 20 or so of the links.. the only one that didn't go anywhere for me was /video/text )</description><dc:creator xmlns:dc="http://purl.org/dc/elements/1.1/">Some Joker</dc:creator><pubDate>Tue, 23 Jan 2007 17:06:03 -0000</pubDate></item></channel></rss>