<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments for thinking sysadmin</title>
	<atom:link href="http://andyleonard.com/comments/feed/" rel="self" type="application/rss+xml" />
	<link>http://andyleonard.com</link>
	<description>qstat -u aleonard -s z</description>
	<lastBuildDate>Mon, 11 Jan 2010 17:10:09 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.9.1</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>Comment on Fishworks&#8217; LDAP Schema Definition by Saint Aardvark</title>
		<link>http://andyleonard.com/2008/11/18/fishworks-ldap-schema-definition/comment-page-1/#comment-317</link>
		<dc:creator>Saint Aardvark</dc:creator>
		<pubDate>Mon, 11 Jan 2010 17:10:09 +0000</pubDate>
		<guid isPermaLink="false">http://andyleonard.com/?p=164#comment-317</guid>
		<description>Thanks for this -- I hadn&#039;t realized the importance of the Search Descriptor field.</description>
		<content:encoded><![CDATA[<p>Thanks for this &#8212; I hadn&#8217;t realized the importance of the Search Descriptor field.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Duplicity to Amazon S3 on FreeBSD: Building on the work of others by tash</title>
		<link>http://andyleonard.com/2009/03/02/duplicity-to-amazon-s3-on-freebsd-building-on-the-work-of-others/comment-page-1/#comment-315</link>
		<dc:creator>tash</dc:creator>
		<pubDate>Sun, 03 Jan 2010 22:25:41 +0000</pubDate>
		<guid isPermaLink="false">http://andyleonard.com/?p=226#comment-315</guid>
		<description>I&#039;m getting that &quot;Fatal Error: Neither remote nor local manifest is readable&quot; message in my cron logs.   Hopefully exporting the HOME variable does the trick.  Thanks in advance if it does :)</description>
		<content:encoded><![CDATA[<p>I&#8217;m getting that &#8220;Fatal Error: Neither remote nor local manifest is readable&#8221; message in my cron logs.   Hopefully exporting the HOME variable does the trick.  Thanks in advance if it does <img src='http://andyleonard.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Test Driving Google Public DNS (Updated with OpenDNS comparison) by Andy</title>
		<link>http://andyleonard.com/2009/12/03/test-driving-google-publi-dns/comment-page-1/#comment-271</link>
		<dc:creator>Andy</dc:creator>
		<pubDate>Thu, 03 Dec 2009 23:09:38 +0000</pubDate>
		<guid isPermaLink="false">http://andyleonard.com/?p=414#comment-271</guid>
		<description>@Raj - As query response times - reported by &quot;dig&quot; as &quot;Query time&quot; - fell into two fairly tight groups, it seemed a reasonable assumption to assume the the faster responses were served out of cache, while the slower response times were not.  Noting that repeated queries of the same hostname were always faster and always tightly clustered bears this out.  You could also look at the TTL of the returned record to see if it is cached or not - if it is below what the authoritative host has for it, you know you&#039;re looking at a cache hit.</description>
		<content:encoded><![CDATA[<p>@Raj &#8211; As query response times &#8211; reported by &#8220;dig&#8221; as &#8220;Query time&#8221; &#8211; fell into two fairly tight groups, it seemed a reasonable assumption to assume the the faster responses were served out of cache, while the slower response times were not.  Noting that repeated queries of the same hostname were always faster and always tightly clustered bears this out.  You could also look at the TTL of the returned record to see if it is cached or not &#8211; if it is below what the authoritative host has for it, you know you&#8217;re looking at a cache hit.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Test Driving Google Public DNS (Updated with OpenDNS comparison) by Raj</title>
		<link>http://andyleonard.com/2009/12/03/test-driving-google-publi-dns/comment-page-1/#comment-270</link>
		<dc:creator>Raj</dc:creator>
		<pubDate>Thu, 03 Dec 2009 22:35:25 +0000</pubDate>
		<guid isPermaLink="false">http://andyleonard.com/?p=414#comment-270</guid>
		<description>Hey I wnated to know how you compared public dns and opendns .I mean what tool did you use to see the cache hit and miss ratio.I want to test the cache hit and miss of some DNS related app need info and help on that</description>
		<content:encoded><![CDATA[<p>Hey I wnated to know how you compared public dns and opendns .I mean what tool did you use to see the cache hit and miss ratio.I want to test the cache hit and miss of some DNS related app need info and help on that</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Interesting Linux VM Crash Pattern by Andy</title>
		<link>http://andyleonard.com/2009/11/20/interesting-linux-vm-crash-pattern/comment-page-1/#comment-257</link>
		<dc:creator>Andy</dc:creator>
		<pubDate>Fri, 20 Nov 2009 23:37:48 +0000</pubDate>
		<guid isPermaLink="false">http://andyleonard.com/?p=343#comment-257</guid>
		<description>I&#039;ve posted the NFS settings above; unfortunately, they don&#039;t appear to deviate from default/recommended settings.</description>
		<content:encoded><![CDATA[<p>I&#8217;ve posted the NFS settings above; unfortunately, they don&#8217;t appear to deviate from default/recommended settings.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Interesting Linux VM Crash Pattern by Andy</title>
		<link>http://andyleonard.com/2009/11/20/interesting-linux-vm-crash-pattern/comment-page-1/#comment-256</link>
		<dc:creator>Andy</dc:creator>
		<pubDate>Fri, 20 Nov 2009 23:13:20 +0000</pubDate>
		<guid isPermaLink="false">http://andyleonard.com/?p=343#comment-256</guid>
		<description>Thanks for the input and suggestions, Nate - much appreciated.

I failed to mention in the post that we are running another cluster of ESX 3.5 hosts off the same filer (and same aggregate) using a Fibre Channel LUN without issue, FWIW.  Guests on that cluster include Linux VMs, but all without vSMP.  That suggests it&#039;s not the filer itself, but it could still be the storage protocol, or the host hardware, or vSMP.

(And now that I&#039;ve written that, I&#039;m curious about verifying the VMware NFS settings on the problem host.  I&#039;ll post an update on what I find there.)</description>
		<content:encoded><![CDATA[<p>Thanks for the input and suggestions, Nate &#8211; much appreciated.</p>
<p>I failed to mention in the post that we are running another cluster of ESX 3.5 hosts off the same filer (and same aggregate) using a Fibre Channel LUN without issue, FWIW.  Guests on that cluster include Linux VMs, but all without vSMP.  That suggests it&#8217;s not the filer itself, but it could still be the storage protocol, or the host hardware, or vSMP.</p>
<p>(And now that I&#8217;ve written that, I&#8217;m curious about verifying the VMware NFS settings on the problem host.  I&#8217;ll post an update on what I find there.)</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Keeping your RHEL VMs from crushing your storage at 4:02am by Andy</title>
		<link>http://andyleonard.com/2009/11/19/keeping-your-rhel-vms-from-crushing-your-storage-at-402am/comment-page-1/#comment-255</link>
		<dc:creator>Andy</dc:creator>
		<pubDate>Fri, 20 Nov 2009 23:03:05 +0000</pubDate>
		<guid isPermaLink="false">http://andyleonard.com/?p=315#comment-255</guid>
		<description>Thanks for the comment, Nate.  I hadn&#039;t considered makewhatis when I wrote this article, but I believe that it is called with the &quot;-u&quot; flag which greatly reduces its I/O load; a couple quick experiments on my workstation seem to bear this out, but, obviously, test in your own environment.

Adding the random delay loop in a separate file in /etc/cron.daily - say &quot;1randomdelay&quot; so it runs after the &quot;0*&quot; files - may also be effective (and cleaner) and will stagger all the files after it, sorted by the glob in run-parts.</description>
		<content:encoded><![CDATA[<p>Thanks for the comment, Nate.  I hadn&#8217;t considered makewhatis when I wrote this article, but I believe that it is called with the &#8220;-u&#8221; flag which greatly reduces its I/O load; a couple quick experiments on my workstation seem to bear this out, but, obviously, test in your own environment.</p>
<p>Adding the random delay loop in a separate file in /etc/cron.daily &#8211; say &#8220;1randomdelay&#8221; so it runs after the &#8220;0*&#8221; files &#8211; may also be effective (and cleaner) and will stagger all the files after it, sorted by the glob in run-parts.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Interesting Linux VM Crash Pattern by nate</title>
		<link>http://andyleonard.com/2009/11/20/interesting-linux-vm-crash-pattern/comment-page-1/#comment-254</link>
		<dc:creator>nate</dc:creator>
		<pubDate>Fri, 20 Nov 2009 22:54:32 +0000</pubDate>
		<guid isPermaLink="false">http://andyleonard.com/?p=343#comment-254</guid>
		<description>just for reference, I run about 100 VMs with 2.6.18-128.1.10.el5 on CentOS 5.2 (5.3 kernel on 5.2 distro) with the SMP kernel though there is only 1 vCPU per VM. All of them are local storage on Dell R610. Also have about a dozen VMs on the same kernel running with 2 vCPUs.

I run about 40 physical servers on the same kernel on native R610 with 8 cores no panics there either.

I run the SMP kernel on everything even if it&#039;s only 1 vCPU, because you never know if you may need to upgrade to 2 or more CPUs and if you do I don&#039;t want to have to change kernels. Though now that I think about it I think RHEL/CentOS 5.x kernels are all SMP now vs on 4.x where they had SMP and UP kernels, but not 100% sure.

Never had a panic of any sort on any of them.

To me your problem looks related to storage, so I would look to NetApp or VMware&#039;s NFS stuff. I would also consider hooking at least one host via Fiber or iSCSI and run some of the VMs off of that and see what you get.

My own infrastructure is split into two classes, we have our edge web servers which are at several physical locations, all of them run off of local storage. The other class is most of our back end stuff or QA or internal IT, of which it&#039;s all fiber channel connected(most of it is boot from SAN, with the exception of some older ESXi 3.5 systems that don&#039;t support that), connected to our 3PAR T400 storage system.

in case it helps you in your tracing of the issue..</description>
		<content:encoded><![CDATA[<p>just for reference, I run about 100 VMs with 2.6.18-128.1.10.el5 on CentOS 5.2 (5.3 kernel on 5.2 distro) with the SMP kernel though there is only 1 vCPU per VM. All of them are local storage on Dell R610. Also have about a dozen VMs on the same kernel running with 2 vCPUs.</p>
<p>I run about 40 physical servers on the same kernel on native R610 with 8 cores no panics there either.</p>
<p>I run the SMP kernel on everything even if it&#8217;s only 1 vCPU, because you never know if you may need to upgrade to 2 or more CPUs and if you do I don&#8217;t want to have to change kernels. Though now that I think about it I think RHEL/CentOS 5.x kernels are all SMP now vs on 4.x where they had SMP and UP kernels, but not 100% sure.</p>
<p>Never had a panic of any sort on any of them.</p>
<p>To me your problem looks related to storage, so I would look to NetApp or VMware&#8217;s NFS stuff. I would also consider hooking at least one host via Fiber or iSCSI and run some of the VMs off of that and see what you get.</p>
<p>My own infrastructure is split into two classes, we have our edge web servers which are at several physical locations, all of them run off of local storage. The other class is most of our back end stuff or QA or internal IT, of which it&#8217;s all fiber channel connected(most of it is boot from SAN, with the exception of some older ESXi 3.5 systems that don&#8217;t support that), connected to our 3PAR T400 storage system.</p>
<p>in case it helps you in your tracing of the issue..</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on Keeping your RHEL VMs from crushing your storage at 4:02am by nate</title>
		<link>http://andyleonard.com/2009/11/19/keeping-your-rhel-vms-from-crushing-your-storage-at-402am/comment-page-1/#comment-253</link>
		<dc:creator>nate</dc:creator>
		<pubDate>Fri, 20 Nov 2009 22:45:12 +0000</pubDate>
		<guid isPermaLink="false">http://andyleonard.com/?p=315#comment-253</guid>
		<description>also disable the makewhatis cron I found that to be expensive as well.

I just checked my configs and it turns out I don&#039;t have updatedb disabled on my VMs, I thought I did, suppose I should go and disable it. I do have my cfengine installation push out a version of the makewhatis cron file that just has an exit 0 at the top of it.</description>
		<content:encoded><![CDATA[<p>also disable the makewhatis cron I found that to be expensive as well.</p>
<p>I just checked my configs and it turns out I don&#8217;t have updatedb disabled on my VMs, I thought I did, suppose I should go and disable it. I do have my cfengine installation push out a version of the makewhatis cron file that just has an exit 0 at the top of it.</p>
]]></content:encoded>
	</item>
	<item>
		<title>Comment on ESX VM swap on NFS: If it crashes, try something else by Andy Daniel</title>
		<link>http://andyleonard.com/2009/02/04/esx-vm-swap-on-nfs-if-it-crashes-try-something-else/comment-page-1/#comment-240</link>
		<dc:creator>Andy Daniel</dc:creator>
		<pubDate>Tue, 21 Jul 2009 21:04:35 +0000</pubDate>
		<guid isPermaLink="false">http://andyleonard.com/?p=218#comment-240</guid>
		<description>There appears to be some internal debate over this. The KB has been pulled from public access.</description>
		<content:encoded><![CDATA[<p>There appears to be some internal debate over this. The KB has been pulled from public access.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
