<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>thinking sysadmin &#187; sge</title>
	<atom:link href="http://andyleonard.com/tag/sge/feed/" rel="self" type="application/rss+xml" />
	<link>http://andyleonard.com</link>
	<description>qstat -u aleonard -s z</description>
	<lastBuildDate>Tue, 28 Feb 2012 04:47:09 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.3.2</generator>
		<item>
		<title>On SPRINT: A new parallel framework for R</title>
		<link>http://andyleonard.com/2009/01/16/on-sprint-a-new-parallel-framework-for-r/</link>
		<comments>http://andyleonard.com/2009/01/16/on-sprint-a-new-parallel-framework-for-r/#comments</comments>
		<pubDate>Fri, 16 Jan 2009 15:55:45 +0000</pubDate>
		<dc:creator>Andy</dc:creator>
				<category><![CDATA[hpc]]></category>
		<category><![CDATA[R]]></category>
		<category><![CDATA[sge]]></category>

		<guid isPermaLink="false">http://andyleonard.com/?p=194</guid>
		<description><![CDATA[As a sysadmin who supports multiple R users, I took notice of a post on InsideHPC late last year &#8211; Parallel framework for statistical analysis package &#8220;R&#8221;.  The creators of the Simple Parallel R INTerface have &#8220;designed and built a prototype framework that allows the addition of parallelised functions to R to enable the easy exploitation [...]]]></description>
			<content:encoded><![CDATA[<p>As a sysadmin who supports multiple R users, I took notice of a post on InsideHPC late last year &#8211; <a href="http://insidehpc.com/2008/12/30/parallel-framework-for-statistical-analysis-package-r/">Parallel framework for statistical analysis package &#8220;R&#8221;</a>.  The creators of the Simple Parallel R INTerface have &#8220;designed and built a prototype framework that allows the addition of parallelised functions to R to enable the easy exploitation of HPC systems.&#8221; (<a href="http://www.biomedcentral.com/1471-2105/9/558">paper</a>, <a href="http://forge.nesc.ac.uk/projects/sprint">source code</a>)  In other words, it is a system that lets R users run jobs on a cluster without learning parallel programming.<br />
<span id="more-194"></span><br />
One of the biggest challenges of my job hasn&#8217;t been building cluster resources &#8211; in an era of open source queuing systems like Sun Grid Engine and distributions like Rocks, setting up a cluster is pretty much as easy as you want it to be.  Rather, the challenge has been convincing users to actually run their jobs on the cluster.  One user of mine preferred to use some home-built Perl to dispatch jobs instead of investing half an hour in learning to write job scripts for SGE.  Some of my R users will run jobs on the cluster, but not via a job script &#8211; they use <code>qrsh</code> instead, launching the job interactively and then leaving their terminal idle until it completes.  Oftentimes, it seems that the cluster-aware software my users need leaves a bad taste in their mouths, as well.  I&#8217;ve seen a software developer who is chronically unclear on the <a href="http://www.cs.wisc.edu/condor/htc.html">difference</a> between high throughput and high performance computing, &#8220;grid-aware&#8221; software products that hard-code queue names, and a product that chose a backwards programming model and wrote its own bug-laden queuing system (I&#8217;m looking at you, TurboSEQUEST).</p>
<p>So SPRINT caught my eye as a possible solution to get more computational resources to more of my users more easily.  I recognize that it&#8217;s not a mature solution, or really even much more than an idea of how a mature solution might look one day.  I sent the link for the paper to my heaviest R user &#8211; his initial reaction was &#8220;interesting, but not very useful&#8221; &#8211; not because he couldn&#8217;t see the potential, but because a lot of heavy lifting has yet to be done on the project to get it to a state where it would benefit his work.</p>
<p>I don&#8217;t know if SPRINT will eventually live up to its promise &#8211; it does have a long way to go: it is only a 0.0.3 release, and it doesn&#8217;t scale particularly well right now &#8211; but I&#8217;m encouraged by it because I see it as a step in the right direction.  Single-threaded performance doesn&#8217;t seem likely to improve any time soon; if we want to keep analyzing larger and larger datasets, the problem that SPRINT addresses is one we&#8217;ll have to keep thinking about.</p>
]]></content:encoded>
			<wfw:commentRss>http://andyleonard.com/2009/01/16/on-sprint-a-new-parallel-framework-for-r/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Large Link Dump, 7/16/2008</title>
		<link>http://andyleonard.com/2008/07/16/large-link-dump-7-16-2008/</link>
		<comments>http://andyleonard.com/2008/07/16/large-link-dump-7-16-2008/#comments</comments>
		<pubDate>Wed, 16 Jul 2008 21:55:53 +0000</pubDate>
		<dc:creator>Andy</dc:creator>
				<category><![CDATA[link dump]]></category>
		<category><![CDATA[containers]]></category>
		<category><![CDATA[esx]]></category>
		<category><![CDATA[ha]]></category>
		<category><![CDATA[hp]]></category>
		<category><![CDATA[nfsv4]]></category>
		<category><![CDATA[nic teaming]]></category>
		<category><![CDATA[sge]]></category>
		<category><![CDATA[vmware]]></category>

		<guid isPermaLink="false">http://andyleonard.com/?p=48</guid>
		<description><![CDATA[VM HA &#8211; service console networking, isolation behavior &#8211; and other &#8220;under the covers stuff&#8221; &#8211; An overview of how VMware ESX&#8217;s High Availability works under the hood &#8211; making it much more apparent to me how important file locking is to HA&#8217;s functioning. (I&#8217;d love to see an overview of how file locking does [...]]]></description>
			<content:encoded><![CDATA[<ul>
<li><a href="http://virtualgeek.typepad.com/virtual_geek/2008/07/vm-ha---service.html">VM HA &#8211; service console networking, isolation behavior &#8211; and other &#8220;under the covers stuff&#8221;</a> &#8211; An overview of how VMware ESX&#8217;s High Availability works under the hood &#8211; making it much more apparent to me how important file locking is to HA&#8217;s functioning.  (I&#8217;d love to see an overview of how file locking does &#8211; or doesn&#8217;t &#8211; differ on VMFS versus NFS datastores.)</li>
<li><a href="http://blogs.sun.com/templedf/entry/why_upgrade">Why Upgrade?</a> &#8211; DanT on what&#8217;s new in Sun Grid Engine 6.0 through 6.2.</li>
<li><a href="http://blogs.netapp.com/eislers_nfs_blog/2008/07/part-ii-since-n.html">Part II: Since NFSv4 is Stateful It Must Be Less Robust, Right?</a> &#8211; &#8220;Just because CIFS is old and busted, that doesn&#8217;t mean NFSv4 is.&#8221;  <strong>Just kidding, that&#8217;s not an actual quote.</strong>  But I think it&#8217;s a reasonable summary of the piece.</li>
<li><a href="http://www.hp.com/go/pod">HP &#8211; Performance-Optimized Data Center</a> &#8211; Yet another vendor produces a data center-in-a-box product &#8211; which isn&#8217;t to say that there isn&#8217;t good technology inside HP&#8217;s product.  I wonder if container data centers will come down-market to the point where, in new office construction, they become a reasonable alternative to building a conventional server room.  (Seen at <a href="http://www.datacenterknowledge.com/archives/2008/Jul/16/hp_unveils_its_pod_data_center_container.html">Data Center Knowledge</a>.)</li>
<li><a href="http://blog.scottlowe.org/2008/07/16/understanding-nic-utilization-in-vmware-esx/">Understanding NIC Utilization in VMware ESX</a> &#8211; Scott Lowe comes through again with another practical piece on networking and VMware ESX.</li>
</ul>
]]></content:encoded>
			<wfw:commentRss>http://andyleonard.com/2008/07/16/large-link-dump-7-16-2008/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

