<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>blog.richardmcdougall.com</title>
	<atom:link href="http://blog.richardmcdougall.com/?feed=rss2" rel="self" type="application/rss+xml" />
	<link>http://blog.richardmcdougall.com</link>
	<description>Commentary on High End Systems and Virtualization Performance</description>
	<lastBuildDate>Wed, 28 Oct 2009 23:56:47 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.6</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Flash &amp; Phase Change Memory Talks</title>
		<link>http://blog.richardmcdougall.com/?p=95</link>
		<comments>http://blog.richardmcdougall.com/?p=95#comments</comments>
		<pubDate>Tue, 27 Oct 2009 00:46:45 +0000</pubDate>
		<dc:creator>rmc</dc:creator>
				<category><![CDATA[Performance]]></category>

		<guid isPermaLink="false">http://blog.richardmcdougall.com/?p=95</guid>
		<description><![CDATA[ 
This week, I attended talks at the international high performance transaction workshop. Following are my rough notes from the Flash Memory discussions.
First up Steve Kleiman from Network Appliance spoke about their intent to move  flash into the clients that access NAS, so that it can intelligently cache and interact with the backend storage:

Netapp is building host-side [...]]]></description>
			<content:encoded><![CDATA[<p> </p>
<p>This week, I attended talks at the international high performance transaction workshop. Following are my rough notes from the Flash Memory discussions.</p>
<p>First up <a href="http://www.netapp.com/us/company/news/press-room/kleiman_s_bio.html">Steve Kleiman</a> from Network Appliance spoke about their intent to move  flash into the clients that access NAS, so that it can intelligently cache and interact with the backend storage:</p>
<ul>
<li>Netapp is building host-side caches for VMware and HyperV</li>
<li>The cache will be a block based read cache </li>
<li>Cache is write through</li>
</ul>
<p> </p>
<p><a href="http://en.wikipedia.org/wiki/Andy_Bechtolsheim">Andy Bechtolshiem</a>, cofounder of Sun and now at <a href="http://www.aristanetworks.com/en/Index">Arista Networks</a> talked about trends affecting system design and how Flash could be leveraged.</p>
<ul>
<li>3d chip packaging will become common, to solve power and latency fundamentals around higher clock rate memory systems. </li>
<li>This will mean that systems will become very NUMA, since the memory is directly attached to the cpu core.</li>
<li>This is the decade where we switch to optical in the datacenter. Cutover is 10gbit (copper), 20 requires optical and will move to volume commodity optical interconnects for distances greater than 1m.</li>
<li>Flash densities to grow 100x between 2010 and 2022</li>
<li>Processors will have 1024 cores in 2022</li>
<li>NOR flash is not scaling as well as NAND flash, in terms of cost and density</li>
<li>Major barrier to non-voltaile main memory is the single host failure issue.</li>
<li>Current controllers (flash on sata) are not up to the speed</li>
<li>Speed of flash increases from 100us down to 10us from 2009 to 2012</li>
<li>ASP per $1GB of device flash $100 in 2004 to $1/gb in 2010 and 0.25c in 2012 (Halves every year)</li>
<li>Controllers improving rapily to keep up with flash throughput (they are deficient now)</li>
<li>Flash Interface moving from sata to pci-e in the near term</li>
</ul>
<p> </p>
<p>Microsoft Research Database - Phil Bernstein, Microsoft</p>
<ul>
<li>DB designed specifically for flash</li>
<li>no-partitioning</li>
<li>Taking advantage of native-flash chip semantics :append-only, db using log-appends, Updates the flash in append-only pages</li>
<li>Database is ZFS-like: root block, copy on write binary tree </li>
<li>Uses a shared storage layer</li>
<li>MySql &#8220;rethinkdb?&#8221; is doing something similar</li>
</ul>
<p> </p>
<p>Bradley Kurszmaul, Tokutek</p>
<ul>
<li>MySQL on SSD btree performance severely lacking</li>
<li>Solution: build a MySQL storage engine suited for flash</li>
<li>Observation: change read/write sizes and concurrency to suite: for today&#8217;s flash: ~20k reads, ~10k writes.</li>
<li>Significant improvement in transaction throughput observed using the above.</li>
</ul>
<p> </p>
<p>IBM on Phase Change Memory</p>
<ul>
<li>Next generation after flash: Phase Change Memory</li>
<li>Prediction is that SCM/PCM will flip the size relationship between RAM and 2nd level memory</li>
<li>Much more 2nd level, small fast on-chip RAM</li>
<li>PCM as fast as 100-1000ns</li>
<li>Discussion centered around Flash being a short term trend, and due to limitations (wear leveling etc) PCM is more likely to be part of the systems architecture long term.</li>
</ul>
<div></div>
]]></content:encoded>
			<wfw:commentRss>http://blog.richardmcdougall.com/?feed=rss2&amp;p=95</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>RISC to x86, Physical to Virtual</title>
		<link>http://blog.richardmcdougall.com/?p=94</link>
		<comments>http://blog.richardmcdougall.com/?p=94#comments</comments>
		<pubDate>Tue, 21 Apr 2009 20:12:52 +0000</pubDate>
		<dc:creator>rmc</dc:creator>
				<category><![CDATA[Benchmarking]]></category>
		<category><![CDATA[Oracle]]></category>
		<category><![CDATA[Performance]]></category>
		<category><![CDATA[Solaris]]></category>
		<category><![CDATA[VMware]]></category>
		<category><![CDATA[Virtualization]]></category>

		<guid isPermaLink="false">http://blog.richardmcdougall.com/?p=94</guid>
		<description><![CDATA[Many of the virtualization customers I speak with have been asking about what database can be virtualized, so I typically present a green, yellow and red set of criteria to help guide the process. We typically look at I/O rates, SMP width and capacity of the source physical system as a guide.
Recently, we were able [...]]]></description>
			<content:encoded><![CDATA[<p>Many of the virtualization customers I speak with have been asking about what database can be virtualized, so I typically present a green, yellow and red set of criteria to help guide the process. We typically look at I/O rates, SMP width and capacity of the source physical system as a guide.</p>
<p>Recently, we were able to run a full TPC-C like workload inside a virtual machine, showing that we can run at over 90% efficiency, even given the brutal nature of this workload (it&#8217;s I/O rate is actually about 6 times higher than what you would typically see in a customer workload).</p>
<p>The methodology is that green is considered for the easy databases &#8212; typically 2 CPUs or less and less than 500 disk I/O&#8217;s per second. These virtualize easy, with minimal best practices required. Yellow is for larger databases, where the I/O rate is higher &#8212; they typically need more care and attention to the way the storage is provisioned underneath the virtual machine. For example, if the database needs 1,000 IOPS, then it should be on a disk or LUN with at least 10 disks backing it, to provide the necessary I/O throughput.</p>
<p>Red is typically when you have a database which is much larger than the limits provided for by a virtual machine &#8212; e.g. the maximum amount of memory or number of virtual CPUs possible.</p>
<p>That got me thinking &#8212; now that vSphere 4.0 (the new name for VMware&#8217;s full virtualization offering) we have three things happening at once &#8212; greatly improved database efficiency, a now-wider SMP virtual machine (SMP up to 8 virtual CPUs), and the increased throughput of the new Intel Nehalem systems. That combined, we have enough throughput to run some of the larger databases &#8212; often hosted on larger RISC machines.</p>
<p>Since it&#8217;s a database system, it&#8217;s relatively easy to migrate from SPARC to x86, given that there is an x86 equivalent for the OS (e.g. Solaris or Linux) and the database (e.g. Oracle).</p>
<p>By taking a composite metric of several different benchmarks, we can create a capability mapping chart between different RISC CPUs and the performance that we see inside a virtual machine. Some of the data may be surprising &#8212; some of those large older SPARC machines can now fit into a virtual machine on the newer x86 systems.</p>
<p>The first version of the table is over at the <a href="http://blogs.vmware.com/performance/2009/04/database-sizing-charts-for-vsphere-40.html">VMware performance blog</a>.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.richardmcdougall.com/?feed=rss2&amp;p=94</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Solaris Internals Virtual Shell Game</title>
		<link>http://blog.richardmcdougall.com/?p=80</link>
		<comments>http://blog.richardmcdougall.com/?p=80#comments</comments>
		<pubDate>Mon, 02 Mar 2009 19:33:33 +0000</pubDate>
		<dc:creator>rmc</dc:creator>
				<category><![CDATA[Benchmarking]]></category>
		<category><![CDATA[Fun]]></category>
		<category><![CDATA[NAS]]></category>
		<category><![CDATA[Performance]]></category>
		<category><![CDATA[Solaris]]></category>
		<category><![CDATA[VMware]]></category>
		<category><![CDATA[Virtualization]]></category>

		<guid isPermaLink="false">http://blog.richardmcdougall.com/?p=80</guid>
		<description><![CDATA[In a stark contrast to recent past when Solaris Internals was hosted on physical machines, we&#8217;ve just experienced one of the core values of virtualization &#8212; encapsulated, portable state allowing geographic mobility. Solaris Internals has over the last two weeks been teleported out of my datacenter (As Jim notes, the &#8220;RMCplex&#8221;, which refers to my [...]]]></description>
			<content:encoded><![CDATA[<p>In a stark contrast to recent past when Solaris Internals was hosted on physical machines, we&#8217;ve just experienced one of the core values of virtualization &#8212; encapsulated, portable state allowing geographic mobility. Solaris Internals has over the last two weeks been teleported out of my datacenter (As Jim notes, the &#8220;RMCplex&#8221;, which refers to my rack of managed servers), to the VMware&#8217;s CTO&#8217;s iMac, and then to my new physical location &#8212; keeping the the solarisinternals.com site live the entire duration. Yes, I just moved house, with zero downtime!</p>
<p style="text-align: center;"><a href="http://blog.richardmcdougall.com/wp-content/uploads/2009/03/simove.png"><img class="alignnone size-medium wp-image-90 aligncenter" title="simove" src="http://blog.richardmcdougall.com/wp-content/uploads/2009/03/simove.png" alt="" width="300" height="119" /></a></p>
<p><strong>The Physical RMCplex</strong></p>
<p>It wasn&#8217;t long ago that solarisinternals.com was hosted on a triad of physical machines &#8211; a SPARCstation 5 as the router, a dual-socket Opteron machine as the ZFS NAS server, and a dual core AMD server.</p>
<p style="text-align: center;"><img class="alignnone aligncenter" src="http://blog.richardmcdougall.com/wp-content/uploads/2009/03/sidiag1.png" alt="" /></p>
<p>The RMCplex is hosting many sites today:</p>
<ul>
<li> solarisdatabases.com</li>
<li> solarisinternals.com</li>
<li> pronk*.com</li>
<li> joost*.com</li>
<li> zygan*.com</li>
<li> mail server</li>
<li> dns server</li>
<li> shared shell environment server</li>
</ul>
<p>The router provides the perimeter security and QoS rules for the public traffic. I have four DMZ networks &#8212; management consoles, public Zone/VM traffic, a public wireless DMZ, and a private NFS-Only gigabit DMZ. All the data is held on a ZFS server, which is replicated to a remote location. The total power consumption of the rmcplex was about 650w &#8212; which incidently, in California costs about $1,300 a year.</p>
<p>In the past, upgrading or fiddling with versions of Solaris was a very hands-on operation &#8212; interacting directly with the physical console of the machine, while the site was momentarily down. I often upgrade to later builds of ZFS or Solaris (I&#8217;ve been hosting on Solaris Nevada since snv-21). We&#8217;d incur downtime when I had to upgrade or do maintenance on any one of the three machines.</p>
<p><strong>The Virtual RMCplex</strong></p>
<p>Last year, I moved the entire setup to VMware ESX. I&#8217;m using ESX 3.5, managed with Virtual Center. There were multiple motivations for this, which I&#8217;ll summarize later, but one of the most compelling was the ability to stage new/old versions of each now-virtual server concurrently, to eliminate the downtime during upgrades. I can easily bring up a new version of solarisinternals.com on snv101 for example, while the older one is still running. When I&#8217;m happy with the new app/OS provisioning, I just flip over to the new VM.</p>
<p>The new setup looks logically similar, but it&#8217;s appearance in the physical world is stunningly different. It went from three physical machines down to one, and from four to one ethernet switches.</p>
<p style="text-align: center;"><img src="http://blog.richardmcdougall.com/wp-content/uploads/2009/03/sidiag2.png" alt="" /></p>
<p>To move the images from physical to virtual, I simply copied the raw disk image using dd, and encapsulated the raw disk image into a vmdk-format virtual disk. A small change to bootenv.rc and the image boots fine as a virtual machine, the rest of the OS environment is unmodified.</p>
<p>I&#8217;ve virtualized everything &#8212; including the router/firewall, all of the physical LAN segments (using Vlans and Vlan capable switch), and the Solaris ZFS server. The DMZ networks the link between the router and the hosted site virtual machines are now completely virtual &#8212; i.e., they are virtual ethernet networks with TCP traffic flowing between virtual machines, but never connect to anything physical. Consequently, I was able to eliminate the switch hardware for those segments of the configuration, too.</p>
<p>The ZFS server is interesting &#8212; it&#8217;s the NAS server for the virtual disks off all the virtual machines hosted on ESX, except for itself. This allows me to fully utilize ZFS&#8217;s hourly snapshot capability to roll forward, backward, and leverage compression. The ZFS virtual appliance is about 8TB of storage using 10 SATA disks, in a single virtual machine. I&#8217;m using ZFS with VMware ESX&#8217;s raw disk mode &#8212; so ZFS still see&#8217;s a whole physical disk and is able to operate just as it was before on physical &#8212; managing RAID-Z across the disks, and performing fault management isolation etc,&#8230; Incidently, I benchmarked the ZFS appliance just after converting it, and noted that it streams from a 5 disk SATA based zpool at about 350MB/sec sustained &#8212; likely enough for what we need <img src='http://blog.richardmcdougall.com/wp-includes/images/smilies/icon_wink.gif' alt=';-)' class='wp-smiley' />  That&#8217;s pretty close to the raw transfer speed of the 5 SATA disks. I&#8217;ve also enabled the ZIL on a NVRAM backed device, and used a flash ARC cache, which greatly helped performance. I&#8217;ll do a separate post sometime on the ZFS virtual appliance.</p>
<p><strong>Teleportation</strong></p>
<p>Back to the story of teleporting the running instance. Since I&#8217;m using just a single node of ESX, with fairly simple management tools around it, I was looking for a quick way of keeping solarisinternals.com up and running while I moved house. I was considering signing up for a virtual-hoster company to move the solarisinternals.com virtual machine to temporarily, but there was even a simpler way. Our CTO run&#8217;s VMware fusion at home on his iMac, and is close by. So we copied the solarisinternals.com virtual machine on to a 8GB compact flash card, and did a file-&gt;load onto VMware-fusion at the remote end.</p>
<p style="text-align: center;"><a href="http://blog.richardmcdougall.com/wp-content/uploads/2009/03/sandisk-8gb-flash8-g-79360-13.png"><img class="alignnone size-medium wp-image-92 aligncenter" title="sandisk-8gb-flash8-g-79360-13" src="http://blog.richardmcdougall.com/wp-content/uploads/2009/03/sandisk-8gb-flash8-g-79360-13.png" alt="" width="300" height="279" /></a></p>
<p>Within a few minutes, we had a fully running replica of solarisinternals.com hosted at a different physical location. We overlapped the move by disabling updates at the original location, changing DNS, and enabling updates at the new location. That way, any requests with the stale IP would get read only access for the 5 minute TTL configured in DNS. We did this 10 days ago, and apart from the small hint &#8220;hosted on VMware Fusion&#8221; logo on the front page, it should have been completely transparent.</p>
<p>I monitor the site internally with the soon to be released &#8220;AppSpeed&#8221; product that is developed from the BHive aquisition by VMware last year, and externally using mon.itor.us. Over this weekend I did the reverse move, back onto the rmcplex. You can see from the monitoring data that we maintained similar performance levels before and after the move.</p>
<p style="text-align: center;"><a href="http://blog.richardmcdougall.com/wp-content/uploads/2009/03/si_fusion.png"><img class="alignnone size-medium wp-image-88 aligncenter" title="si_fusion" src="http://blog.richardmcdougall.com/wp-content/uploads/2009/03/si_fusion.png" alt="" width="300" height="166" /></a></p>
<p style="text-align: center;"><strong>solarisinternals.com latency on VMware Fusion</strong></p>
<p>And just after the move back to the RMCplex:</p>
<p style="text-align: center;"><a href="http://blog.richardmcdougall.com/wp-content/uploads/2009/03/si_rmcplex.png"><img class="alignnone size-medium wp-image-89 aligncenter" title="si_rmcplex" src="http://blog.richardmcdougall.com/wp-content/uploads/2009/03/si_rmcplex.png" alt="" width="300" height="166" /></a></p>
<p style="text-align: center;"><strong>solarisinternals.com latency on VMware ESX</strong></p>
<p><strong>Out with all that Physical Stuff!</strong></p>
<p>After this mini-datacenter consolidation,I ended up being able to throw out several old ethernet switches, about 50 cables, and three servers:</p>
<p style="text-align: center;"><a href="http://blog.richardmcdougall.com/wp-content/uploads/2009/03/sijunk.jpg"><img class="alignnone size-medium wp-image-83 aligncenter" title="sijunk" src="http://blog.richardmcdougall.com/wp-content/uploads/2009/03/sijunk.jpg" alt="" width="300" height="278" /></a></p>
<p>I&#8217;m saving quite a bit of power &#8212; going from 650 watts of power down to 214w (a saving of $880 per year in energy costs).</p>
<p style="text-align: center;"><a href="http://blog.richardmcdougall.com/wp-content/uploads/2009/03/powermeterafter.jpg"><img class="alignnone size-medium wp-image-84 aligncenter" title="powermeterafter" src="http://blog.richardmcdougall.com/wp-content/uploads/2009/03/powermeterafter.jpg" alt="" width="300" height="199" /></a></p>
<p>And it looks quite a bit simpler:</p>
<p style="text-align: center;"><a href="http://blog.richardmcdougall.com/wp-content/uploads/2009/03/rmcplex.jpg"><img class="alignnone size-medium wp-image-85 aligncenter" title="rmcplex" src="http://blog.richardmcdougall.com/wp-content/uploads/2009/03/rmcplex.jpg" alt="" width="199" height="300" /></a></p>
<p><strong>Summary</strong></p>
<p>The new setup is much easier to manage, I get remote consoles from any IP connected device, it has the ablity to stage/roll forward/backward, and has geographic mobility when it&#8217;s required. Patching and upgrading is much easier now too.</p>
<p>That&#8217;s it for now &#8212; I have some interesting stories about the SQL and HTML performance data we&#8217;ve been able to collect with the Appspeed setup on the site, and I&#8217;ll share that soon in a subsequent post&#8230;</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.richardmcdougall.com/?feed=rss2&amp;p=80</wfw:commentRss>
		<slash:comments>11</slash:comments>
		</item>
		<item>
		<title>Saving Joules II</title>
		<link>http://blog.richardmcdougall.com/?p=79</link>
		<comments>http://blog.richardmcdougall.com/?p=79#comments</comments>
		<pubDate>Thu, 06 Nov 2008 17:31:44 +0000</pubDate>
		<dc:creator>rmc</dc:creator>
				<category><![CDATA[Benchmarking]]></category>
		<category><![CDATA[Fun]]></category>
		<category><![CDATA[Performance]]></category>
		<category><![CDATA[VMware]]></category>
		<category><![CDATA[Virtualization]]></category>

		<guid isPermaLink="false">http://blog.richardmcdougall.com/?p=79</guid>
		<description><![CDATA[[ there was a bad youtube server hosting the first video, so here's a resubmit with a pointer to an alternative youtube copy
It's been well talked up that consolidating physical systems can save a lot of datacenter engery -- for example, consolidating 20 systems, all with 3-5% utilization can reduce power by up to 10-15x [...]]]></description>
			<content:encoded><![CDATA[<p>[ there was a bad youtube server hosting the first video, so here's a resubmit with a pointer to an alternative youtube copy</p>
<p>It's been well talked up that consolidating physical systems can save a lot of datacenter engery -- for example, consolidating 20 systems, all with 3-5% utilization can reduce power by up to 10-15x -- <span id="lingo_span" class="lingo_region">or considered in more tractable terms, each server removed is equivalent to taking four cars off the road or planting 55 trees in a year.</span></p>
<p>Taking this concept further, the distributed power management feature in Virtual Infrastructure allows us to further reduce power by another 50%. Since load levels of typical applications aren&#8217;t constant, there are periods of the day where less servers are required &#8212; and in those periods, power can be saved by moving the remaining work onto fewer servers and powering down the unneeded capacity</p>
<p><object width="425" height="344"><param name="movie" value="http://www.youtube.com/v/Nkv0fhu-m2k&#038;hl=en&#038;fs=1"></param><param name="allowFullScreen" value="true"></param><param name="allowscriptaccess" value="always"></param><embed src="http://www.youtube.com/v/Nkv0fhu-m2k&#038;hl=en&#038;fs=1" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="344"></embed></object></p>
<p>A colleague put together <a href="http://www.youtube.com/watch?v=7CbRS0GGuNc">this entertaining video</a>, in which we measured the performance and watts consumed during typical peaky workload. The scenario is based on the email load curve typically observed &#8212; that peaks at 9am in the morning, and is at a minimum during the evening hours.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.richardmcdougall.com/?feed=rss2&amp;p=79</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>VMware Performance Tutorial next week at Usenix LISA</title>
		<link>http://blog.richardmcdougall.com/?p=78</link>
		<comments>http://blog.richardmcdougall.com/?p=78#comments</comments>
		<pubDate>Wed, 05 Nov 2008 22:26:01 +0000</pubDate>
		<dc:creator>rmc</dc:creator>
				<category><![CDATA[Benchmarking]]></category>
		<category><![CDATA[Performance]]></category>
		<category><![CDATA[Public Talks]]></category>
		<category><![CDATA[Virtualization]]></category>

		<guid isPermaLink="false">http://blog.richardmcdougall.com/?p=78</guid>
		<description><![CDATA[Heads up for those interested in understanding VMware performance in more details &#8212; I&#8217;m running my performance and tuning of VMware ESX server class at Usenix LISA again this November in San Diego.   The session is on Friday, November 14, and is an all-day class.
Given our obsession with things that go fast, the [...]]]></description>
			<content:encoded><![CDATA[<p>Heads up for those interested in understanding VMware performance in more details &#8212; I&#8217;m running my performance and tuning of VMware ESX server class at Usenix LISA again this November in San Diego.   The session is on Friday, November 14, and is an all-day class.</p>
<p>Given our obsession with things that go fast, the track code &#8220;<a href="http://en.wikipedia.org/wiki/F1">F1</a>&#8221; is most appropriate <img src='http://blog.richardmcdougall.com/wp-includes/images/smilies/icon_wink.gif' alt=';-)' class='wp-smiley' /> </p>
<p><a href="http://blogs.vmware.com/performance/2008/10/one-day-tutoria.html">http://blogs.vmware.com/performance/2008/10/one-day-tutoria.html</a></p>
]]></content:encoded>
			<wfw:commentRss>http://blog.richardmcdougall.com/?feed=rss2&amp;p=78</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>Saving Joules</title>
		<link>http://blog.richardmcdougall.com/?p=77</link>
		<comments>http://blog.richardmcdougall.com/?p=77#comments</comments>
		<pubDate>Wed, 05 Nov 2008 20:39:54 +0000</pubDate>
		<dc:creator>rmc</dc:creator>
				<category><![CDATA[Benchmarking]]></category>
		<category><![CDATA[Fun]]></category>
		<category><![CDATA[Performance]]></category>
		<category><![CDATA[VMware]]></category>
		<category><![CDATA[Virtualization]]></category>

		<guid isPermaLink="false">http://blog.richardmcdougall.com/?p=77</guid>
		<description><![CDATA[It&#8217;s been well talked up that consolidating physical systems can save a lot of datacenter engery &#8212; for example, consolidating 20 systems, all with 3-5% utilization can reduce power by up to 10-15x &#8212; or considered in more tractable terms, each server removed is equivalent to taking four cars off the road or planting 55 [...]]]></description>
			<content:encoded><![CDATA[<p>It&#8217;s been well talked up that consolidating physical systems can save a lot of datacenter engery &#8212; for example, consolidating 20 systems, all with 3-5% utilization can reduce power by up to 10-15x &#8212; <span id="lingo_span" class="lingo_region">or considered in more tractable terms, each server removed is equivalent to taking four cars off the road or planting 55 trees in a year.</span></p>
<p>Taking this concept further, the distributed power management feature in Virtual Infrastructure allows us to further reduce power by another 50%. Since load levels of typical applications aren&#8217;t constant, there are periods of the day where less servers are required &#8212; and in those periods, power can be saved by moving the remaining work onto fewer servers and powering down the unneeded capacity</p>
<p><object width="425" height="344"><param name="movie" value="http://www.youtube.com/v/7CbRS0GGuNc&#038;hl=en&#038;fs=1"></param><param name="allowFullScreen" value="true"></param><param name="allowscriptaccess" value="always"></param><embed src="http://www.youtube.com/v/7CbRS0GGuNc&#038;hl=en&#038;fs=1" type="application/x-shockwave-flash" allowscriptaccess="always" allowfullscreen="true" width="425" height="344"></embed></object></p>
<p>A colleague put together <a href="http://www.youtube.com/watch?v=7CbRS0GGuNc">this entertaining video</a>, in which we measured the performance and watts consumed during typical peaky workload. The scenario is based on the email load curve typically observed &#8212; that peaks at 9am in the morning, and is at a minimum during the evening hours.</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.richardmcdougall.com/?feed=rss2&amp;p=77</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Recharge my ride&#8230;</title>
		<link>http://blog.richardmcdougall.com/?p=76</link>
		<comments>http://blog.richardmcdougall.com/?p=76#comments</comments>
		<pubDate>Fri, 25 Jul 2008 00:50:37 +0000</pubDate>
		<dc:creator>rmc</dc:creator>
				<category><![CDATA[Fun]]></category>

		<guid isPermaLink="false">http://blog.richardmcdougall.com/?p=76</guid>
		<description><![CDATA[I&#8217;m jealous:
http://teslafounders.wordpress.com/2008/07/19/tesla-founders-car/
Great to see the Telsa is finally shipping!
]]></description>
			<content:encoded><![CDATA[<p>I&#8217;m jealous:</p>
<p><a href="http://teslafounders.wordpress.com/2008/07/19/tesla-founders-car/">http://teslafounders.wordpress.com/2008/07/19/tesla-founders-car/</a></p>
<p>Great to see the Telsa is finally shipping!</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.richardmcdougall.com/?feed=rss2&amp;p=76</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>IO Challenge Update 5: 102,240 IOPS</title>
		<link>http://blog.richardmcdougall.com/?p=73</link>
		<comments>http://blog.richardmcdougall.com/?p=73#comments</comments>
		<pubDate>Wed, 21 May 2008 19:49:57 +0000</pubDate>
		<dc:creator>rmc</dc:creator>
				<category><![CDATA[Performance]]></category>
		<category><![CDATA[VMware]]></category>
		<category><![CDATA[Virtualization]]></category>

		<guid isPermaLink="false">http://blog.richardmcdougall.com/?p=73</guid>
		<description><![CDATA[We&#8217;re really excited to announce that we hit over 100k IOPS!
This was done using 77TB of storage, adding a 3rd rack totalling 495 disks &#8212; enough storage to hold the entire printed library of congress, and all being driven from a single ESX server!

Chethan&#8217;s just written a detailed report over on VROOM&#8230;
The interesting points about [...]]]></description>
			<content:encoded><![CDATA[<p>We&#8217;re really excited to announce that we hit over 100k IOPS!</p>
<p>This was done using 77TB of storage, adding a 3rd rack totalling 495 disks &#8212; enough storage to hold the entire printed library of congress, and all being driven from a single ESX server!</p>
<p><img src="http://blog.richardmcdougall.com/wp-content/uploads/2008/05/cx3_80.gif" alt="" width="250" height="254" /></p>
<p>Chethan&#8217;s just written a detailed report over on <a href="http://blogs.vmware.com/performance/2008/05/100000-io-opera.html">VROOM&#8230;</a></p>
<p>The interesting points about this effort to me are:</p>
<ul>
<li>There was almost no tuning required</li>
<li>There is no visible I/O throughput ceiling for ESX &#8212; the throughput is roughly that of 50 4-cpu databases&#8230;</li>
<li>Even at 100,000 IOPS, the virtualization layer only added 0.1ms of additional latency, which amounts to 1.4% virtualization overhead</li>
</ul>
<p><img src="http://blog.richardmcdougall.com/wp-content/uploads/2008/05/iops4.gif" alt="" width="480" height="347" /></p>
]]></content:encoded>
			<wfw:commentRss>http://blog.richardmcdougall.com/?feed=rss2&amp;p=73</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>IO Challenge Update 4: 59,760 IOPS</title>
		<link>http://blog.richardmcdougall.com/?p=70</link>
		<comments>http://blog.richardmcdougall.com/?p=70#comments</comments>
		<pubDate>Tue, 20 May 2008 22:07:18 +0000</pubDate>
		<dc:creator>rmc</dc:creator>
				<category><![CDATA[Benchmarking]]></category>
		<category><![CDATA[VMware]]></category>
		<category><![CDATA[Virtualization]]></category>

		<guid isPermaLink="false">http://blog.richardmcdougall.com/?p=70</guid>
		<description><![CDATA[We&#8217;ve doubled the number of disks, totaling just over 300, taking up two entire CX-380 racks..

That&#8217;s 44TB of disk, and enough spindles to drive over 100,000 exchange users&#8230;

More disks are on the way&#8230;
]]></description>
			<content:encoded><![CDATA[<p>We&#8217;ve doubled the number of disks, totaling just over 300, taking up two entire CX-380 racks..</p>
<p><img src="http://blog.richardmcdougall.com/wp-content/uploads/2008/05/emc-cx32.jpg" alt="" width="133" height="200" /></p>
<p>That&#8217;s 44TB of disk, and enough spindles to drive over 100,000 exchange users&#8230;</p>
<p><img src="http://blog.richardmcdougall.com/wp-content/uploads/2008/05/iops3.gif" alt="" width="500" height="361" /></p>
<p>More disks are on the way&#8230;</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.richardmcdougall.com/?feed=rss2&amp;p=70</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>File System Benchmarking Workshop in Santa Cruz</title>
		<link>http://blog.richardmcdougall.com/?p=66</link>
		<comments>http://blog.richardmcdougall.com/?p=66#comments</comments>
		<pubDate>Mon, 19 May 2008 18:35:12 +0000</pubDate>
		<dc:creator>rmc</dc:creator>
				<category><![CDATA[Benchmarking]]></category>
		<category><![CDATA[FileBench]]></category>
		<category><![CDATA[Performance]]></category>
		<category><![CDATA[VMware]]></category>
		<category><![CDATA[Virtualization]]></category>
		<category><![CDATA[Work]]></category>

		<guid isPermaLink="false">http://blog.richardmcdougall.com/?p=66</guid>
		<description><![CDATA[
We&#8217;re at Santa Cruz this week at the workshop for file system benchmarking. , chaired by Erez Zadok of StonyBrook University.

The objective of this week is to converge on a set of strategies and recommendations towards more accurate and representative benchmarking of file systems and storage.
Many of the File Systems and I/O benchmarks are represented:

iozone
FileBench
SFS
ZFS
NFS
Microsoft [...]]]></description>
			<content:encoded><![CDATA[<p><img src="http://blog.richardmcdougall.com/wp-content/uploads/2008/05/ucscslug.gif" alt="" width="153" height="138" /></p>
<p>We&#8217;re at Santa Cruz this week at the <a href="http://www.ssrc.ucsc.edu/wikis/ssrc/BenchmarkingWorkshop08/">workshop for file system benchmarking</a>. , chaired by <a href="http://www.cs.sunysb.edu/~ezk/">Erez Zadok</a> of <a href="http://www.sunysb.edu/">StonyBrook University</a>.</p>
<p><img src="http://blog.richardmcdougall.com/wp-content/uploads/2008/05/ucsc.jpg" alt="" /></p>
<p>The objective of this week is to converge on a set of strategies and recommendations towards more accurate and representative benchmarking of file systems and storage.</p>
<p>Many of the File Systems and I/O benchmarks are represented:</p>
<ul>
<li><a href="http://www.iozone.org/">iozone</a></li>
<li><a href="http://www.solarisinternals.com/wiki/index.php/FileBench">FileBench</a></li>
<li><a href="http://www.spec.org/sfs93/news/sfsbase.html">SFS</a></li>
<li><a href="http://en.wikipedia.org/wiki/ZFS">ZFS</a></li>
<li><a href="http://en.wikipedia.org/wiki/Network_File_System_(protocol)">NFS</a></li>
<li><a href="http://msdn.microsoft.com/en-us/library/aa365233.aspx">Microsoft Clients</a></li>
</ul>
<p><a href="http://www.virtualization.info/2008/02/waiting-for-vmworld-europe-2008-part-6.html">Devaki</a>, <a href="http://virtualscoop.org/">Irfan</a> and I will be presenting on the methodologies and strategies for measuring I/O performance with virtualized platforms. Some of the topics we&#8217;ll raise are</p>
<ul>
<li>Virtualization brings a diverse sets of applications: No single workload footprint on the system</li>
<li>Workload characteristics are combined when workloads are consolidated onto a single system, resulting in stochastic workload patterns</li>
<li>Different IO patterns to the same volume, or IO from one app split to different volumes</li>
<li>Provisioning operations should be considered separate from applications run-time footprint (e.g. Create VM, Power On VM</li>
<li>Virtual machines are stored in a clustered file system, need to consider locking</li>
<li>CPU and virtual device emulation can impact storage performance</li>
<li>System setup can affect performance</li>
<li>Partition alignment can affect performance</li>
<li>File system based or raw device mapping possible</li>
<li>New hardware assist technology should be factored and/or considered</li>
<li>Proxied I/O protocols are common: e.g. SCSI proxied over NFS within hypervisor</li>
</ul>
<p>I&#8217;ll try to post an update of the workshop outcomes&#8230;</p>
]]></content:encoded>
			<wfw:commentRss>http://blog.richardmcdougall.com/?feed=rss2&amp;p=66</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
