<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>BioTeam Inc. &#187; amazon cloud</title>
	<atom:link href="http://blog.bioteam.net/tag/amazon-cloud/feed/" rel="self" type="application/rss+xml" />
	<link>http://blog.bioteam.net</link>
	<description>Latest news: publications, presentations, projects and training classes</description>
	<lastBuildDate>Thu, 29 Jul 2010 14:15:15 +0000</lastBuildDate>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.0</generator>
		<item>
		<title>Playing with NFS &amp; GlusterFS on Amazon cc1.4xlarge EC2 instance types</title>
		<link>http://blog.bioteam.net/2010/07/29/playing-with-nfs-glusterfs-on-amazon-cc1-4xlarge-ec2-instance-types/</link>
		<comments>http://blog.bioteam.net/2010/07/29/playing-with-nfs-glusterfs-on-amazon-cc1-4xlarge-ec2-instance-types/#comments</comments>
		<pubDate>Thu, 29 Jul 2010 14:07:42 +0000</pubDate>
		<dc:creator>chrisdag</dc:creator>
				<category><![CDATA[Employee Posts]]></category>
		<category><![CDATA[amazon]]></category>
		<category><![CDATA[amazon cloud]]></category>
		<category><![CDATA[aws]]></category>
		<category><![CDATA[cc1.4xlarge]]></category>
		<category><![CDATA[cloud]]></category>
		<category><![CDATA[ebs]]></category>
		<category><![CDATA[ebs performance]]></category>

		<guid isPermaLink="false">http://blog.bioteam.net/?p=664</guid>
		<description><![CDATA[Early single-client tests of shared ephemeral storage via NFS and parallel GlusterFS We here at BioTeam have been kicking tires and generally exploring around the edges of the new Amazon cc1.4xlarge &#8220;compute cluster&#8221; EC2 instance types. Much of our experimentation has been centered around simplistic benchmarking techniques as a way of slowly zeroing in on [...]]]></description>
			<content:encoded><![CDATA[<h3>Early single-client tests of shared ephemeral storage via NFS and parallel GlusterFS</h3>
<p>We here at BioTeam have been kicking tires and <a href="http://blog.bioteam.net/2010/07/19/exploring-the-new-aws-compute-cluster-ec2-instances/">generally exploring around the edges of the new Amazon cc1.4xlarge &#8220;compute cluster&#8221; EC2 instance types</a>. Much of our experimentation has been centered around simplistic benchmarking techniques as a way of slowly zeroing in on the methods, techniques and orchestration approaches most likely to have a significant usability, performance or <em>wallclock-time-to-scientific-results</em> outcome for the work we do professionally for ourselves and our clients.</p>
<p> </p>
<p><img style="display: block; margin-left: auto; margin-right: auto;" title="glusterFS-002.png" src="http://blog.bioteam.net/wp-content/uploads/2010/07/glusterFS-002.png" border="0" alt="glusterFS-002.png" width="500" height="203" /></p>
<p>We are asking very broad questions and testing assumptions along the lines of:</p>
<ul>
<li>Does the hot new 10 Gigabit non-blocking networking fabric backing up the new instance types really mean that &#8220;legacy&#8221; compute farm and HPC cluster architectures which make heavy use of network filesharing possible?
</li>
<li>How does filesharing between nodes look and feel on the new network and instance types?
</li>
<li>Are the speedy ephemeral disks on the new instance types suitable for bundling into NFS shares or aggregating into parallel or clustered distribtued filesystems?
</li>
<li>Can we use the replication features in GlusterFS to mitigate some of the risks of using ephemeral disk for storage? 
</li>
<li>Should the shared storage built from ephermeral disk be assigned to &#8220;/scratch&#8221; or other non-critical duties due to the risks involved? What can we do to mitigate the risks?
</li>
<li>At what scale is NFS the easiest and most suitable sharing option? What are the best NFS server and client tuning parameters to use? 
</li>
<li>When using parallel or cluster filesystems like GlusterFS, what rough metrics can we use to figure out how many data servers to dedicate to a particular cluster size or workflow profile?</li>
</ul>
<p> </p>
<h4>GlusterFS &amp; NFS Initial Testing</h4>
<p>Over the past week we have been running tests on two types of network filesharing. We&#8217;ve only tested against a single client so obviously these results say nothing about at-scale performance or operation.</p>
<p>Types of tests:</p>
<ol>
<li>Take the pair of ~900GB ephemeral disks on the instance type, stripe them together as a RAID0 set, slap an XFS filesystem on top and export the entire volume out via NFS
</li>
<li>Take the pair of ~900GB ephemeral disks on the instance type, slap a single large partition on each drive, format each drive with an EXT3 filesystem and then use GlusterFS to create, mount and export the volume via the GlusterFS protocol</li>
</ol>
<p>For each of the above two test types we repeatedly ran (at least 4x times) our standard bonnie++ benchmark tests (methodology described in the earlier blog posts). The tests were run on a single remote client that was either NFS mounting or GlusterFS mounting the file share.</p>
<p><strong>GlusterFS parameters</strong></p>
<ul>
<li>None really. We used the standard volume creation command and mounted the file share via the GlusterFS protocol over TCP. Eventually we want to ask some of our GlusterFS expert friends for additional tuning guidance</li>
</ul>
<p><strong>NFS parameters:</strong></p>
<ul>
<li>Server export file:  &#8221;/nfs    &lt;host&gt;(rw,async)?&#8221;</li>
<li>NFS Server config: boosted the number of nfsd daemons to 16 via edits to /etc/sysconfig/nfs file</li>
<li>Client mount options:  &#8221;mount -t nfs -o rw,async,hard,intr,retrans=2,rsize=32768,wsize=32768,nfsvers=3,tcp &lt;host&gt;:/nfs /nfs-scratch?&#8221;</li>
</ul>
<p> </p>
<h3>Lessons Learned So Far  - NFS vs GlusterFS</h3>
<ul>
<li>GlusterFS was incredibly easy to install and creating and exporting parallel filesystem shares was straightforward. The methods involved are easily scripted/automated or built into a server orchestration strategy. The process was so simple that initially we were thinking that GlusterFS would be our default sharing option for all our work on the new compute cluster instances
</li>
<li>GlusterFS has <strong><em>ONE HUGE DOWNSIDE</em></strong>. It turns out that GlusterFS recommends that the participating disk volumes be formatted with an ext3 filesystem for best results. This is &#8230; problematic &#8230; with the 900GB ephemeral disks because<strong><em> formatting a 900 gb disk with ext3 takes damn near forever</em></strong>. We estimate about 15-20 minutes of wallclock time wasted while waiting for the &#8220;mkfs.ext3&#8243; command to complete.
</li>
<li>The wallclock time lost to formatting ext3 volumes for GlusterFS usage is significant enough to affect how we may or may not use GlusterFS in the future. Maybe there is a different filesystem we can use with a faster formatting profile. Using XFS and software RAID we can normally stand up and export filesystems in a matter of a few seconds or a minute or two. Sadly, XFS is not recommended at all with current versions of GlusterFS. </li>
<li> Using GlusterFS with the recommended ext3 configuration seems to mean that we have to accept a minimum delay of 15 minutes or even more when standing up and exporting new storage. This is unacceptable for small deployments or workflows where you might only be running the EC2 instances for a short time.
</li>
<li>The possibility of using the GlusterFS replication features to mitigate against the risks of using ephermeral storage might be significant. We need to do more testing in this configuration. 
</li>
<li>Given the extensive wallclock time delays inherent in waiting for ext3 filesystem formatting to complete in a GlusterFS scenario it seems likely that we might default to using a tuned NFS server setup for (a) small clusters &amp; compute farms or (b) systems that we plan to stand up only for a few hours.
</li>
<li>The overhead of provisioning GlusterFS becomes less significant when we have very large clusters that can benefit from the inherent scaling ability of GlusterFS or when we plan to stand up the clusters for longer periods of time</li>
</ul>
<p> </p>
<h3>Benchmark Results</h3>
<p>In all the results shown below I&#8217;ve included data from a 2-disk RAID0 ephemeral storage  setup. This is so that the network filesharing data can be contrasted against the results seen from running bonnie++ locally.</p>
<p><em>Click on the images for a larger version</em>.</p>
<p><a href="http://blog.bioteam.net/wp-content/uploads/2010/07/glusterFS-004.png"><img style="display: block; margin-left: auto; margin-right: auto;" title="glusterFS-004.png" src="http://blog.bioteam.net/wp-content/uploads/2010/07/glusterFS-004.png" border="0" alt="glusterFS-004.png" width="600" /></a></p>
<p><a href="http://blog.bioteam.net/wp-content/uploads/2010/07/glusterFS-005.png"><img style="display: block; margin-left: auto; margin-right: auto;" title="glusterFS-005.png" src="http://blog.bioteam.net/wp-content/uploads/2010/07/glusterFS-005.png" border="0" alt="glusterFS-005.png" width="600" /></a></p>
<p> </p>
<p><a href="http://blog.bioteam.net/wp-content/uploads/2010/07/glusterFS-006.png"><img style="display: block; margin-left: auto; margin-right: auto;" title="glusterFS-006.png" src="http://blog.bioteam.net/wp-content/uploads/2010/07/glusterFS-006.png" border="0" alt="glusterFS-006.png" width="600" /></a></p>
<p> </p>
<p> </p>
<p> </p>
<img src="http://blog.bioteam.net/?ak_action=api_record_view&id=664&type=feed" alt="" />]]></content:encoded>
			<wfw:commentRss>http://blog.bioteam.net/2010/07/29/playing-with-nfs-glusterfs-on-amazon-cc1-4xlarge-ec2-instance-types/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Boot, ephemeral &amp; EBS storage performance on amazon cc1.4xlarge instance types</title>
		<link>http://blog.bioteam.net/2010/07/20/boot-ephemeral-ebs-storage-performance-on-amazon-cc1-4xlarge-instance-types/</link>
		<comments>http://blog.bioteam.net/2010/07/20/boot-ephemeral-ebs-storage-performance-on-amazon-cc1-4xlarge-instance-types/#comments</comments>
		<pubDate>Tue, 20 Jul 2010 22:51:51 +0000</pubDate>
		<dc:creator>chrisdag</dc:creator>
				<category><![CDATA[Employee Posts]]></category>
		<category><![CDATA[amazon]]></category>
		<category><![CDATA[amazon cloud]]></category>
		<category><![CDATA[aws]]></category>
		<category><![CDATA[cloud]]></category>
		<category><![CDATA[cluster compute]]></category>
		<category><![CDATA[ebs]]></category>

		<guid isPermaLink="false">http://blog.bioteam.net/?p=644</guid>
		<description><![CDATA[Backstory For background and summary writeups of all the various blog posts we have dealing with the new Amazon EC2 &#8220;compute cluster&#8221; cc1.4xlarge instance types please refer to this summary page: http://blog.bioteam.net/2010/07/19/exploring-the-new-aws-compute-cluster-ec2-instances/? Related post We talked about the performance of the boot and ephemeral storage in this post: http://blog.bioteam.net/2010/07/19/local-storage-performance-of-aws-cluster-compute-instances/? This post In this post I&#8217;ve finally collected [...]]]></description>
			<content:encoded><![CDATA[<p><strong>Backstory</strong></p>
<p>For background and summary writeups of all the various blog posts we have dealing with the new Amazon EC2 &#8220;compute cluster&#8221; cc1.4xlarge instance types please refer to this summary page: <a href="http://blog.bioteam.net/2010/07/19/exploring-the-new-aws-compute-cluster-ec2-instances/">http://blog.bioteam.net/2010/07/19/exploring-the-new-aws-compute-cluster-ec2-instances/?</a></p>
<p><strong>Related post</strong></p>
<p>We talked about the performance of the boot and ephemeral storage in this post: <a href="http://blog.bioteam.net/2010/07/19/local-storage-performance-of-aws-cluster-compute-instances/">http://blog.bioteam.net/2010/07/19/local-storage-performance-of-aws-cluster-compute-instances/?</a></p>
<p><strong>This post</strong></p>
<p>In this post I&#8217;ve finally collected enough data to cover repeated bonnie++ benchmark tests against all the main types of block storage available to the new Amazon cc1.4xlarge &#8220;compute cluster&#8221; instance types:</p>
<ul>
<li>Local boot disk performance</li>
<li>Performance of a single ephemeral disk</li>
<li>Performance of the two available ephemeral? disks when striped together with software RAID0</li>
<li>Performance of a single EBS volumes attached to the instance</li>
<li>Performance of 4 EBS volumes striped with RAID0 and attached to the instance</li>
<li>Performance of 8 EBS volumes striped with RAID0 and attached to the instance?</li>
</ul>
<p><strong>Lessons Learned</strong></p>
<p><strong>1. Don&#8217;t use the boot disk for anything other than booting the operating system</strong>. As the results show, the performance of the <em>(no PV driver support</em>) cc1.4xlarge boot disk <strong><em>is the slowest of all possible block storage options available</em></strong>. Really slow. Not worth using for anything other than OS stuff. This also includes <a href="http://blog.bioteam.net/2010/07/14/how-to-resize-an-amazon-ec2-ami-when-boot-disk-is-on-ebs/">not bothering to mess with the size of the available volume</a>.</p>
<p><strong>2. The instance ephemeral storage volumes are fast and should not be ignored</strong>. Every cc1.4xlarge EC2 instance comes with a pair of ~840GB ephemeral disk volumes. In living by our own &#8220;<em>don&#8217;t trust anything to non-persistant storage!</em>&#8221; best practice we are guilty of ignoring these drives in situations where they could have been of significant benefit. This will change. The performance of a single ephemeral volume beats the performance we see out of a single EBS volume. A striped pair of ephemeral volumes performs even better and stacks up well even to multiple EBS volumes striped together. The RAID0 pairing of the two ephemeral drives seems to consistently outperform even 8-drive EBS RAID0 volumes when you look at the bonniee++ results for random and sequential file creation and deletion tests. This has <strong><em>major implications</em></strong> for HPC and scientific pipeline processing on the cloud. In particular I can easily envision using the ephemeral drives to build a shared parallel scratch filesystem (<em>think PVFS or GlusterFS</em>) in cluster configurations. This would give you a nice shared scratch storage pool. Even in simpler cluster setups it looks like it would be a win to stage data into the ephemeral storage so it can be used as the target drive for  scientific processing (where the input data is not unique and has backup copies elsewhere). We can run the IO heavy analysis against the fast ephemeral storage and send our result data into S3 buckets or a proper EBS volume for downstream handling.</p>
<p>3. <strong>Striping EBS volumes into software RAID0 sets is a valid practice</strong>. We clearly see performance gains when using more than one EBS volume, the performance gain is significant enough to justify the hassles involved in backing up and protecting EBS-resident sofware RAID sets. We need to do more work (and really need to test 2-volume EBS stripes) but it is clear that there is a measurable performance gain to be had. Not sure if we&#8217;d use 8-disk RAID0 sets for production work but looking at 2-disk and 4-disk methods is something that we will be looking seriously at.</p>
<p>Obviously there is much more to be drawn from the data but benchmarking is hard (and controversial) in regular settings let alone trying to get repeatable and consistent results out of a virtualized multi-tennant cloud framework. For now I&#8217;d prefer to stick to broad general conclusions and &#8220;lessons learned&#8221; rather than trying to divine highly specific things out of the raw data.</p>
<p> </p>
<p><strong>Test Results</strong></p>
<p>As usual, you can find all of the raw data in <a href="https://spreadsheets.google.com/ccc?key=0AsrRXBRzWSxSdDdTZG9rZXRHUnQyU0sxak9aaGpJUlE&amp;hl=en&amp;authkey=CJmVloIK">this google spreadsheet</a>. We did not finesse the data at all, the only data munging we did was to run tests repeatedly and then average out the results in order to arrive at the numbers used in the graphs.</p>
<p>Here are the numerical numbers behind the graphs, click on the image for the full-size version:</p>
<p><a href="http://blog.bioteam.net/wp-content/uploads/2010/07/cc1.4xlarge-summary-001.png"><img style="display: block; margin-left: auto; margin-right: auto;" title="cc1.4xlarge-summary-001.png" src="http://blog.bioteam.net/wp-content/uploads/2010/07/cc1.4xlarge-summary-001.png" border="0" alt="cc1.4xlarge-summary-001.png" width="620" /></a></p>
<p>And here are the graphs. We&#8217;ve broken out the graphs to represent the results measured in &#8220;K/sec&#8221; versus just &#8221; /sec&#8221;.</p>
<p> </p>
<p><strong>Read, Write &amp; Rewrite results for all cc1.x4large storage types (click on image for full size):</strong></p>
<p><a href="http://blog.bioteam.net/wp-content/uploads/2010/07/cc1.4xlarge-summary-002.png"><img style="display: block; margin-left: auto; margin-right: auto;" title="cc1.4xlarge-summary-002.png" src="http://blog.bioteam.net/wp-content/uploads/2010/07/cc1.4xlarge-summary-002.png" border="0" alt="cc1.4xlarge-summary-002.png" width="620" /></a></p>
<p> </p>
<p><strong>Sequential create, delete &amp; seek results for all cc1.x4large storage types (click on image for full size):</strong></p>
<p> </p>
<p><a href="http://blog.bioteam.net/wp-content/uploads/2010/07/cc1.4xlarge-summary-004.png"><img style="display: block; margin-left: auto; margin-right: auto;" title="cc1.4xlarge-summary-004.png" src="http://blog.bioteam.net/wp-content/uploads/2010/07/cc1.4xlarge-summary-004.png" border="0" alt="cc1.4xlarge-summary-004.png" width="620" /></a></p>
<p> </p>
<p> </p>
<img src="http://blog.bioteam.net/?ak_action=api_record_view&id=644&type=feed" alt="" />]]></content:encoded>
			<wfw:commentRss>http://blog.bioteam.net/2010/07/20/boot-ephemeral-ebs-storage-performance-on-amazon-cc1-4xlarge-instance-types/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Exploring the new AWS Compute Cluster EC2 Instances</title>
		<link>http://blog.bioteam.net/2010/07/19/exploring-the-new-aws-compute-cluster-ec2-instances/</link>
		<comments>http://blog.bioteam.net/2010/07/19/exploring-the-new-aws-compute-cluster-ec2-instances/#comments</comments>
		<pubDate>Mon, 19 Jul 2010 20:11:42 +0000</pubDate>
		<dc:creator>chrisdag</dc:creator>
				<category><![CDATA[Featured]]></category>
		<category><![CDATA[amazon]]></category>
		<category><![CDATA[amazon cloud]]></category>
		<category><![CDATA[aws]]></category>
		<category><![CDATA[cc1.4xlarge]]></category>
		<category><![CDATA[ec2]]></category>
		<category><![CDATA[roundup]]></category>

		<guid isPermaLink="false">http://blog.bioteam.net/?p=632</guid>
		<description><![CDATA[Note: Depending on how you found this post, it might be helpful to understand our own personal &#38; professional biases. We are bioinformatics and HPC types specializing in life sciences, not people trying to build the next twitter or facebook. What we care about when it comes to AWS performance may not be what YOU [...]]]></description>
			<content:encoded><![CDATA[<p><strong>Note: </strong>Depending on how you found this post, it might be helpful to understand our own personal &amp; professional biases. We are bioinformatics and HPC types specializing in life sciences, not people trying to build the next twitter or facebook. What we care about when it comes to AWS performance may not be what YOU care about. In particular there is a ton of internet information concentrating on methods for speeding up random IO access patterns on AWS. In our work, however, we seem to be more bound by the speed of long sequential reads (and sometimes writes). Parallel and serial scientific/HPC computing is different from building giant websites or databases.</p>
<p>In our work with Amazon Web Services we try to spend as much time as we can &#8220;kicking the tires&#8221; so we become better at building stuff that we and our clients can actually use. We also try to share our information as much as possible in the spirit of scientific collaboration &amp; honest exchange.</p>
<p> </p>
<p>This is a &#8217;roundup&#8217; or summary blog post where we&#8217;ll list out blog post or articles that discuss the new Amazon cc1.4xlarge Compute Cluster instances.</p>
<ul>
<li><a href="http://blog.bioteam.net/2010/07/13/grid-engine-on-the-new-amazon-compute-cluster-instances/">Grid Engine on the new Amazon Compute Cluster Instances</a></li>
<li><a href="http://blog.bioteam.net/2010/07/14/how-to-resize-an-amazon-ec2-ami-when-boot-disk-is-on-ebs/">How to resize an EC2 AMI when the boot disk is on EBS</a></li>
<li><a href="http://blog.bioteam.net/2010/07/13/preliminary-ebs-performance-tests-on-amazon-compute-cluster-cc1-4xlarge-instance-types/">Preliminary EBS performance on Amazon Compute Cluster cc1.4xlarge instance types</a></li>
<li><a href="http://blog.bioteam.net/2010/07/19/local-storage-performance-of-aws-cluster-compute-instances/">Local storage performance of AWS cluster compute instances</a></li>
<li><a href="http://blog.bioteam.net/2010/07/20/boot-ephemeral-ebs-storage-performance-on-amazon-cc1-4xlarge-instance-types/">Combined summary of local, ephemeral &amp; EBS storage on cc1.4xlarge instance types</a></li>
<li><a href="http://blog.bioteam.net/2010/07/29/playing-with-nfs-glusterfs-on-amazon-cc1-4xlarge-ec2-instance-types/">NFS &amp; GlusterFS network filesharing on the cc1.4xlarge instance types</a></li>
</ul>
<img src="http://blog.bioteam.net/?ak_action=api_record_view&id=632&type=feed" alt="" />]]></content:encoded>
			<wfw:commentRss>http://blog.bioteam.net/2010/07/19/exploring-the-new-aws-compute-cluster-ec2-instances/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>BioITWorld Cloud Workshop &#8211; Talk Slides</title>
		<link>http://blog.bioteam.net/2010/04/20/bioitworld-cloud-workshop-talk-slides/</link>
		<comments>http://blog.bioteam.net/2010/04/20/bioitworld-cloud-workshop-talk-slides/#comments</comments>
		<pubDate>Tue, 20 Apr 2010 13:46:53 +0000</pubDate>
		<dc:creator>chrisdag</dc:creator>
				<category><![CDATA[Employee Posts]]></category>
		<category><![CDATA[Presentations]]></category>
		<category><![CDATA[amazon aws]]></category>
		<category><![CDATA[amazon cloud]]></category>
		<category><![CDATA[cloud]]></category>
		<category><![CDATA[cloud computing]]></category>

		<guid isPermaLink="false">http://blog.bioteam.net/?p=490</guid>
		<description><![CDATA[Click on the graphic down below to download the PDF talk slides. This was a short 30 minute talk starting off the 2010 BioITWorld Cloud Computing Workshop. Content-wise it&#8217;s very similar to my NHGRI meeting slides except I added more introductory material about cloud segments and definitions.]]></description>
			<content:encoded><![CDATA[<p>Click on the graphic down below to download the PDF talk slides. This was a short 30 minute talk starting off the 2010 BioITWorld Cloud Computing Workshop.</p>
<p>Content-wise it&#8217;s very similar to my NHGRI meeting slides except I added more introductory material about cloud segments and definitions.</p>
<p><a href="http://blog.bioteam.net/wp-content/uploads/2010/04/cdag-BioIT-Cloud-v1.pdf"><img src="http://blog.bioteam.net/wp-content/uploads/2010/04/cloudtalk-icon.png" border="0" alt="cloudtalk-icon.png" width="325" height="254" align="left" /></a></p>
<img src="http://blog.bioteam.net/?ak_action=api_record_view&id=490&type=feed" alt="" />]]></content:encoded>
			<wfw:commentRss>http://blog.bioteam.net/2010/04/20/bioitworld-cloud-workshop-talk-slides/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Great talk on hadoop, bioinformatics and the cloud</title>
		<link>http://blog.bioteam.net/2009/12/02/great-talk-on-hadoop-bioinformatics-and-the-cloud/</link>
		<comments>http://blog.bioteam.net/2009/12/02/great-talk-on-hadoop-bioinformatics-and-the-cloud/#comments</comments>
		<pubDate>Wed, 02 Dec 2009 19:43:38 +0000</pubDate>
		<dc:creator>chrisdag</dc:creator>
				<category><![CDATA[Employee Posts]]></category>
		<category><![CDATA[amazon aws]]></category>
		<category><![CDATA[amazon cloud]]></category>
		<category><![CDATA[aws]]></category>
		<category><![CDATA[cloud]]></category>
		<category><![CDATA[hadoop]]></category>

		<guid isPermaLink="false">http://blog.bioteam.net/?p=351</guid>
		<description><![CDATA[Just passing along the URL to a pretty informative video showing Deepak Singh&#8217;s talk at a recent NYC area Hadoop World conference. The presentation contains quite a bit of bioinformatics/IT background that some readers here may already know but the Hadoop and large-scale dataset processing content is really quite interesting. If you work at the [...]]]></description>
			<content:encoded><![CDATA[<p>Just passing along the URL to a pretty informative video showing Deepak Singh&#8217;s talk at a recent NYC area Hadoop World conference. The presentation contains quite a bit of bioinformatics/IT background that some readers here may already know but the Hadoop and large-scale dataset processing content is really quite interesting. If you work at the intersection of informatics and IT this video also can help explain to friends and family just what the heck you do for a living!</p>
<p><object classid="clsid:d27cdb6e-ae6d-11cf-96b8-444553540000" width="400" height="300" codebase="http://download.macromedia.com/pub/shockwave/cabs/flash/swflash.cab#version=6,0,40,0"><param name="allowfullscreen" value="true" /><param name="allowscriptaccess" value="always" /><param name="src" value="http://vimeo.com/moogaloop.swf?clip_id=7351342&amp;server=vimeo.com&amp;show_title=1&amp;show_byline=1&amp;show_portrait=0&amp;color=&amp;fullscreen=1" /><embed type="application/x-shockwave-flash" width="400" height="300" src="http://vimeo.com/moogaloop.swf?clip_id=7351342&amp;server=vimeo.com&amp;show_title=1&amp;show_byline=1&amp;show_portrait=0&amp;color=&amp;fullscreen=1" allowscriptaccess="always" allowfullscreen="true"></embed></object></p>
<p><a href="http://vimeo.com/7351342">Hadoop for Bioinfomatics &#8211; Deepak Singh</a> from <a href="http://vimeo.com/cloudera">Cloudera</a> on <a href="http://vimeo.com">Vimeo</a>.</p>
<img src="http://blog.bioteam.net/?ak_action=api_record_view&id=351&type=feed" alt="" />]]></content:encoded>
			<wfw:commentRss>http://blog.bioteam.net/2009/12/02/great-talk-on-hadoop-bioinformatics-and-the-cloud/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Slides from the Amazon AWS 2009 StartUp Event</title>
		<link>http://blog.bioteam.net/2009/06/01/slides-from-the-amazon-aws-2009-startup-event/</link>
		<comments>http://blog.bioteam.net/2009/06/01/slides-from-the-amazon-aws-2009-startup-event/#comments</comments>
		<pubDate>Mon, 01 Jun 2009 20:38:06 +0000</pubDate>
		<dc:creator>chrisdag</dc:creator>
				<category><![CDATA[Employee Posts]]></category>
		<category><![CDATA[Presentations]]></category>
		<category><![CDATA[amazon cloud]]></category>
		<category><![CDATA[cloud computing]]></category>
		<category><![CDATA[ec2]]></category>

		<guid isPermaLink="false">http://blog.bioteam.net/?p=213</guid>
		<description><![CDATA[Talk slides from Chris Dagdigian&#8217;s presentation &#8220;Utility Computing for Cynics&#8221; at the 2009 NYC stop of the Amazon AWS StartUp Tour can be downloaded here: 2009-NYC-AWS-BioTeam A few pictures from the event are posted below. It was a great event, standing room only for all of the presentations.]]></description>
			<content:encoded><![CDATA[<p>Talk slides from Chris Dagdigian&#8217;s presentation &#8220;Utility Computing for Cynics&#8221; at the 2009 NYC stop of the Amazon AWS StartUp Tour can be downloaded here:</p>
<p><a href="http://blog.bioteam.net/wp-content/uploads/2009/06/2009-nyc-aws_bioteam_v2.pdf">2009-NYC-AWS-BioTeam</a></p>
<p>A few pictures from the event are posted below. It was a great event, standing room only for all of the presentations.</p>
<p><a title="View 'Amazon NYC StartUp Event' on Flickr.com" href="http://www.flickr.com/photos/8558461@N08/3579064222"> </a></p>
<div style="text-align: center;"><a title="View 'Amazon NYC StartUp Event' on Flickr.com" href="http://www.flickr.com/photos/8558461@N08/3579064222"><img src="http://farm3.static.flickr.com/2456/3579064222_5bb89d75cd.jpg" border="0" alt="Amazon NYC StartUp Event" width="500" height="333" /></a></div>
<p><a title="View 'Amazon NYC StartUp Event' on Flickr.com" href="http://www.flickr.com/photos/8558461@N08/3579055512"> </a></p>
<div style="text-align: center;"><a title="View 'Amazon NYC StartUp Event' on Flickr.com" href="http://www.flickr.com/photos/8558461@N08/3579055512"><img src="http://farm4.static.flickr.com/3656/3579055512_c935eb3319.jpg" border="0" alt="Amazon NYC StartUp Event" width="500" height="249" /></a></div>
<p><a title="View 'Amazon AWS NYC event' on Flickr.com" href="http://www.flickr.com/photos/8558461@N08/3578223875"> </a></p>
<div style="text-align: center;"><a title="View 'Amazon AWS NYC event' on Flickr.com" href="http://www.flickr.com/photos/8558461@N08/3578223875"><img src="http://farm4.static.flickr.com/3400/3578223875_a87573638f.jpg" border="0" alt="Amazon AWS NYC event" width="500" height="333" /></a></div>
<p><a title="View 'AWS event: view from the podium' on Flickr.com" href="http://www.flickr.com/photos/8558461@N08/3578215789"> </a></p>
<div style="text-align: center;"><a title="View 'AWS event: view from the podium' on Flickr.com" href="http://www.flickr.com/photos/8558461@N08/3578215789"><img src="http://farm4.static.flickr.com/3316/3578215789_f09e5a6996.jpg" border="0" alt="AWS event: view from the podium" width="500" height="333" /></a></div>
<img src="http://blog.bioteam.net/?ak_action=api_record_view&id=213&type=feed" alt="" />]]></content:encoded>
			<wfw:commentRss>http://blog.bioteam.net/2009/06/01/slides-from-the-amazon-aws-2009-startup-event/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Short talk at Amazon AWS event in NYC on May 28th</title>
		<link>http://blog.bioteam.net/2009/05/26/short-talk-at-amazon-aws-event-in-nyc-on-may-28th/</link>
		<comments>http://blog.bioteam.net/2009/05/26/short-talk-at-amazon-aws-event-in-nyc-on-may-28th/#comments</comments>
		<pubDate>Tue, 26 May 2009 14:32:27 +0000</pubDate>
		<dc:creator>chrisdag</dc:creator>
				<category><![CDATA[Employee Posts]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[Presentations]]></category>
		<category><![CDATA[amazon cloud]]></category>
		<category><![CDATA[cloud computing]]></category>
		<category><![CDATA[ec2]]></category>

		<guid isPermaLink="false">http://blog.bioteam.net/?p=199</guid>
		<description><![CDATA[BioTeam consultant Chris Dagdigian will be giving a short (~10 minutes or so) presentation at the NYC stop for the 2009 Amazon AWS Start-up Tour. The tour details and dates are here: http://aws.amazon.com/startupproject/ The agenda for the May 28th event is: 1:00-2:00 Doors Open 2:00-2:10 Opening Statements 2:10-2:50 AWS Overview – Adam Selipsky, Vice President, [...]]]></description>
			<content:encoded><![CDATA[<p>BioTeam consultant Chris Dagdigian will be giving a short (~10 minutes or so) presentation at the NYC stop for the 2009 Amazon AWS Start-up Tour.</p>
<p>The tour details and dates are here:</p>
<p><a href="http://aws.amazon.com/startupproject/" mce_href="http://aws.amazon.com/startupproject/">http://aws.amazon.com/startupproject/</a><br mce_bogus="1"></p>
<p>The agenda for the May 28th event is:<br />
<code><br />
1:00-2:00    Doors Open<br />
2:00-2:10    Opening Statements<br />
2:10-2:50    AWS Overview – Adam Selipsky, Vice President, Amazon Web Services<br />
2:50-3:10    Break<br />
3:10-3:50    Customer Presentations:<br />
Sam Lessin, CEO, drop.io<br />
Dan Gill, VP Business Development, Gotuit<br />
Chris Dagdigian, Founding Partner, BioTeam<br />
Brian Adams, Co-Founder and CTO, Admeld<br />
3:50-4:10    Customer Q&amp;A<br />
4:10-4:50    Architecting Cloud Apps – Matt Tavis, Solutions Architect, Amazon Web Services<br />
4:50-5:00    Closing Statements<br />
5:00-7:00    Networking Reception + Solutions Corner Expo</code></p>
<p>FreedomOSS, RightScale, SOASTA, Pentaho and Kaavo will be the vendors in the Solutions corner.</p>
<img src="http://blog.bioteam.net/?ak_action=api_record_view&id=199&type=feed" alt="" />]]></content:encoded>
			<wfw:commentRss>http://blog.bioteam.net/2009/05/26/short-talk-at-amazon-aws-event-in-nyc-on-may-28th/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Sunny Skies for Compute Cloud</title>
		<link>http://blog.bioteam.net/2007/11/13/sunny-skies-for-compute-cloud/</link>
		<comments>http://blog.bioteam.net/2007/11/13/sunny-skies-for-compute-cloud/#comments</comments>
		<pubDate>Tue, 13 Nov 2007 16:34:39 +0000</pubDate>
		<dc:creator>blogadmin</dc:creator>
				<category><![CDATA[Publications]]></category>
		<category><![CDATA[amazon cloud]]></category>
		<category><![CDATA[cloud computing]]></category>
		<category><![CDATA[Inside-the-box]]></category>

		<guid isPermaLink="false">http://blog.bioteam.net/2007/11/13/sunny-skies-for-compute-cloud/</guid>
		<description><![CDATA[Buying CPUs by the hour is back. Marketing apparently decided that &#8216;On Demand&#8217; sounds pushy and &#8216;Utility&#8217; and &#8216;Grid&#8217; are too rigid. This time it will be called a &#8216;Cloud&#8217;, and it will be running inside Amazon.com.]]></description>
			<content:encoded><![CDATA[<p>Bio-IT World Magazine &ldquo;Inside the Box&rdquo; Column &#8211; November 2007</p>
<p>Summary:</p>
<blockquote><p>Buying CPUs by the hour is back. Marketing apparently decided that &lsquo;On Demand&rsquo; sounds pushy and &lsquo;Utility&rsquo; and &lsquo;Grid&rsquo; are too rigid. This time it will be called a &lsquo;Cloud&rsquo;, and it will be running inside Amazon.com.</p></blockquote>
<p>Article link:</p>
<blockquote><p><a href="http://www.bio-itworld.com/issues/2007/nov/inside-the-box-buying-cpus/">http://www.bio-itworld.com/issues/2007/nov/inside-the-box-buying-cpus/</a></p></blockquote>
<img src="http://blog.bioteam.net/?ak_action=api_record_view&id=32&type=feed" alt="" />]]></content:encoded>
			<wfw:commentRss>http://blog.bioteam.net/2007/11/13/sunny-skies-for-compute-cloud/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
