<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>John Bullard &#187; Hadoop</title>
	<atom:link href="http://johnbullard.net/tag/hadoop/feed/" rel="self" type="application/rss+xml" />
	<link>http://johnbullard.net</link>
	<description>Loose Thinking, Tight Analysis</description>
	<lastBuildDate>Tue, 12 Jan 2010 14:32:46 +0000</lastBuildDate>
	<generator>http://wordpress.org/?v=2.8.6</generator>
	<language>en</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>AWS and Hadoop</title>
		<link>http://johnbullard.net/2009/03/23/aws-and-hadoop/</link>
		<comments>http://johnbullard.net/2009/03/23/aws-and-hadoop/#comments</comments>
		<pubDate>Mon, 23 Mar 2009 18:40:47 +0000</pubDate>
		<dc:creator>John</dc:creator>
				<category><![CDATA[Cloud]]></category>
		<category><![CDATA[Amazon]]></category>
		<category><![CDATA[Hadoop]]></category>

		<guid isPermaLink="false">http://johnbullard.net/?p=286</guid>
		<description><![CDATA[The New York Times used 100 Amazon EC2 instances and a Hadoop application to process 4TB of raw image TIFF data (stored in S3) into 1.1 million finished PDFs in the space of 24 hours at a computation cost of about $240 (not including bandwidth).
NY Times: Self-service, Prorated Super Computing Fun!

The project was so easy, [...]]]></description>
			<content:encoded><![CDATA[<blockquote><p><a title="The New York Times" href="http://en.wikipedia.org/wiki/The_New_York_Times">The New York Times</a> used 100 Amazon EC2 instances and a Hadoop application to process 4TB of raw image <a class="mw-redirect" title="TIFF" href="http://en.wikipedia.org/wiki/TIFF">TIFF</a> data (stored in S3) into 1.1 million finished <a class="mw-redirect" title="PDF" href="http://en.wikipedia.org/wiki/PDF">PDFs</a> in the space of 24 hours at a computation cost of about $240 (not including bandwidth).</p>
<p style="text-align: right;"><a href="http://open.blogs.nytimes.com/2007/11/01/self-service-prorated-super-computing-fun/?scp=1&amp;sq=self%20service%20prorated&amp;st=cse">NY Times: Self-service, Prorated Super Computing Fun!</a></p>
</blockquote>
<p style="text-align: left;">The project was so easy, and so cheap, that the developers ran the process a second time after noticing  a minute error. Just another example of how cloud computing is changing the game.</p>
<p class="fbconnect_share"><fb:share-button class="url" href="http://johnbullard.net/2009/03/23/aws-and-hadoop/" /></p>]]></content:encoded>
			<wfw:commentRss>http://johnbullard.net/2009/03/23/aws-and-hadoop/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>

