<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: A Morning With Danny Hillis</title>
	<atom:link href="http://battellemedia.com/archives/2004/06/a_morning_with_danny_hillis.php/feed" rel="self" type="application/rss+xml" />
	<link>http://battellemedia.com/archives/2004/06/a_morning_with_danny_hillis.php?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=a_morning_with_danny_hillis</link>
	<description>Thoughts on the intersection of search, media, technology, and more.</description>
	<lastBuildDate>Thu, 23 May 2013 12:55:00 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.4.1</generator>
	<item>
		<title>By: oopser</title>
		<link>http://battellemedia.com/archives/2004/06/a_morning_with_danny_hillis.php#comment-26063</link>
		<dc:creator>oopser</dc:creator>
		<pubDate>Tue, 03 Jan 2012 02:01:00 +0000</pubDate>
		<guid isPermaLink="false">http://battellemedia.com/archives/2004/06/a_morning_with_danny_hillis.php#comment-26063</guid>
		<description>Not Bram - it&#039;s Bran</description>
		<content:encoded><![CDATA[<p>Not Bram &#8211; it&#8217;s Bran</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Dave Douglas</title>
		<link>http://battellemedia.com/archives/2004/06/a_morning_with_danny_hillis.php#comment-23872</link>
		<dc:creator>Dave Douglas</dc:creator>
		<pubDate>Mon, 03 Jan 2005 03:32:57 +0000</pubDate>
		<guid isPermaLink="false">http://battellemedia.com/archives/2004/06/a_morning_with_danny_hillis.php#comment-23872</guid>
		<description>&lt;p&gt;There&#039;s another interesting thread along these lines.  Brewster Kahle was one of the designer/architects of the CM-2, and at the end of that project started spending most of his time on search capabilities.  The output of that work was WAIS (early net-heads may remember it), and led to his work at Alexa (sold to Amazon) and now the Internet Archive.  &lt;/p&gt;</description>
		<content:encoded><![CDATA[<p>There&#8217;s another interesting thread along these lines.  Brewster Kahle was one of the designer/architects of the CM-2, and at the end of that project started spending most of his time on search capabilities.  The output of that work was WAIS (early net-heads may remember it), and led to his work at Alexa (sold to Amazon) and now the Internet Archive.  </p>
]]></content:encoded>
	</item>
	<item>
		<title>By: John Battelle</title>
		<link>http://battellemedia.com/archives/2004/06/a_morning_with_danny_hillis.php#comment-23871</link>
		<dc:creator>John Battelle</dc:creator>
		<pubDate>Sun, 20 Jun 2004 03:10:25 +0000</pubDate>
		<guid isPermaLink="false">http://battellemedia.com/archives/2004/06/a_morning_with_danny_hillis.php#comment-23871</guid>
		<description>&lt;p&gt;Kendall - What I meant to infer was that the idea of massively parallel processing - ie having tens of thousands of CPUs working at once - is now defacto. &lt;/p&gt;</description>
		<content:encoded><![CDATA[<p>Kendall &#8211; What I meant to infer was that the idea of massively parallel processing &#8211; ie having tens of thousands of CPUs working at once &#8211; is now defacto. </p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Kendall Willets</title>
		<link>http://battellemedia.com/archives/2004/06/a_morning_with_danny_hillis.php#comment-23870</link>
		<dc:creator>Kendall Willets</dc:creator>
		<pubDate>Sat, 19 Jun 2004 21:59:15 +0000</pubDate>
		<guid isPermaLink="false">http://battellemedia.com/archives/2004/06/a_morning_with_danny_hillis.php#comment-23870</guid>
		<description>&lt;p&gt;Actually the results can be sorted from best to worst on each node and the top few merged into the top 10 list by the requester, to save a lot of data shipping.  &lt;/p&gt;

&lt;p&gt;That&#039;s assuming that no holistic inter-result calculations like HITS are being done.  Localrank is Google&#039;s version of that idea; I think they do the calculation with only highly-ranked pages to save overhead.&lt;/p&gt;

&lt;p&gt;(I&#039;ve been trying to infer this process recently, so bear with my detail-stickling.)&lt;/p&gt;</description>
		<content:encoded><![CDATA[<p>Actually the results can be sorted from best to worst on each node and the top few merged into the top 10 list by the requester, to save a lot of data shipping.  </p>
<p>That&#8217;s assuming that no holistic inter-result calculations like HITS are being done.  Localrank is Google&#8217;s version of that idea; I think they do the calculation with only highly-ranked pages to save overhead.</p>
<p>(I&#8217;ve been trying to infer this process recently, so bear with my detail-stickling.)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Patrick Tufts</title>
		<link>http://battellemedia.com/archives/2004/06/a_morning_with_danny_hillis.php#comment-23869</link>
		<dc:creator>Patrick Tufts</dc:creator>
		<pubDate>Sat, 19 Jun 2004 20:10:39 +0000</pubDate>
		<guid isPermaLink="false">http://battellemedia.com/archives/2004/06/a_morning_with_danny_hillis.php#comment-23869</guid>
		<description>&lt;p&gt;Yes, the CM-2 did have a hypercube network. One could program the CM-2 using a data-parallel model, which is what the *Lisp (starlisp) programming language used. In this model, the CM-2 was treated as a SIMD (single instruction, multiple data) computer. Data was spread across the processors or, more accurately, the memory associated with each processor, and then a single instruction was run by all the processors on its local data.&lt;/p&gt;

&lt;p&gt;Now, not all computation can be done this way. FFTs require lots of comparisons between data, which is best done by processor to processor communication, so there were operators to do local data comparisons and moves, along with system wide computations like &quot;find the maximum value of this variable among all the processors.&quot;&lt;/p&gt;

&lt;p&gt;Search can be cast as a data parallel process with a scatter operation at the beginning (send the query to all processors) and a gather operation at the end (collect the results from all the processors and sort from best to worst).&lt;/p&gt;

&lt;p&gt;--Pat / zippy@cs.brandeis.edu&lt;/p&gt;</description>
		<content:encoded><![CDATA[<p>Yes, the CM-2 did have a hypercube network. One could program the CM-2 using a data-parallel model, which is what the *Lisp (starlisp) programming language used. In this model, the CM-2 was treated as a SIMD (single instruction, multiple data) computer. Data was spread across the processors or, more accurately, the memory associated with each processor, and then a single instruction was run by all the processors on its local data.</p>
<p>Now, not all computation can be done this way. FFTs require lots of comparisons between data, which is best done by processor to processor communication, so there were operators to do local data comparisons and moves, along with system wide computations like &#8220;find the maximum value of this variable among all the processors.&#8221;</p>
<p>Search can be cast as a data parallel process with a scatter operation at the beginning (send the query to all processors) and a gather operation at the end (collect the results from all the processors and sort from best to worst).</p>
<p>&#8211;Pat / <a href="mailto:zippy@cs.brandeis.edu">zippy@cs.brandeis.edu</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Kendall Willets</title>
		<link>http://battellemedia.com/archives/2004/06/a_morning_with_danny_hillis.php#comment-23868</link>
		<dc:creator>Kendall Willets</dc:creator>
		<pubDate>Sat, 19 Jun 2004 19:33:05 +0000</pubDate>
		<guid isPermaLink="false">http://battellemedia.com/archives/2004/06/a_morning_with_danny_hillis.php#comment-23868</guid>
		<description>&lt;p&gt;I was thinking of the original CM, which was very fine-grained parallelism, with a hypercube routing topology between nodes.  Much more systolic than the usual today.  The model at that time was Navier-Stokes, or finite-element analysis, with a fair amount of cross-traffic between pieces.  &lt;/p&gt;

&lt;p&gt;Google is more of a shared-nothing, massively parallel processor - hack the data up into big chunks, put them on separate nodes, and ship the query to every node.  Ethernet is the bus that holds it all together, and the limiting factor - bandwidth per MIP is fairly low, etc.&lt;/p&gt;

&lt;p&gt;IMHO this is about the bottom of the scale for parallelism - more distributed processing.&lt;/p&gt;</description>
		<content:encoded><![CDATA[<p>I was thinking of the original CM, which was very fine-grained parallelism, with a hypercube routing topology between nodes.  Much more systolic than the usual today.  The model at that time was Navier-Stokes, or finite-element analysis, with a fair amount of cross-traffic between pieces.  </p>
<p>Google is more of a shared-nothing, massively parallel processor &#8211; hack the data up into big chunks, put them on separate nodes, and ship the query to every node.  Ethernet is the bus that holds it all together, and the limiting factor &#8211; bandwidth per MIP is fairly low, etc.</p>
<p>IMHO this is about the bottom of the scale for parallelism &#8211; more distributed processing.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Patrick Tufts</title>
		<link>http://battellemedia.com/archives/2004/06/a_morning_with_danny_hillis.php#comment-23867</link>
		<dc:creator>Patrick Tufts</dc:creator>
		<pubDate>Sat, 19 Jun 2004 17:07:02 +0000</pubDate>
		<guid isPermaLink="false">http://battellemedia.com/archives/2004/06/a_morning_with_danny_hillis.php#comment-23867</guid>
		<description>&lt;p&gt;Google runs what I suspect is a data-parallel architecture - thousands of nodes crawling and searching largely independently of one another. Thinking Machines used a data-parallel architecture in its CM-2 design.&lt;/p&gt;

&lt;p&gt;Thinking Machines also made one of the first distributed search engines, WAIS. The index was distributed across multiple processors.&lt;/p&gt;

&lt;p&gt;Full disclosure: I&#039;m an Applied Minds employee and was affiliated with Thinking Machines in the mid-90s.&lt;/p&gt;

&lt;p&gt;--Pat / zippy@cs.brandeis.edu&lt;/p&gt;</description>
		<content:encoded><![CDATA[<p>Google runs what I suspect is a data-parallel architecture &#8211; thousands of nodes crawling and searching largely independently of one another. Thinking Machines used a data-parallel architecture in its CM-2 design.</p>
<p>Thinking Machines also made one of the first distributed search engines, WAIS. The index was distributed across multiple processors.</p>
<p>Full disclosure: I&#8217;m an Applied Minds employee and was affiliated with Thinking Machines in the mid-90s.</p>
<p>&#8211;Pat / <a href="mailto:zippy@cs.brandeis.edu">zippy@cs.brandeis.edu</a></p>
]]></content:encoded>
	</item>
	<item>
		<title>By: phil jones</title>
		<link>http://battellemedia.com/archives/2004/06/a_morning_with_danny_hillis.php#comment-23866</link>
		<dc:creator>phil jones</dc:creator>
		<pubDate>Sat, 19 Jun 2004 15:40:12 +0000</pubDate>
		<guid isPermaLink="false">http://battellemedia.com/archives/2004/06/a_morning_with_danny_hillis.php#comment-23866</guid>
		<description>&lt;p&gt;Hmmm. I wonder how this compares to, say,  Interval Research? (http://www.wired.com/wired/archive/7.12/interval.html)&lt;/p&gt;</description>
		<content:encoded><![CDATA[<p>Hmmm. I wonder how this compares to, say,  Interval Research? (<a href="http://www.wired.com/wired/archive/7.12/interval.html" rel="nofollow">http://www.wired.com/wired/archive/7.12/interval.html</a>)</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Kendall Willets</title>
		<link>http://battellemedia.com/archives/2004/06/a_morning_with_danny_hillis.php#comment-23865</link>
		<dc:creator>Kendall Willets</dc:creator>
		<pubDate>Fri, 18 Jun 2004 23:51:23 +0000</pubDate>
		<guid isPermaLink="false">http://battellemedia.com/archives/2004/06/a_morning_with_danny_hillis.php#comment-23865</guid>
		<description>&lt;p&gt; I actually bought and read Hillis&#039;s book on the Connection Machine, but I can&#039;t really see any resemblance to Google.  Can you elaborate?&lt;br /&gt;
&lt;/p&gt;</description>
		<content:encoded><![CDATA[<p> I actually bought and read Hillis&#8217;s book on the Connection Machine, but I can&#8217;t really see any resemblance to Google.  Can you elaborate?</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Mike</title>
		<link>http://battellemedia.com/archives/2004/06/a_morning_with_danny_hillis.php#comment-23864</link>
		<dc:creator>Mike</dc:creator>
		<pubDate>Fri, 18 Jun 2004 21:29:19 +0000</pubDate>
		<guid isPermaLink="false">http://battellemedia.com/archives/2004/06/a_morning_with_danny_hillis.php#comment-23864</guid>
		<description>&lt;p&gt;That&#039;s fascinating. Does anyone remember the title sequence of the old TV show, &quot;Get Smart&quot;, in which Don Adams enters the secret headquarters of CONTROL through the phone booth at the end of a corridor? &lt;/p&gt;</description>
		<content:encoded><![CDATA[<p>That&#8217;s fascinating. Does anyone remember the title sequence of the old TV show, &#8220;Get Smart&#8221;, in which Don Adams enters the secret headquarters of CONTROL through the phone booth at the end of a corridor? </p>
]]></content:encoded>
	</item>
</channel>
</rss>
