<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
		>
<channel>
	<title>Comments on: Enterprise Search (Yaawwwwnnn)</title>
	<atom:link href="http://battellemedia.com/archives/2004/04/enterprise_search_yaawwwwnnn.php/feed" rel="self" type="application/rss+xml" />
	<link>http://battellemedia.com/archives/2004/04/enterprise_search_yaawwwwnnn.php?utm_source=rss&#038;utm_medium=rss&#038;utm_campaign=enterprise_search_yaawwwwnnn</link>
	<description>Thoughts on the intersection of search, media, technology, and more.</description>
	<lastBuildDate>Thu, 23 May 2013 12:55:00 +0000</lastBuildDate>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>http://wordpress.org/?v=3.4.1</generator>
	<item>
		<title>By: Andrew</title>
		<link>http://battellemedia.com/archives/2004/04/enterprise_search_yaawwwwnnn.php#comment-24277</link>
		<dc:creator>Andrew</dc:creator>
		<pubDate>Tue, 04 May 2004 07:15:03 +0000</pubDate>
		<guid isPermaLink="false">http://battellemedia.com/archives/2004/04/enterprise_search_yaawwwwnnn.php#comment-24277</guid>
		<description>&lt;p&gt;Whoa, Macweek! There&#039;s a flash from the past. It was one of my favorite magazines as a high school kid, along with Macintosh Today (which became defunct earlier than macweek).&lt;/p&gt;

&lt;p&gt;I remember lying on those qualification forms to qualify for the subscriptions.&lt;/p&gt;

&lt;p&gt;I&#039;m still waiting for the holographic storage promised by Macweek :)&lt;/p&gt;</description>
		<content:encoded><![CDATA[<p>Whoa, Macweek! There&#8217;s a flash from the past. It was one of my favorite magazines as a high school kid, along with Macintosh Today (which became defunct earlier than macweek).</p>
<p>I remember lying on those qualification forms to qualify for the subscriptions.</p>
<p>I&#8217;m still waiting for the holographic storage promised by Macweek <img src='http://battellemedia.com/wp-includes/images/smilies/icon_smile.gif' alt=':)' class='wp-smiley' /> </p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Avi Rappoport</title>
		<link>http://battellemedia.com/archives/2004/04/enterprise_search_yaawwwwnnn.php#comment-24276</link>
		<dc:creator>Avi Rappoport</dc:creator>
		<pubDate>Wed, 28 Apr 2004 18:37:25 +0000</pubDate>
		<guid isPermaLink="false">http://battellemedia.com/archives/2004/04/enterprise_search_yaawwwwnnn.php#comment-24276</guid>
		<description>&lt;p&gt;Enterprise search, both on public sites and intranets, is a fascinating problem.    &lt;/p&gt;

&lt;p&gt;On public sites, the new big thing is faceted metadata search and browse, exposing database or metadata attributes in a usable way.  Prof. Marti Hearst of UC Berkeley has done the research to show how it is successful in both Information Retrieval and User Experience -- &lt;a href=&quot;http://flamenco.berkeley.edu.&quot; rel=&quot;nofollow&quot;&gt;http://flamenco.berkeley.edu.&lt;/a&gt;&lt;/p&gt;

&lt;p&gt;On intranets, there is so little of the traditional research and so many simple navigational questions  --, timesheets, software downloads, holiday schedules, etc.  Installing a giant KM system is overkill -- save that for specific areas of research.  On the other hand, because there&#039;s so much more known about the internal, you want to have control over the indexing and relevance ranking, it shouldn&#039;t just be a black box.  Link and authority ranking doesn&#039;t work as well as the WWW, because there is so little meaningful hypertext on intranets -- most links are just navigation, not recommendations. &lt;/p&gt;

&lt;p&gt;You (and Danny Sullivan) say &quot;yawn&quot;, but I&#039;m a librarian and I think enterprise search is incredibly interesting.&lt;/p&gt;</description>
		<content:encoded><![CDATA[<p>Enterprise search, both on public sites and intranets, is a fascinating problem.    </p>
<p>On public sites, the new big thing is faceted metadata search and browse, exposing database or metadata attributes in a usable way.  Prof. Marti Hearst of UC Berkeley has done the research to show how it is successful in both Information Retrieval and User Experience &#8212; <a href="http://flamenco.berkeley.edu." rel="nofollow"></a><a href="http://flamenco.berkeley.edu" rel="nofollow">http://flamenco.berkeley.edu</a>.</p>
<p>On intranets, there is so little of the traditional research and so many simple navigational questions  &#8211;, timesheets, software downloads, holiday schedules, etc.  Installing a giant KM system is overkill &#8212; save that for specific areas of research.  On the other hand, because there&#8217;s so much more known about the internal, you want to have control over the indexing and relevance ranking, it shouldn&#8217;t just be a black box.  Link and authority ranking doesn&#8217;t work as well as the WWW, because there is so little meaningful hypertext on intranets &#8212; most links are just navigation, not recommendations. </p>
<p>You (and Danny Sullivan) say &#8220;yawn&#8221;, but I&#8217;m a librarian and I think enterprise search is incredibly interesting.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: BillG</title>
		<link>http://battellemedia.com/archives/2004/04/enterprise_search_yaawwwwnnn.php#comment-24275</link>
		<dc:creator>BillG</dc:creator>
		<pubDate>Wed, 28 Apr 2004 17:57:42 +0000</pubDate>
		<guid isPermaLink="false">http://battellemedia.com/archives/2004/04/enterprise_search_yaawwwwnnn.php#comment-24275</guid>
		<description>&lt;p&gt;Kendall</description>
		<content:encoded><![CDATA[<p>Kendall</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Kendall Willets</title>
		<link>http://battellemedia.com/archives/2004/04/enterprise_search_yaawwwwnnn.php#comment-24274</link>
		<dc:creator>Kendall Willets</dc:creator>
		<pubDate>Tue, 27 Apr 2004 20:52:33 +0000</pubDate>
		<guid isPermaLink="false">http://battellemedia.com/archives/2004/04/enterprise_search_yaawwwwnnn.php#comment-24274</guid>
		<description>&lt;p&gt;There is a signicant high-end market for enterprise &quot;search&quot;, if you call it document management, classification, textual analysis, database, or some other technical term.  For instance, Autonomy, Texis, etc. &lt;/p&gt;

&lt;p&gt;I suppose this validates your points:  higher-quality content and metadata beget higher-level tools.  &lt;/p&gt;</description>
		<content:encoded><![CDATA[<p>There is a signicant high-end market for enterprise &#8220;search&#8221;, if you call it document management, classification, textual analysis, database, or some other technical term.  For instance, Autonomy, Texis, etc. </p>
<p>I suppose this validates your points:  higher-quality content and metadata beget higher-level tools.  </p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Nick Patience</title>
		<link>http://battellemedia.com/archives/2004/04/enterprise_search_yaawwwwnnn.php#comment-24273</link>
		<dc:creator>Nick Patience</dc:creator>
		<pubDate>Tue, 27 Apr 2004 19:58:28 +0000</pubDate>
		<guid isPermaLink="false">http://battellemedia.com/archives/2004/04/enterprise_search_yaawwwwnnn.php#comment-24273</guid>
		<description>&lt;p&gt;Ah yes, I</description>
		<content:encoded><![CDATA[<p>Ah yes, I</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Ezra Ball</title>
		<link>http://battellemedia.com/archives/2004/04/enterprise_search_yaawwwwnnn.php#comment-24272</link>
		<dc:creator>Ezra Ball</dc:creator>
		<pubDate>Tue, 27 Apr 2004 19:23:49 +0000</pubDate>
		<guid isPermaLink="false">http://battellemedia.com/archives/2004/04/enterprise_search_yaawwwwnnn.php#comment-24272</guid>
		<description>&lt;p&gt;Kendall: I agree wholeheartedly that enterprise search is vastly more interesting than whole-web searching. &lt;/p&gt;&lt;p&gt;The greatest common factor of web content is the hyperlink (which no-one before Google really realized, or took advantage of). And that&#039;s why your other point, that page rank is not optimal for enterprise search, is right on. No matter how heterogenous your document set, no matter how sloppy your content creation workflow or haphazard your categorization, chances are, when your content is all created internally, you&#039;re going to be able to extract more meaningful metadata from your content than what the Google appliance can derive from its hyperlinks (which, in an enterprise context, can&#039;t reasonably be considered a &quot;vote&quot;).   &lt;/p&gt;&lt;p&gt;I recently worked on a project evaluating enterprise search options for my company. The Google appliance made the list only for its CYA factor: we could easily convince internal critics that we&#039;d bought a good search engine, because it was Google. &lt;/p&gt;</description>
		<content:encoded><![CDATA[<p>Kendall: I agree wholeheartedly that enterprise search is vastly more interesting than whole-web searching. </p>
<p>The greatest common factor of web content is the hyperlink (which no-one before Google really realized, or took advantage of). And that&#8217;s why your other point, that page rank is not optimal for enterprise search, is right on. No matter how heterogenous your document set, no matter how sloppy your content creation workflow or haphazard your categorization, chances are, when your content is all created internally, you&#8217;re going to be able to extract more meaningful metadata from your content than what the Google appliance can derive from its hyperlinks (which, in an enterprise context, can&#8217;t reasonably be considered a &#8220;vote&#8221;).   </p>
<p>I recently worked on a project evaluating enterprise search options for my company. The Google appliance made the list only for its CYA factor: we could easily convince internal critics that we&#8217;d bought a good search engine, because it was Google. </p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Kendall Willets</title>
		<link>http://battellemedia.com/archives/2004/04/enterprise_search_yaawwwwnnn.php#comment-24271</link>
		<dc:creator>Kendall Willets</dc:creator>
		<pubDate>Tue, 27 Apr 2004 18:45:10 +0000</pubDate>
		<guid isPermaLink="false">http://battellemedia.com/archives/2004/04/enterprise_search_yaawwwwnnn.php#comment-24271</guid>
		<description>&lt;p&gt;Enterprise search is actually interesting because it&#039;s not the Web, and there&#039;s more room to do creative things in an environment where http is not the only protocol.  For instance, most users could benefit from a combined file/search server, and I think we&#039;ll see more search-enabled SMB devices in the future.  There&#039;s also a lot of rich data competing with the unstructured morass.  Things like P2P search are also much more likely to work in this environment.&lt;/p&gt;

&lt;p&gt;Google doesn&#039;t seem to have a particularly good offering in this area.  It looks more like a one-size-fits-all branding approach than a deeply thought out solution.&lt;/p&gt;

&lt;p&gt;Some of the gaps in their feature set (based on reading their sales literature; I may be wrong about some points):&lt;/p&gt;

&lt;p&gt;Crawl:  Content must be on http servers.  Back when I had a job, most content was on SMB shares, and I would think a crawler for those could be devised.  Likewise, a shared dropbox directory  for  submissions seems more sensible for users than httpd.&lt;/p&gt;

&lt;p&gt;Index:  A static index based on periodic crawls.  You can&#039;t edit your document, push &quot;save&quot;, and expect to see the changes in the index.  The index parses only a fixed set of file types. &lt;/p&gt;

&lt;p&gt;Search:  Based on, guess what, pagerank.  There is some doubt as to whether this is the best ranking algorithm for intranets.  No integration with other data sources, eg an ODBC driver.&lt;/p&gt;

&lt;p&gt;To address these issues, the ultimate solution is going to look more like a network filesystem with a dynamic index than this device.  The door is, for better or worse, open for Microsoft to move into this area.&lt;/p&gt;</description>
		<content:encoded><![CDATA[<p>Enterprise search is actually interesting because it&#8217;s not the Web, and there&#8217;s more room to do creative things in an environment where http is not the only protocol.  For instance, most users could benefit from a combined file/search server, and I think we&#8217;ll see more search-enabled SMB devices in the future.  There&#8217;s also a lot of rich data competing with the unstructured morass.  Things like P2P search are also much more likely to work in this environment.</p>
<p>Google doesn&#8217;t seem to have a particularly good offering in this area.  It looks more like a one-size-fits-all branding approach than a deeply thought out solution.</p>
<p>Some of the gaps in their feature set (based on reading their sales literature; I may be wrong about some points):</p>
<p>Crawl:  Content must be on http servers.  Back when I had a job, most content was on SMB shares, and I would think a crawler for those could be devised.  Likewise, a shared dropbox directory  for  submissions seems more sensible for users than httpd.</p>
<p>Index:  A static index based on periodic crawls.  You can&#8217;t edit your document, push &#8220;save&#8221;, and expect to see the changes in the index.  The index parses only a fixed set of file types. </p>
<p>Search:  Based on, guess what, pagerank.  There is some doubt as to whether this is the best ranking algorithm for intranets.  No integration with other data sources, eg an ODBC driver.</p>
<p>To address these issues, the ultimate solution is going to look more like a network filesystem with a dynamic index than this device.  The door is, for better or worse, open for Microsoft to move into this area.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
