<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	>
<channel>
	<title>Comments on: Science source selection</title>
	<atom:link href="http://federatedsearchblog.com/2009/07/01/science-source-selection/feed/" rel="self" type="application/rss+xml" />
	<link>http://federatedsearchblog.com/2009/07/01/science-source-selection/</link>
	<description>Covers topics related to federated search and the deep web</description>
	<pubDate>Fri, 12 Mar 2010 02:03:26 +0000</pubDate>
	<generator>http://wordpress.org/?v=2.7.1</generator>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
		<item>
		<title>By: Peter Noerr</title>
		<link>http://federatedsearchblog.com/2009/07/01/science-source-selection/comment-page-1/#comment-28652</link>
		<dc:creator>Peter Noerr</dc:creator>
		<pubDate>Fri, 03 Jul 2009 21:09:53 +0000</pubDate>
		<guid isPermaLink="false">http://federatedsearchblog.com/?p=739#comment-28652</guid>
		<description>There is generally another aspect to the "accessibility" of sources other than the technical, which is what Jonathan talks about. And that is commercial.

The commercial conditions for getting a set of records into an aggregated search environment seem to often be more onerous than allowing a broadcast search of the native source. Part of the problem lies in perception of the aggregated environment as being a competitor to the source of the records. This may be a true perception, a false one, or a wishful one.</description>
		<content:encoded><![CDATA[<p>There is generally another aspect to the &#8220;accessibility&#8221; of sources other than the technical, which is what Jonathan talks about. And that is commercial.</p>
<p>The commercial conditions for getting a set of records into an aggregated search environment seem to often be more onerous than allowing a broadcast search of the native source. Part of the problem lies in perception of the aggregated environment as being a competitor to the source of the records. This may be a true perception, a false one, or a wishful one.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Sol</title>
		<link>http://federatedsearchblog.com/2009/07/01/science-source-selection/comment-page-1/#comment-28604</link>
		<dc:creator>Sol</dc:creator>
		<pubDate>Thu, 02 Jul 2009 14:53:52 +0000</pubDate>
		<guid isPermaLink="false">http://federatedsearchblog.com/?p=739#comment-28604</guid>
		<description>Jonathan,

I think the answer is a hybrid service. Index what you can (discovery service), federate what you can, harvest what you can, etc. A good federated search framework can integrate access to content that has been obtained by a number of different technologies.

In other words, lead with the question of what sources are important not with the question of what technology is the most convenient. Then find ways to get access to each of them. I've never said that federated search makes discovery services or crawling or harvesting obsolete.</description>
		<content:encoded><![CDATA[<p>Jonathan,</p>
<p>I think the answer is a hybrid service. Index what you can (discovery service), federate what you can, harvest what you can, etc. A good federated search framework can integrate access to content that has been obtained by a number of different technologies.</p>
<p>In other words, lead with the question of what sources are important not with the question of what technology is the most convenient. Then find ways to get access to each of them. I&#8217;ve never said that federated search makes discovery services or crawling or harvesting obsolete.</p>
]]></content:encoded>
	</item>
	<item>
		<title>By: Jonathan Rochkind</title>
		<link>http://federatedsearchblog.com/2009/07/01/science-source-selection/comment-page-1/#comment-28602</link>
		<dc:creator>Jonathan Rochkind</dc:creator>
		<pubDate>Thu, 02 Jul 2009 13:51:30 +0000</pubDate>
		<guid isPermaLink="false">http://federatedsearchblog.com/?p=739#comment-28602</guid>
		<description>Sol, I've said this before, but I don't understand where you're coming from on the "it's fine to only search content you provide access to" point, because as far as I can tell, _federated search is exactly the same thing_.  Even with broadcast search, you can only search content that is accessible to broadcast search by your tool, which is not ALL content!

With either broadcast search or an aggregated index, you only have access to what you have access to. And in either case, providing access to any arbitrary resource may or may not be possible, and will take a non-zero amount of 'development' time to add.  With either case, the user using the tool only has access to searching content included in the tool, unless they leave the tool, which they can do in either case. 

It seems to me you are assuming that broadcast search will have a lower barrier to including additional resources, and end up having more coverage?  This to me is something that needs to be seen in practice, not assumed theoretically.  The key thing will be comparing two actual tools, and seeing which tool includes more resources, and more relevant and quality resources, for some particular context (audience and use cases). 

If SerSol's aggregated index comes up short, then that will be to it's detriment.  But I don't see how that can be assumed without an actual evaluation of two actual deployed tools side by side.</description>
		<content:encoded><![CDATA[<p>Sol, I&#8217;ve said this before, but I don&#8217;t understand where you&#8217;re coming from on the &#8220;it&#8217;s fine to only search content you provide access to&#8221; point, because as far as I can tell, _federated search is exactly the same thing_.  Even with broadcast search, you can only search content that is accessible to broadcast search by your tool, which is not ALL content!</p>
<p>With either broadcast search or an aggregated index, you only have access to what you have access to. And in either case, providing access to any arbitrary resource may or may not be possible, and will take a non-zero amount of &#8216;development&#8217; time to add.  With either case, the user using the tool only has access to searching content included in the tool, unless they leave the tool, which they can do in either case. </p>
<p>It seems to me you are assuming that broadcast search will have a lower barrier to including additional resources, and end up having more coverage?  This to me is something that needs to be seen in practice, not assumed theoretically.  The key thing will be comparing two actual tools, and seeing which tool includes more resources, and more relevant and quality resources, for some particular context (audience and use cases). </p>
<p>If SerSol&#8217;s aggregated index comes up short, then that will be to it&#8217;s detriment.  But I don&#8217;t see how that can be assumed without an actual evaluation of two actual deployed tools side by side.</p>
]]></content:encoded>
	</item>
</channel>
</rss>
