Archive for the "viewpoints" Category

16
Mar

Laura at the Llyfrgellydd blog rants about library technology. And, she takes no hostages. “Multiple systems, or at the very least, the appearance of multiple systems, are enemies to usability” is Laura’s first lob. Oh goodie, she’s going to have something nice to say about federated search, right, since hiding those multiple systems are friends to usability? Not a chance.

I think one of the reasons federated search doesn’t work is because the metadata is coming from so many different sources that it just can’t be translated consistently. It seems a huge waste of time that vendors have people working on connectors to read that metadata and parse it.

I didn’t realize that federated search doesn’t work. I assume Laura is writing about the metadata like title, author, and snippet that forms the search results pages. Yes, consistency isn’t perfect but it’s not nearly as bad as Laura pronounces.

Kidding aside, go read Laura’s article. You’ll get the perspective of a “past life” (her words) reference librarian who has many bones to pick with library technology. It’s good market research material.

If you enjoyed this post, make sure you subscribe to the RSS feed!

11
Mar

Kosmix is the search engine that produces “topic pages” on millions of subjects. Kosmix creates these topic pages by searching APIs of deep web sources (in real time). In other words, Kosmix relies heavily on federated search for the content of their topic pages. (Actually, Kosmix combines federated search with crawling technology. More about this later in this article.)

Kosmix co-founder, Anand Rajaraman, recently spoke at PARC, the prestigious Palo Alto Research Center. Rajaraman’s talk: “What lies beneath: harnessing the deep web.” A video of the hour-long talk is available at the PARC web-site. The slides are available at the Kosmix Blog.

Rajaraman has very impressive credentials. He is also co-founder of the VC firm Cambrian Ventures, he teaches a class for Stanford’s Computer Science department, and he is former Director of Technology at Amazon.com where:

he was responsible for technology strategy. Anand helped launch the transformation of Amazon.com from a retailer into a retail platform, enabling third-party retailers to sell on Amazon.com’s website. Third-party transactions now account for over 25% of all US transactions, and represent Amazon’s fastest-growing and most profitable business segment.

Rajamaran has his own Kosmix topic page.

Read the rest of this entry »

If you enjoyed this post, make sure you subscribe to the RSS feed!

2
Mar

I was absolutely delighted to read a recent article by Barbara Quint, editor-in-chief of Information Today’s Searcher magazine. Federated Searching: Good Ideas Never Die, They Just Change Their Names reminds us that federated search existed before the term became popular:

Even back in the days when only professional searchers accessed online databases, searchers wanted some way to find answers in multiple files without having to slog through each database one at a time. In those days, the solution was called multi-file or cross-file searching, e.g. Dialog OneSearch or files linked via Z39.50 (ANSI/NISO standard for data exchange).

A little sidebar: I heard about this article from my brother Abe (founder, President, and CTO of blog sponsor Deep Web Technologies) before it came onto my Google Alerts radar. Abe was at the NFAIS Conference where he had gone to deliver a presentation on multilingual federated search. At the conference, Abe had a conversation with Iris Hanney, President of Unlimited Priorities, a support services company for businesses. It turns out that Barbara Quint is a member of the Unlimited Priorities team and produced this article for one of their publications, DCLNews. And, that’s how Abe heard about the article, in which he’s mentioned. Small world!

Read the rest of this entry »

If you enjoyed this post, make sure you subscribe to the RSS feed!

26
Feb

Check out this article at the Concordia University Library News Blog: Goodbye, BearHunt. BearHunt is (soon to be ‘was’) their federated search system. They didn’t like it and they’re not shy about telling you how much they didn’t like it. I’ve never seen so many negative comments about a federated search solution. I counted ten complaints. I’ve broken them out in my reproduction of Concordia’s article here:

Goodbye, BearHunt

QuickSearch via BearHunt will no longer be available beginning March 1st. There are a number of reasons that the library chose to discontinue this federated search service. Most important, as everyone is tightening their belts,

BearHunt came at a substantial cost while offering no new content.

Also important,

usage was low compared to cost.

When it first became available, we were excited to try a federated search, but along with other CLIC libraries and the rest of the library world,

we’ve realized that the technology needed to support the idea isn’t quite there yet.

The relevancy rankings don’t seem to work correctly or consistently,

links were frequently broken,

searches executed slowly, and

the interface left a lot to be desired.

We also felt

there was a possibility that it was misleading searchers to think they were searching ‘everything’, while in fact there are a number of both purchased and freely available resources that aren’t included in a BearHunt search.

Also,

the search is not complete in that it stops at a predetermined time interval instead of continuing until all searches have had a chance to complete.

In the end,

BearHunt and federated search as it now exists seems at odds with our mission to teach academic research skills tailored to specific situations and inquiries, using the full breadth of our resources.

We will look forward to new federated search products with anticipation, especially if they meet our criteria of completeness and have added-value in their presentation.

Read the rest of this entry »

If you enjoyed this post, make sure you subscribe to the RSS feed!

9
Feb

If you’ve got a half hour to spare, maybe in your car via iTunes, then you might enjoy this blogtalkradio interview at Friday Traffic Report: Exploring the Deep Web.

Friday Traffic Report host Jack Humphrey interviewed Bill Wardell about the deep Web. Wardell’s site, The CyberHood Watch Blog, aims to keep families and especially children safe on the Web.

While I know quite a bit about the deep Web, I enjoyed the conversational style within which a basic introduction was provided. I recommend this interview to those of you new to the concept of the deep Web and to new LIS students.

If you enjoyed this post, make sure you subscribe to the RSS feed!

5
Feb

[ Editor's note: Charles Knight received honorable mention in the second annual Federated Search Blog contest. In recognition of this honor, the Blog is publishing his essay and contest sponsor Deep Web Technologies is awarding Charles a $100 prize.

Charles Knight is a blogger and leading authority on alternative search engines. Recently the editor of AltSearchEngines, he now blogs about "All things Search" for TheNextWeb at http://thenextweb.com/search. He lives in Charlottesville, VA. ]

How Federated Search can Make You RICH! by Charles Knight

Unlike all of the other entries to this contest, this is the ONLY one that will MAKE YOU RICH.

Yes, you heard me right, this is a COMMERCIAL application of Federated Search technology.

Feel free to PATENT, BUILD, and SELL this product and then WATCH THE MONEY ROLL IN!

The FIRST STEP is to print out and tape in front of you this image of a beautiful Gemstone Globe.

Read the rest of this entry »

If you enjoyed this post, make sure you subscribe to the RSS feed!

26
Jan

Blog sponsor Deep Web Technologies built a federated search tool for Stanford University. I was involved with the first prototype and I’m proud of what the Stanford/Deep Web Technologies partnership has accomplished. Stanford’s Grace Baysinger - head librarian and bibliographer for the Swain Library of Chemistry and Chemical Engineering - provides a nice overview of the search tool in the “Speaking of Computers” newsletter. And, Deep Web Technologies’ account manager, Debbie Taylor, shares a little bit of her involvement with the project as well on the Deep Web Technologies Blog.

My involvement with the Stanford federated search tool was multi-faceted. I worked with Stanford to understand their requirements. I also worked with Deep Web Technologies’ connector team to build the source interfaces Stanford wanted. I even used my limited web design skills to create the HTML that was used in the very first prototype. There were many conversations and many details that were worked out during a number of phone conversations. When I left my full time position of engineer and manager at Deep Web Technologies I lost touch with the folks at Stanford who led the way in introducing federated search to their university. I hear that they’re very happy with their search tool. I’m proud to have had a role in the building of that tool.

If you enjoyed this post, make sure you subscribe to the RSS feed!

22
Jan

I recently purchased a copy of How UK academic libraries choose metasearch systems from Emerald Insight. I was reading the article on my way to writing a review of it when I was struck by this statement:

Advice comes from successful, as well as unsuccessful implementations. One of the most common pieces of advice is that libraries should remember to treat metasearch as a service, not a product, and to ensure that there is adequate post-installation vendor support.

What struck me was that the statement, which seemed so obvious to me, was deemed important enough by the authors to state explicitly. Of course, federated search is a service. Whether or not you host federated search on your own servers with your own staff managing the server hardware, software, and application there’s plenty of service that someone will have to perform, and it’s on an ongoing basis. Off the top of my head, here are some service aspects to federated search:

  • Building connectors - the more work that goes into building connectors the better the search experience
  • Monitoring/disabling/re-enabling connectors when sources become unavailable and when they come back
  • Updating or rewriting connectors when sources change their search interface
  • Updating the application when the vendor provides new maintenance or feature releases
  • Customizing the application to suit your needs

None of these tasks are easy. Each requires vendor support or time from your staff.

Read the rest of this entry »

If you enjoyed this post, make sure you subscribe to the RSS feed!

19
Jan

Abe pointed me to an article, “An info island or the kitchen sink: when you think about federated search engines, think about what you’ll include in the searches.” While the article is a bit dated (2007) it has this air of timelessness to it.

The author, Richard Todd, considers three paths to source selection:

In the course of implementing a federated search engine at my organization, I eventually recognized three fundamental approaches for determining which of the available sources to include in our system. I call these three guiding concepts the kitchen sink approach, the Easter Island approach, and the gatekeeper approach.

Without reading further can you guess what these three approaches are? Abe told me that he was contacted by someone who threw out these terms as if Abe should be familiar with them. He wasn’t. Neither was I.

Read the rest of this entry »

If you enjoyed this post, make sure you subscribe to the RSS feed!

16
Dec

The commercial investing and personal finance web-site, The Motley Fool, published a brief article about the deep Web. No, the article does not dive very deep in its assessment of the deep Web. It’s a very cursory introduction. But, it’s great to see such coverage at all in such a well respected investment site.

The article tells two ways that investors can make money from the deep Web: buy stock in companies that mine deep Web data, and use the deep Web in your investment research.

I strongly agree with this paragraph:

If you have a keen interest in [the deep Web], it is also possible to invest in technology that is going to indirectly help open up this ‘invisible web’. Let me cite one area that I think holds promise. I reckon analytics programs that can successfully measure the response of ad campaigns to the deep web community or successfully trace browsing habits in these areas will make their owners/investors a lot of money.

Yes, I believe sentiment mining is in its infancy and will only grow in its importance to marketers. If you have a few minutes you might enjoy the article.

If you enjoyed this post, make sure you subscribe to the RSS feed!