Why write my own indexing and search engine
January 30, 2007
Like any developer, I used to trust only the code I wrote myself. All that has changed with OpenSource and its widespread use.
However Iam always on the lookout to write something that is better than what is available in OpenSource – personally a means to justify the act of writing an application ![]()
The growing volume of data in an enterprise can be a liability or an asset, depending on how you see it. Access to this data converts it to useful information.
How does one access information easily? Do we really care about the millions of hits that Google returns? I dont think we go beyond the first couple of pages.
I define “Effective search” to address the above issue – I need to get to the information of interest fast, period.
OpenSource indexing and search frameworks are far behind the commercial ones like a Google search appliance or the Verity or other search engines.
Looking around, Lucene turned out to be a good fit for my index. The catch is I still required parsers, readers and data sources to make it complete.
This led me to write Ferret. It doesnot re-invent the wheel i.e wherever possible.
The good news is that it can index file systems & web sites(secure inranets and public sites). The best part is that it is highly customizable – I can add a datasource to index databases for e.g or add parsers to new file types.
The recent announcement on availability of Omnifind led me to evaluate it and of course compare with Ferret. After some extensive study of its features, Iam still to find out if I will be able to recommend it to a client when I cannot customize many aspects except the look & feel maybe. Also it beats me why I cannot schedule an indexing operation or atleast provide API to invoke the indexer! Omnifind suits the “indexing for dummies” needs but not for any active deployment within a coprorate portal for e.g.
For now, Ferret does all this and has found a client
Feed for the trail
August 20, 2007 at 8:17 am
hi it’s a good to have a complete code for software development. But I still confuse in the aspect of compile,linking, and bug.Is there any manual that only shows figure for that reason. I mean for all type of programming language.To get the code is easy, to test it I don’t know…
August 20, 2007 at 8:20 am
HI….I HOW TO DEVELOP SOFTWARE? I GOT MANY CODE BUT DON’T ON HOW TO GET THE OUTPUT?CAN U SUGGEST THE SITE THAT EXPLAINS IN FIGURE…
August 20, 2007 at 10:42 am
Yodin,
Iam not sure what you are looking for and whether it is even related to this post. Are you looking for a easy to use search engine implementation that requires minimal effort in compiling and deployment?
December 28, 2011 at 5:33 pm
Looks like someone else too has search engine with same name.
http://www.chrisumbel.com/article/ruby_fulltext_ferret
I could not reach the ruby ferret site as I keep getting site not found problem. However, I was puzzled when I had seen people talking about using ferret and wondered whether it was the same ferret that you wrote.
February 5, 2013 at 2:32 pm
“Why write my own indexing and search engine | Regu’s mind
trail” really enables me personally think a small bit extra.
I actually admired each and every particular piece of this post.
Thanks for your effort -Heriberto
April 18, 2013 at 5:30 am
My brother suggested I might like this website. He was entirely right.
This post truly made my day. You can not imagine simply how so much time I had spent
for this info! Thanks!
June 15, 2013 at 10:22 am
I’m really enjoying the theme/design of your website. Do you ever run into any browser compatibility problems? A small number of my blog audience have complained about my blog not operating correctly in Explorer but looks great in Firefox. Do you have any tips to help fix this problem?