Google

Archives

Gallery

DSCN2252 800px-karte_karibik_inseln_zh DSCN2251 DSCN2241

Be care of RangeQuery in Lucene

Reminder, Lucene has many Query types

– TermQuery, BooleanQuery,

ConstantScoreQuery, MatchAllDocsQuery,

MultiPhraseQuery, FuzzyQuery,

WildcardQuery, RangeQuery, PrefixQuery,

PhraseQuery, Span*Query,

DisjunctionMaxQuery, etc.

There is a bunch of Query implements in Lucene, which makes lucene very powerful in search. However, you should be very care of using Query like RangeQuery, especially when the size of your collection is very large.

As you know that lucene will [...]

The Ivory Toolkit with the SMRF Retrieval Engine (under Hadoop Framework)

As the Increase of IR dataset in size, it seems that a powerful platform for rapidly indexing and searching is need.  Ivory is a newly announced search platform developed on the basis of Hadoop. It could be a good choice when we come to billion era.

This would also be a future step for our SaberLucene Project (under release). Beside MapReduce framework, [...]

Trie-based approximate autocomplete implementation with support for ranks and synonyms

Thoughts on Lucene, Solr, Nutch and vertical search 
Trie-based approximate autocomplete implementation with support for ranks and synonyms

Posted by Kelvin on 01 Jul 2009 at 02:30 am | Tagged as: programming

The problem of auto-completing user queries is a well-explored one.

For example,
Type less, find more: fast autocompletion search with a succinct index
http://stevedaskam.wordpress.com/2009/06/07/putting-autocomplete-data-structure-to-the-test/
http://suggesttree.sourceforge.net/
http://sujitpal.blogspot.com/2007/02/three-autocomplete-implementations.html

However, there’s been little written [...]

Lucene 新子项目OpenRelevance起航

25 June 2009 – Apache Open Relevance Kickoff

也就是昨天,Apache 官方投票通过启动一个Lucene 子项目Open Relevance Project (ORP)。ORP主要目标是检索数据集、评价和查询,这样Lucene的开发者和用户就可以更容易地进行相关度评价测试。比较像TREC,NTCIR等这些评测,不过这个项目会更开放些。

更多信息可参考:

http://lucene.apache.org/openrelevance/

http://wiki.apache.org/lucene-java/OpenRelevance

这算是一个比较exciting NEWS,特别是像我比较喜欢Lucene,又想用Lucene搞点IR方面的研究的。

Online free book: Search User Interface

Marti Hearst 刚完成的一本新书, Search User Interfaces. 令人兴奋的是, 网上可以在网上免费阅读!  Marti 是 UC Berkeley 教授, 也是搜索用户接口设计方面的专家。这本相关博客: SearchUpTicious.

url:  http://searchuserinterfaces.com/book/

Incoming search terms for the article:Search User Interfaces (1)