Archives

A short philosophic story with different versions

Story One

A. A man who was dying of hunger staggered to a farmer’s house. The farmer gave him a steamed bun and some water, and the man survived. Afterwards, the man found that the farmer was as poor as a church mouse. He was moved to tears, and got [...]

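See how much your blog is worth, plus a call for link exchanges

Reportedly, Lao Xu's blog is worth 15 million US dollars, the highest in the world. Surprising, isn't it? Do you know how much your blog is worth? Just click the following link: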

http://www.business-opportunities.biz/projects/how-much-is-your-blog-worth/

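Link exchange guidelines:

1. The topic must be related to this blog: computers, the Internet, software, and so on.

2. The blog is updated regularly.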

3.

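If your site meets the conditions above and you are interested, feel free to add this blog to your links and leave a comment with your URL and site name; I will add your link promptly.

My blog is worth $564.54. How much is your blog worth?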

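IR/NLP/Machine Learning/Misc Tools

The vast majority of the tools below are open source, released under licenses such as the GPL and Apache. Please read each tool's license statement carefully before use.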

I. Information Retrieval
1. Lemur/Indri

The Lemur Toolkit for Language Modeling and Information Retrieval
http://www.lemurproject.org/
Indri:
An experimental system intended for academic research.

2. Lucene/Nutch

Apache Lucene is a high-performance, full-featured text search engine library written entirely in Java.
Lucene is a top-level Apache open source project, released under the Apache License 2.0 and written entirely in Java, with ports to Perl, C/C++, .NET, and other platforms.
http://lucene.apache.org/
http://www.nutch.org/

However, the official release is suited only to enterprise applications, not to academic research experiments.

3. CC-CEDICT Chinese-English dictionary

CC-CEDICT is a continuation of the CEDICT project. The objective of the CEDICT project was to create an online, downloadable (as opposed to [...]