Co-occurrence-based Thesaurus

The general idea underlying the use of term co-occurrence data for thesaurus construction is that words that tend to occur together in documents are likely to have similar, or related, meanings. Co-occurrence data thus provides a statistical method for automatically identifying the semantic relationships that are normally recorded in a hand-made thesaurus.

Predicate-Argument-based Thesaurus

This method constructs a thesaurus from predicate-argument structures. It is based on the idea that there are restrictions on which words can appear in certain environments, and in particular on which words can be arguments of a given predicate. For example, a cat may walk or bite, but cannot fly. Each noun may therefore be characterized by the verbs or adjectives that it occurs with. Nouns may then be grouped according to the extent to which they appear in similar constructions.

Reference: Ad Hoc Retrieval Experiments Using WordNet and Automatically Constructed Thesauri.
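Both methods above can be read as the same recipe: describe each word by a count vector, then treat words with similar vectors as related. The following is a minimal sketch of that reading, not the procedure from the referenced paper. The function names, the toy data, and the choice of cosine similarity are all assumptions made for illustration. For the co-occurrence method each term is described by the documents it occurs in; for the predicate-argument method each noun is described by the predicates it occurs with.

```python
from collections import Counter, defaultdict
from math import sqrt


def cosine(a, b):
    """Cosine similarity between two sparse count vectors (Counters)."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm = sqrt(sum(v * v for v in a.values()) * sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def document_vectors(documents):
    """Co-occurrence view: map each term to the documents it occurs in."""
    vectors = defaultdict(Counter)
    for doc_id, text in enumerate(documents):
        for term in set(text.lower().split()):
            vectors[term][doc_id] += 1
    return dict(vectors)


def predicate_vectors(pairs):
    """Predicate-argument view: map each noun to the predicates it occurs with."""
    vectors = defaultdict(Counter)
    for noun, predicate in pairs:
        vectors[noun][predicate] += 1
    return dict(vectors)


def related(term, vectors, top_n=3):
    """Rank the other words by similarity to `term` (ties broken alphabetically)."""
    target = vectors.get(term, Counter())
    scores = [(other, cosine(target, vec))
              for other, vec in vectors.items() if other != term]
    return sorted(scores, key=lambda s: (-s[1], s[0]))[:top_n]


# Hypothetical toy corpus: terms that share documents score as related.
docs = [
    "cat dog pet",
    "cat dog vet",
    "stock market price",
    "stock price index",
]
print(related("cat", document_vectors(docs)))

# Hypothetical (noun, predicate) pairs, assumed to come from a parser.
pairs = [
    ("cat", "walk"), ("cat", "bite"),
    ("dog", "walk"), ("dog", "bite"), ("dog", "bark"),
    ("bird", "fly"), ("bird", "walk"),
    ("plane", "fly"),
]
print(related("cat", predicate_vectors(pairs)))
```

In a real setting the (noun, predicate) pairs would have to be extracted by a syntactic parser, and raw counts would usually be replaced by a weighting scheme. Both of those steps are outside what the post describes, so they are left out here.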
Two methods for Thesaurus Construction
Posted by jeffy
Posted on 7:41 PM