All Classes

Class
Description
 
 
This is an abstract class with general index creation functionality.
 
 
An abstract class of all confirmation measures which have to handle undefined results.
 
Abstract class containing the process of the vector creation.
 
Aggregates the given confirmation values and returns a single coherence value.
 
 
 
 
 
 
 
This is an interface for an adapter that makes boolean document word counts available.
 
Deprecated.
A coherence measure calculates the coherence of a given set of top word sets.
Deprecated.
Deprecated.
This confirmation measure calculates the conditional probability of W' given W*.
The general interface of a confirmation measure.
 
This is the interface for a corpus adapter - a component that makes word counts available to the system.
 
This class contains the segmentations as defined in the SegmentationDefinition class and word counts for a word set.
 
This confirmation measure calculates the difference between the conditional probability of W' given W* abd the marginal probability of W'.
This type of coherence measure is a non-vector-based coherence.
This vector creator uses a direct confirmation measure to create the vectors.
This interface is implemented by all confirmation measures which are using the probabilities directly.
Deprecated.
Deprecated.
This confirmation measure calculates the difference of the conditional probability of W' given W* and the conditional probability of W' given ¬W*.
 
Determines the frequencies of words and word sub sets of a given word set.
 
 
 
 
A simple structure containing the text of a document and its number of tokens.
This confirmation measure calculates the Jaccard similarity between W' and W*.
 
 
 
This confirmation measure calculates the Likelihood similarity between W' and W*.
 
This interface defines the epsilon used by log based calculations for avoiding the logarithm of 0.
This confirmation measure calculates the logarithm of the conditional probability of W' given W*.
This confirmation measure calculates the logarithm of the Jaccard similarity between W' and W*.
This confirmation measure calculates the Likelihood similarity between W' and W*.
This confirmation measure calculates the logarithm of the ratio between the joint probability of W' and W* and the product of the two marginal probabilities.
This class can make usage of a given Lucene index as corpus.
A simple class that iterates over a given Lucene index and creates a histogram of the document lengths stored inside a documentLengthField.
Deprecated.
 
 
 
This confirmation measure calculates the logarithm of the ratio between the joint probability of W' and W* and the product of the two marginal probabilities.
This confirmation measure calculates Olssons coherence.
 
 
 
This is just for internal usage by the DirectConfirmationBasedVectorCreator class.
 
 
 
Simple example of a Segmentator that gets a word set comprising two topics and creates a SegmentationDefinition with which every word of one of the topics is compared to every other word of the other topic.
 
Class implementing Pearsons sample correlation coefficient for two given, paired samples.
This class creates the Lucene indexes which are used to access the reference corpus during the coherence calculation using a WindowSupportingLuceneCorpusAdapter.
This vector creator uses the marginal probabilities for the vector creation.
This interface defines the methods of a class that estimates the probabilities of a given word set.
 
 
This confirmation measure calculates the ratio between the joint probability of W' and W* and the product of the two marginal probabilities.
This object contains the segmentations of a word set.
A Segmentator generates the SegmentationDefinition object for a word set with the given size.
This confirmation measure calculates Shogenjis coherence.
A simple Lucene Analyzer used for the index creation.
Deprecated.
This decorator implements a very simple cache.
This class creates the Lucene indexes which are used to access the reference corpus during the coherence calculation using a LuceneCorpusAdapter.
 
Deprecated.
 
 
 
This class contains the segmentations and probabilities for a word set.
This class contains the segmentations, probabilities and context vectors for a word set.
This is a vector-based coherence measure.
 
Interface for the vector creation.
Deprecated.
 
 
This adapter supports window based probability estimation methods.
 
Deprecated.