Class MalletLdaWrapper.MalletLDATopicModeler
java.lang.Object
cc.mallet.topics.ParallelTopicModel
org.dice_research.topicmodeling.algorithm.mallet.MalletLdaWrapper.MalletLDATopicModeler
- All Implemented Interfaces:
Serializable,org.dice_research.topicmodeling.algorithms.ClassificationModel,org.dice_research.topicmodeling.algorithms.LDAModel,org.dice_research.topicmodeling.algorithms.Model,org.dice_research.topicmodeling.algorithms.ProbabilisticWordTopicModel,org.dice_research.topicmodeling.algorithms.VocabularyContaining,org.dice_research.topicmodeling.algorithms.VocabularyContainingClassificationModel,org.dice_research.topicmodeling.algorithms.VocabularyContainingModel
- Enclosing class:
- MalletLdaWrapper
protected static class MalletLdaWrapper.MalletLDATopicModeler
extends cc.mallet.topics.ParallelTopicModel
implements org.dice_research.topicmodeling.algorithms.LDAModel
- See Also:
- Serialized Form
-
Field Summary
FieldsModifier and TypeFieldDescriptionstatic intprotected intprotected MalletLdaInferenceWrapperprotected intprotected intprotected static org.slf4j.Loggerprotected cc.mallet.topics.WorkerRunnable[]private static longprotected double[]protected org.dice_research.topicmodeling.utils.vocabulary.Vocabularyprotected double[][]Fields inherited from class cc.mallet.topics.ParallelTopicModel
alpha, alphabet, alphaSum, beta, betaSum, burninPeriod, data, DEFAULT_BETA, docLengthCounts, formatter, modelFilename, numIterations, numTopics, numTypes, optimizeInterval, printLogLikelihood, randomSeed, saveModelInterval, saveSampleInterval, saveStateInterval, showTopicsInterval, stateFilename, temperingInterval, tokensPerTopic, topicAlphabet, topicBits, topicDocCounts, topicMask, totalTokens, typeTopicCounts, UNASSIGNED_TOPIC, usingSymmetricAlpha, wordsPerTopic -
Constructor Summary
ConstructorsConstructorDescriptionMalletLDATopicModeler(int numberOfTopics, double alphaSum, double beta, long seed)MalletLDATopicModeler(int numberOfTopics, long seed)MalletLDATopicModeler(cc.mallet.types.LabelAlphabet topicAlphabet, double alphaSum, double beta, long seed) -
Method Summary
Modifier and TypeMethodDescriptionprotected voidvoidestimate()double[]doublegetBeta()org.dice_research.topicmodeling.utils.doc.DocumentClassificationResultgetClassificationForDocument(org.dice_research.topicmodeling.utils.doc.Document document)cc.mallet.topics.TopicInferencerintdoublegetProbabilityOfWord(int wordId, int topicId)doublegetSmoothedProbabilityOfTopic(int topicId)doublegetSmoothedProbabilityOfWord(int wordId, int topicId)double[]getTopicProbabilitiesForDocument(org.dice_research.topicmodeling.utils.doc.DocumentWordCounts wordCounts)intorg.dice_research.topicmodeling.utils.vocabulary.Vocabularyorg.dice_research.topicmodeling.utils.vocabulary.VocabularyMappinggetVocabularyMapping(org.dice_research.topicmodeling.utils.vocabulary.Vocabulary otherVocabulary)int[]inferTopicAssignmentsForDocument(int[] tokens)int[]inferTopicAssignmentsForDocument(org.dice_research.topicmodeling.utils.doc.Document document)int[]inferTopicAssignmentsForDocument(org.dice_research.topicmodeling.utils.doc.DocumentWordCounts wordCounts)voidinitialize(cc.mallet.types.InstanceList instances)voidsetInferenceIterations(int inferenceIterations)voidsetVersion(int version)voidsetVocabularyDecorator(org.dice_research.topicmodeling.utils.vocabulary.VocabularyDecorator vocabulary)Methods inherited from class cc.mallet.topics.ParallelTopicModel
addInstances, buildInitialTypeTopicCounts, displayTopWords, getAlphabet, getData, getNumTopics, getProbEstimator, getSortedWords, getTopicAlphabet, getTopicProbabilities, getTopicProbabilities, getTopWords, initializeFromState, main, modelLogLikelihood, optimizeAlpha, optimizeBeta, printDocumentTopics, printDocumentTopics, printDocumentTopics, printState, printState, printTopicWordWeights, printTopicWordWeights, printTopWords, printTopWords, printTypeTopicCounts, read, setBurninPeriod, setNumIterations, setNumThreads, setOptimizeInterval, setRandomSeed, setSaveSerializedModel, setSaveState, setSymmetricAlpha, setTemperingInterval, setTopicDisplay, sumTypeTopicCounts, temperAlpha, topicPhraseXMLReport, topicXMLReport, write
-
Field Details
-
serialVersionUID
private static final long serialVersionUID- See Also:
- Constant Field Values
-
logger
protected static final org.slf4j.Logger logger -
DEFAULT_INFERENCE_ITERATIONS
public static int DEFAULT_INFERENCE_ITERATIONS -
runnables
protected transient cc.mallet.topics.WorkerRunnable[] runnables -
iteration
protected int iteration -
inferencerVersion
protected transient int inferencerVersion -
inferencer
-
vocabulary
protected org.dice_research.topicmodeling.utils.vocabulary.Vocabulary vocabulary -
inferenceIterations
protected int inferenceIterations -
wordTopicWeights
protected double[][] wordTopicWeights -
topicWeights
protected double[] topicWeights
-
-
Constructor Details
-
MalletLDATopicModeler
public MalletLDATopicModeler(int numberOfTopics, long seed) -
MalletLDATopicModeler
public MalletLDATopicModeler(int numberOfTopics, double alphaSum, double beta, long seed) -
MalletLDATopicModeler
public MalletLDATopicModeler(cc.mallet.types.LabelAlphabet topicAlphabet, double alphaSum, double beta, long seed)
-
-
Method Details
-
initialize
public void initialize(cc.mallet.types.InstanceList instances) -
estimate
public void estimate()- Overrides:
estimatein classcc.mallet.topics.ParallelTopicModel
-
getSmoothedProbabilityOfWord
public double getSmoothedProbabilityOfWord(int wordId, int topicId)- Specified by:
getSmoothedProbabilityOfWordin interfaceorg.dice_research.topicmodeling.algorithms.ProbabilisticWordTopicModel
-
getProbabilityOfWord
public double getProbabilityOfWord(int wordId, int topicId)- Specified by:
getProbabilityOfWordin interfaceorg.dice_research.topicmodeling.algorithms.ProbabilisticWordTopicModel
-
getSmoothedProbabilityOfTopic
public double getSmoothedProbabilityOfTopic(int topicId)- Specified by:
getSmoothedProbabilityOfTopicin interfaceorg.dice_research.topicmodeling.algorithms.ProbabilisticWordTopicModel
-
getNumberOfTopics
public int getNumberOfTopics()- Specified by:
getNumberOfTopicsin interfaceorg.dice_research.topicmodeling.algorithms.ProbabilisticWordTopicModel
-
calculateSmoothedWeights
protected void calculateSmoothedWeights() -
getTopicProbabilitiesForDocument
public double[] getTopicProbabilitiesForDocument(org.dice_research.topicmodeling.utils.doc.DocumentWordCounts wordCounts)- Specified by:
getTopicProbabilitiesForDocumentin interfaceorg.dice_research.topicmodeling.algorithms.ProbabilisticWordTopicModel
-
getVocabulary
public org.dice_research.topicmodeling.utils.vocabulary.Vocabulary getVocabulary()- Specified by:
getVocabularyin interfaceorg.dice_research.topicmodeling.algorithms.VocabularyContaining
-
getInferencer
public cc.mallet.topics.TopicInferencer getInferencer()- Overrides:
getInferencerin classcc.mallet.topics.ParallelTopicModel
-
setVocabularyDecorator
public void setVocabularyDecorator(org.dice_research.topicmodeling.utils.vocabulary.VocabularyDecorator vocabulary) -
setVersion
public void setVersion(int version)- Specified by:
setVersionin interfaceorg.dice_research.topicmodeling.algorithms.Model
-
getVersion
public int getVersion()- Specified by:
getVersionin interfaceorg.dice_research.topicmodeling.algorithms.Model
-
getVocabularyMapping
public org.dice_research.topicmodeling.utils.vocabulary.VocabularyMapping getVocabularyMapping(org.dice_research.topicmodeling.utils.vocabulary.Vocabulary otherVocabulary)- Specified by:
getVocabularyMappingin interfaceorg.dice_research.topicmodeling.algorithms.VocabularyContainingModel
-
getClassificationForDocument
public org.dice_research.topicmodeling.utils.doc.DocumentClassificationResult getClassificationForDocument(org.dice_research.topicmodeling.utils.doc.Document document)- Specified by:
getClassificationForDocumentin interfaceorg.dice_research.topicmodeling.algorithms.ClassificationModel
-
inferTopicAssignmentsForDocument
public int[] inferTopicAssignmentsForDocument(org.dice_research.topicmodeling.utils.doc.Document document)- Specified by:
inferTopicAssignmentsForDocumentin interfaceorg.dice_research.topicmodeling.algorithms.LDAModel
-
inferTopicAssignmentsForDocument
public int[] inferTopicAssignmentsForDocument(org.dice_research.topicmodeling.utils.doc.DocumentWordCounts wordCounts)- Specified by:
inferTopicAssignmentsForDocumentin interfaceorg.dice_research.topicmodeling.algorithms.LDAModel
-
inferTopicAssignmentsForDocument
public int[] inferTopicAssignmentsForDocument(int[] tokens)- Specified by:
inferTopicAssignmentsForDocumentin interfaceorg.dice_research.topicmodeling.algorithms.LDAModel
-
getBeta
public double getBeta()- Specified by:
getBetain interfaceorg.dice_research.topicmodeling.algorithms.LDAModel
-
getAlphas
public double[] getAlphas()- Specified by:
getAlphasin interfaceorg.dice_research.topicmodeling.algorithms.LDAModel
-
setInferenceIterations
public void setInferenceIterations(int inferenceIterations)- Specified by:
setInferenceIterationsin interfaceorg.dice_research.topicmodeling.algorithms.LDAModel
-