public class DocumentFrequencyBasedCorpusPreprocessor extends Object implements CorpusPreprocessor
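Filters the corpus by document frequency, removing words outside the bounds given by minDF and maxDF from the
DocumentTextWordIds of the documents and from the
Vocabulary. It is assumed that the corpus has a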
CorpusVocabulary property.

| Constructor and Description |
|---|
| DocumentFrequencyBasedCorpusPreprocessor(int minDF, int maxDF) |

| Modifier and Type | Method and Description |
|---|---|
| static int[] | createWordIdMapping(org.dice_research.topicmodeling.utils.vocabulary.Vocabulary vocabulary, AtomicIntegerArray counts, int minDF, int maxDF) |
| org.dice_research.topicmodeling.utils.corpus.Corpus | preprocess(org.dice_research.topicmodeling.utils.corpus.Corpus corpus) |

public DocumentFrequencyBasedCorpusPreprocessor(int minDF,
int maxDF)
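
The two constructor arguments bound a word's document frequency, that is, the number of documents in which the word occurs at least once. The following is a minimal, self-contained sketch of how such frequencies could be counted. It does not use the library's `Corpus` or `DocumentTextWordIds` types: the class `DocumentFrequencySketch`, the method `countDocumentFrequencies`, and the representation of a document as a plain `int[]` of word ids are all assumptions made for illustration.

```java
import java.util.Arrays;
import java.util.concurrent.atomic.AtomicIntegerArray;

/** Hypothetical sketch; not the library's implementation. */
final class DocumentFrequencySketch {

    /**
     * Counts, for every word id, the number of documents containing it.
     * Each document is assumed to be an array of word ids in [0, vocabularySize).
     */
    static AtomicIntegerArray countDocumentFrequencies(int[][] documents, int vocabularySize) {
        AtomicIntegerArray counts = new AtomicIntegerArray(vocabularySize);
        // Documents are processed in parallel; the atomic array keeps the
        // concurrent increments safe.
        Arrays.stream(documents).parallel().forEach(doc ->
                // distinct() ensures a word is counted once per document.
                Arrays.stream(doc).distinct().forEach(counts::incrementAndGet));
        return counts;
    }

    public static void main(String[] args) {
        int[][] documents = {
            {0, 1, 1},
            {1, 2},
            {1, 3, 0}
        };
        AtomicIntegerArray counts = countDocumentFrequencies(documents, 4);
        System.out.println(counts); // prints [2, 3, 1, 1]
    }
}
```

With bounds such as minDF = 2 and maxDF = 2, only word 0 would fall inside the range in this toy corpus. Whether the real class treats the bounds as inclusive is not stated in this page.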
public org.dice_research.topicmodeling.utils.corpus.Corpus preprocess(org.dice_research.topicmodeling.utils.corpus.Corpus corpus)
Specified by: preprocess in interface CorpusPreprocessor

public static int[] createWordIdMapping(org.dice_research.topicmodeling.utils.vocabulary.Vocabulary vocabulary,
AtomicIntegerArray counts,
int minDF,
int maxDF)
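
The signature suggests that the method turns per-word document counts into an array that maps old word ids to new ones. A simplified sketch of that idea follows. It leaves out the `Vocabulary` argument, and several details are assumptions because this page does not document them: the class name `WordIdMappingSketch`, the use of `-1` to mark removed words, the inclusive treatment of both bounds, and the helper `remapDocument`.

```java
import java.util.Arrays;
import java.util.concurrent.atomic.AtomicIntegerArray;

/** Hypothetical sketch; not the library's implementation. */
final class WordIdMappingSketch {

    /** Marker for removed words (an assumption of this sketch). */
    static final int REMOVED = -1;

    /**
     * Builds a mapping from old word ids to new, densely numbered ids.
     * Words whose document frequency lies outside [minDF, maxDF] map to REMOVED.
     */
    static int[] createWordIdMapping(AtomicIntegerArray counts, int minDF, int maxDF) {
        int[] mapping = new int[counts.length()];
        int nextId = 0;
        for (int oldId = 0; oldId < counts.length(); oldId++) {
            int df = counts.get(oldId);
            // Assumed: both bounds are inclusive.
            mapping[oldId] = (df >= minDF && df <= maxDF) ? nextId++ : REMOVED;
        }
        return mapping;
    }

    /** Rewrites a document's word ids with the mapping and drops removed words. */
    static int[] remapDocument(int[] wordIds, int[] mapping) {
        return Arrays.stream(wordIds)
                .map(id -> mapping[id])
                .filter(id -> id != REMOVED)
                .toArray();
    }

    public static void main(String[] args) {
        AtomicIntegerArray counts = new AtomicIntegerArray(new int[] {1, 3, 5, 2});
        int[] mapping = createWordIdMapping(counts, 2, 4);
        System.out.println(Arrays.toString(mapping)); // prints [-1, 0, -1, 1]
        int[] document = {0, 1, 3, 1, 2};
        System.out.println(Arrays.toString(remapDocument(document, mapping))); // prints [0, 1, 0]
    }
}
```

Assigning new ids densely keeps the reduced vocabulary contiguous, so the documents and the vocabulary can be updated with the same array.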
Copyright © 2015–2020. All rights reserved.