public class VocabularyReductionMappingApplyingSupplierDecorator extends AbstractDocumentSupplierDecorator
| Modifier and Type | Field and Description |
|---|---|
private int[] |
mapping |
static int |
REMOVED_WORD |
documentSource| Constructor and Description |
|---|
VocabularyReductionMappingApplyingSupplierDecorator(org.dice_research.topicmodeling.preprocessing.docsupplier.DocumentSupplier documentSource,
int[] mapping) |
| Modifier and Type | Method and Description |
|---|---|
static int[] |
createMapping(org.dice_research.topicmodeling.utils.vocabulary.Vocabulary vocabulary,
com.carrotsearch.hppc.BitSet keptWords) |
protected static int |
lastSetBit(com.carrotsearch.hppc.BitSet bits,
int from)
Returns the last
1 bit in the given bit set before the given position
(excluding) or -1 if there is no such bit. |
protected org.dice_research.topicmodeling.utils.doc.Document |
prepareDocument(org.dice_research.topicmodeling.utils.doc.Document document) |
static org.dice_research.topicmodeling.utils.vocabulary.Vocabulary |
updateVocabulary(org.dice_research.topicmodeling.utils.vocabulary.Vocabulary vocabulary,
int[] mapping) |
protected org.dice_research.topicmodeling.utils.doc.DocumentWordCounts |
updateWordCounts(org.dice_research.topicmodeling.utils.doc.DocumentWordCounts counts) |
protected org.dice_research.topicmodeling.utils.doc.DocumentTextWordIds |
updateWordIds(org.dice_research.topicmodeling.utils.doc.DocumentTextWordIds wordIds) |
apply, getDecoratedDocumentSupplier, getNextDocument, setDecoratedDocumentSupplier, setDocumentStartIdclone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, waitpublic static final int REMOVED_WORD
private int[] mapping
public VocabularyReductionMappingApplyingSupplierDecorator(org.dice_research.topicmodeling.preprocessing.docsupplier.DocumentSupplier documentSource,
int[] mapping)
public static int[] createMapping(org.dice_research.topicmodeling.utils.vocabulary.Vocabulary vocabulary,
com.carrotsearch.hppc.BitSet keptWords)
public static org.dice_research.topicmodeling.utils.vocabulary.Vocabulary updateVocabulary(org.dice_research.topicmodeling.utils.vocabulary.Vocabulary vocabulary,
int[] mapping)
protected static int lastSetBit(com.carrotsearch.hppc.BitSet bits,
int from)
1 bit in the given bit set before the given position
(excluding) or -1 if there is no such bit.bits - the bit set in which should be searchedfrom - the position (excluding) from which the algorithm should start to
search backwards-1 if there is no such bitprotected org.dice_research.topicmodeling.utils.doc.Document prepareDocument(org.dice_research.topicmodeling.utils.doc.Document document)
prepareDocument in class AbstractDocumentSupplierDecoratorprotected org.dice_research.topicmodeling.utils.doc.DocumentTextWordIds updateWordIds(org.dice_research.topicmodeling.utils.doc.DocumentTextWordIds wordIds)
protected org.dice_research.topicmodeling.utils.doc.DocumentWordCounts updateWordCounts(org.dice_research.topicmodeling.utils.doc.DocumentWordCounts counts)
Copyright © 2015–2020. All rights reserved.