Skip navigation links
A B C D E F G H I L M N O P Q R S T U V W 

A

AbstractDocumentPropertyBasedFilter<T extends org.dice_research.topicmodeling.utils.doc.DocumentProperty> - Class in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.filter
 
AbstractDocumentPropertyBasedFilter(Class<T>) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.filter.AbstractDocumentPropertyBasedFilter
 
AbstractDocumentPropertyBasedFilter(Class<T>, boolean) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.filter.AbstractDocumentPropertyBasedFilter
 
AbstractDocumentPropertyMapCreator<T extends org.dice_research.topicmodeling.utils.doc.DocumentProperty> - Class in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator
 
AbstractDocumentPropertyMapCreator(DocumentSupplier, Class<T>) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.AbstractDocumentPropertyMapCreator
 
AbstractDocumentSheduler - Class in org.dice_research.topicmodeling.preprocessing.shedule
 
AbstractDocumentSheduler(DocumentSupplier, int) - Constructor for class org.dice_research.topicmodeling.preprocessing.shedule.AbstractDocumentSheduler
 
AbstractDocumentSheduler.PartialDocumentSupplier - Class in org.dice_research.topicmodeling.preprocessing.shedule
 
AbstractDocumentSupplierDecorator - Class in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator
 
AbstractDocumentSupplierDecorator(DocumentSupplier) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.AbstractDocumentSupplierDecorator
 
AbstractNerPropagationPreprocessor - Class in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner
 
AbstractNerPropagationPreprocessor() - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.AbstractNerPropagationPreprocessor
 
AbstractPreprocessor - Class in org.dice_research.topicmodeling.preprocessing
 
AbstractPreprocessor(DocumentSupplier) - Constructor for class org.dice_research.topicmodeling.preprocessing.AbstractPreprocessor
 
AbstractPreprocessor(DocumentSupplier, Corpus) - Constructor for class org.dice_research.topicmodeling.preprocessing.AbstractPreprocessor
 
AbstractPropertyAppendingDocumentSupplierDecorator<T extends org.dice_research.topicmodeling.utils.doc.DocumentProperty> - Class in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator
 
AbstractPropertyAppendingDocumentSupplierDecorator(DocumentSupplier) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.AbstractPropertyAppendingDocumentSupplierDecorator
 
AbstractPropertyEditingDocumentSupplierDecorator<T extends org.dice_research.topicmodeling.utils.doc.DocumentProperty> - Class in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator
 
AbstractPropertyEditingDocumentSupplierDecorator(DocumentSupplier, Class<T>) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.AbstractPropertyEditingDocumentSupplierDecorator
 
AbstractScaleablePreprocessor - Class in org.dice_research.topicmodeling.preprocessing
 
AbstractScaleablePreprocessor(DocumentSupplier) - Constructor for class org.dice_research.topicmodeling.preprocessing.AbstractScaleablePreprocessor
 
AbstractScaleablePreprocessor(DocumentSupplier, Corpus) - Constructor for class org.dice_research.topicmodeling.preprocessing.AbstractScaleablePreprocessor
 
AbstractSplittingDocumentSupplierDecorator - Class in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.splitter
This is an abstract super class for DocumentSupplierDecorators which are splitting up documents into multiple documents.
AbstractSplittingDocumentSupplierDecorator(DocumentSupplier) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.splitter.AbstractSplittingDocumentSupplierDecorator
 
ACCEPT_DOCUMENT_WITHOUT_PROPERTY - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.filter.AbstractDocumentPropertyBasedFilter
 
addDocuments(DocumentSupplier) - Method in class org.dice_research.topicmodeling.preprocessing.AbstractPreprocessor
 
addDocuments(DocumentSupplier) - Method in class org.dice_research.topicmodeling.preprocessing.CorpusWrappingPreprocessor
 
addDocuments(DocumentSupplier) - Method in class org.dice_research.topicmodeling.preprocessing.SingleDocumentPreprocessor
Deprecated.
addDocumentToCorpus(Corpus, Document) - Method in class org.dice_research.topicmodeling.preprocessing.AbstractPreprocessor
 
addDocumentToCorpus(Corpus, Document) - Method in class org.dice_research.topicmodeling.preprocessing.BagOfWordsCorpusCreator
Deprecated.
 
addDocumentToCorpus(Corpus, Document) - Method in class org.dice_research.topicmodeling.preprocessing.ListCorpusCreator
 
addDocumentToQueue(Document) - Method in class org.dice_research.topicmodeling.preprocessing.shedule.AbstractDocumentSheduler.PartialDocumentSupplier
 
additionalFilter - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.NerPropagatingSupplierDecorator
 
addRemovableProperty(Class<? extends DocumentProperty>) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.PropertyRemovingSupplierDecorator
 
addToQueue(Document) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.splitter.AbstractSplittingDocumentSupplierDecorator
 
AndConcatenatingDocumentFilter - Class in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.filter
 
AndConcatenatingDocumentFilter(DocumentFilter[]) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.filter.AndConcatenatingDocumentFilter
 
apply(Document) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.AbstractDocumentSupplierDecorator
 

B

BagOfWordsCorpusCreator - Class in org.dice_research.topicmodeling.preprocessing
Deprecated.
BagOfWordsCorpusCreator(DocumentSupplier) - Constructor for class org.dice_research.topicmodeling.preprocessing.BagOfWordsCorpusCreator
Deprecated.
 
BagOfWordsCorpusCreator(DocumentSupplier, BagOfWordsCorpus) - Constructor for class org.dice_research.topicmodeling.preprocessing.BagOfWordsCorpusCreator
Deprecated.
 

C

categorieNames - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.filter.CategoryBasedDocumentFilter
 
categories - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.NSFTextAndCategoryExtractingSupplierDecorator
 
categoriesAreGood - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.filter.CategoryBasedDocumentFilter
 
CATEGORY_KEY - Static variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.NSFTextAndCategoryExtractingSupplierDecorator
 
CategoryBasedDocumentFilter - Class in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.filter
 
CategoryBasedDocumentFilter(Collection<String>, boolean) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.filter.CategoryBasedDocumentFilter
 
CategoryBasedDocumentFilter(String[], boolean) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.filter.CategoryBasedDocumentFilter
 
changeCharset(Charset) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.HtmlCharsetExtractingSupplierDecorator.StringWithCharset
 
charAutomaton - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.NewsDeMarkupRemovingSupplierDecorator
 
CHARS_TO_INSERT - Static variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.EntityBasedTokenizer
 
CHARS_TO_REPLACE - Static variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.EntityBasedTokenizer
 
charset - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.HtmlCharsetExtractingSupplierDecorator.StringWithCharset
 
CharsetDeterminingSupplierDecorator - Class in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator
 
CharsetDeterminingSupplierDecorator(DocumentSupplier) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.CharsetDeterminingSupplierDecorator
 
checkEncoding(HtmlCharsetExtractingSupplierDecorator.StringWithCharset) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.HtmlCharsetExtractingSupplierDecorator
 
cleanTag(String, String) - Static method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.WikipediaMarkupDeletingDecorator
Deprecated.
 
cleanTag(String, String, String) - Static method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.WikipediaMarkupDeletingDecorator
Deprecated.
 
cleanTagDeleteContent(String, String, String) - Static method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.WikipediaMarkupDeletingDecorator
Deprecated.
 
cleanTagDeleteContent(String, String, String, String) - Static method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.WikipediaMarkupDeletingDecorator
Deprecated.
 
cleanTagRetainContent(String, String, String) - Static method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.WikipediaMarkupDeletingDecorator
Deprecated.
 
cleanTagRetainContent(String, String, String, String) - Static method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.WikipediaMarkupDeletingDecorator
Deprecated.
 
cleanText(String) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.NewsDeMarkupRemovingSupplierDecorator
 
clearText(String) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.SimpleHtmlCleaner
 
consumeDocument(Document) - Method in class org.dice_research.topicmodeling.preprocessing.consume.DocumentFrequencyDeterminer
 
consumer - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.DocumentConsumerAdaptingSupplierDecorator
 
containsForDeprecatedEncode(String) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.DocumentTextWithTermInfoParsingSupplierDecorator
 
corpus - Variable in class org.dice_research.topicmodeling.preprocessing.AbstractPreprocessor
 
corpusCreated - Variable in class org.dice_research.topicmodeling.preprocessing.AbstractPreprocessor
 
CorpusPreprocessor - Interface in org.dice_research.topicmodeling.preprocessing.corpus
This preprocessor needs the complete corpus to fulfill its task.
CorpusWrappingDocumentSupplier - Class in org.dice_research.topicmodeling.preprocessing.docsupplier
 
CorpusWrappingDocumentSupplier(Corpus) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.CorpusWrappingDocumentSupplier
 
CorpusWrappingDocumentSupplier(Corpus, int) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.CorpusWrappingDocumentSupplier
 
CorpusWrappingPreprocessor - Class in org.dice_research.topicmodeling.preprocessing
 
CorpusWrappingPreprocessor(Corpus) - Constructor for class org.dice_research.topicmodeling.preprocessing.CorpusWrappingPreprocessor
 
CORRECT_SURFACE_FORM_ID - Static variable in interface org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.EntityTokenSurfaceFormMappingSupplier
 
counts - Variable in class org.dice_research.topicmodeling.preprocessing.consume.DocumentFrequencyDeterminer
 
countWordCooccurrences(DocumentTextWordIds) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.SlidingWindowCooccurrenceCounter
 
countWords(int, TermTokenizedText, BagOfWordsCorpus) - Method in class org.dice_research.topicmodeling.preprocessing.BagOfWordsCorpusCreator
Deprecated.
 
createDocument(String, Document) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.splitter.PatternBasedDocumentTextSplitter
 
createDocument(String, int, int, TermTokenizedText, Document) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.splitter.SentenceBasedDocumentTextSplitter
 
createMapping(Vocabulary, BitSet) - Static method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.VocabularyReductionMappingApplyingSupplierDecorator
 
createNextDocuments() - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.splitter.AbstractSplittingDocumentSupplierDecorator
 
createPropertyForDocument(Document) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.AbstractPropertyAppendingDocumentSupplierDecorator
 
createPropertyForDocument(Document) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.DocumentTextWithTermInfoCreatingSupplierDecorator
 
createPropertyForDocument(Document) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.DocumentWordCountingSupplierDecorator
 
createPropertyForDocument(Document) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.WordIndexingSupplierDecorator
 
createStemmedText(TermTokenizedText) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.StemmedTextCreatorSupplierDecorator
 
createTextAndTerms(DocumentTextWithTermInfo, Document) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.DocumentTextWithTermInfoParsingSupplierDecorator
 
createTextWithTermInfo(DocumentText, TermTokenizedText) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.DocumentTextWithTermInfoCreatingSupplierDecorator
 
createWordIdMapping(Vocabulary, AtomicIntegerArray, int, int) - Static method in class org.dice_research.topicmodeling.preprocessing.corpus.DocumentFrequencyBasedCorpusPreprocessor
 
createWordList(File) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.filter.StopwordBasedEntityFilter
 
currentState - Variable in class org.dice_research.topicmodeling.preprocessing.shedule.NFoldCrossValidationSheduler
 

D

DEFAULT_CHARSET - Static variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.DocumentTextCreatingSupplierDecorator
 
DEFAULT_CHARSET - Static variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.HtmlCharsetExtractingSupplierDecorator
 
defaultCharset - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.DocumentTextCreatingSupplierDecorator
 
deleteCorpus() - Method in class org.dice_research.topicmodeling.preprocessing.AbstractPreprocessor
 
deleteCorpus() - Method in class org.dice_research.topicmodeling.preprocessing.CorpusWrappingPreprocessor
 
deleteCorpus() - Method in class org.dice_research.topicmodeling.preprocessing.SingleDocumentPreprocessor
 
deleteTextExceptNEs(DocumentText, NamedEntitiesInText) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.TextDeletingExceptNESurfaceFormsSupplierDecorator
 
DEPRECATED_ENCODING_START - Static variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.DocumentTextWithTermInfoParsingSupplierDecorator
 
detectCharset(byte[]) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.CharsetDeterminingSupplierDecorator
 
DeterministicPercentageDocumentSheduler - Class in org.dice_research.topicmodeling.preprocessing.shedule
 
DeterministicPercentageDocumentSheduler(DocumentSupplier, int) - Constructor for class org.dice_research.topicmodeling.preprocessing.shedule.DeterministicPercentageDocumentSheduler
 
discardDocumentsWithoutNEs - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.filter.NamedEntityFilteringSupplierDecorator
 
document - Variable in class org.dice_research.topicmodeling.preprocessing.SingleDocumentPreprocessor
 
DOCUMENT_PROPERTY_CLASS - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.AbstractDocumentPropertyMapCreator
 
DOCUMENT_PROPERTY_CLASS - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.filter.AbstractDocumentPropertyBasedFilter
 
DocumentCategoryRenamingDocumentSupplierDecorator - Class in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator
 
DocumentCategoryRenamingDocumentSupplierDecorator(DocumentSupplier, String[]) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.DocumentCategoryRenamingDocumentSupplierDecorator
 
DocumentConsumerAdaptingSupplierDecorator - Class in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator
 
DocumentConsumerAdaptingSupplierDecorator(DocumentSupplier, DocumentConsumer) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.DocumentConsumerAdaptingSupplierDecorator
 
DocumentConsumerAdaptingSupplierDecorator(DocumentSupplier, DocumentConsumer, boolean) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.DocumentConsumerAdaptingSupplierDecorator
 
DocumentFilter - Interface in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.filter
 
DocumentFilterBasedSheduler - Class in org.dice_research.topicmodeling.preprocessing.shedule
 
DocumentFilterBasedSheduler(DocumentSupplier, DocumentFilter[]) - Constructor for class org.dice_research.topicmodeling.preprocessing.shedule.DocumentFilterBasedSheduler
 
DocumentFilteringSupplierDecorator - Class in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator
 
DocumentFilteringSupplierDecorator(DocumentSupplier, DocumentFilter) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.DocumentFilteringSupplierDecorator
 
DocumentFrequencyBasedCorpusPreprocessor - Class in org.dice_research.topicmodeling.preprocessing.corpus
A preprocessor that removes words that occur either too often or too rare in the corpus from the DocumentTextWordIds of the documents and from the Vocabulary.
DocumentFrequencyBasedCorpusPreprocessor(int, int) - Constructor for class org.dice_research.topicmodeling.preprocessing.corpus.DocumentFrequencyBasedCorpusPreprocessor
 
DocumentFrequencyDeterminer - Class in org.dice_research.topicmodeling.preprocessing.consume
 
DocumentFrequencyDeterminer(Vocabulary) - Constructor for class org.dice_research.topicmodeling.preprocessing.consume.DocumentFrequencyDeterminer
 
DocumentPropertyPrintingSupplierDecorator<T extends org.dice_research.topicmodeling.utils.doc.DocumentProperty> - Class in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator
 
DocumentPropertyPrintingSupplierDecorator(DocumentSupplier, Class<T>, String) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.DocumentPropertyPrintingSupplierDecorator
 
DocumentSheduler - Interface in org.dice_research.topicmodeling.preprocessing.shedule
 
documentSource - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.AbstractDocumentSupplierDecorator
 
documentSource - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.DocumentFilteringSupplierDecorator
 
documentSource - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.splitter.AbstractSplittingDocumentSupplierDecorator
 
documentSource - Variable in class org.dice_research.topicmodeling.preprocessing.shedule.AbstractDocumentSheduler
 
DocumentSupplierDecorator - Interface in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator
 
DocumentTextCreatingSupplierDecorator - Class in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator
 
DocumentTextCreatingSupplierDecorator(DocumentSupplier) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.DocumentTextCreatingSupplierDecorator
 
DocumentTextCreatingSupplierDecorator(DocumentSupplier, Charset) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.DocumentTextCreatingSupplierDecorator
 
DocumentTextWithTermInfoCreatingSupplierDecorator - Class in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator
 
DocumentTextWithTermInfoCreatingSupplierDecorator(DocumentSupplier) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.DocumentTextWithTermInfoCreatingSupplierDecorator
 
DocumentTextWithTermInfoParsingSupplierDecorator - Class in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator
 
DocumentTextWithTermInfoParsingSupplierDecorator(DocumentSupplier) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.DocumentTextWithTermInfoParsingSupplierDecorator
 
DocumentWordCountingSupplierDecorator - Class in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator
 
DocumentWordCountingSupplierDecorator(DocumentSupplier) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.DocumentWordCountingSupplierDecorator
 

E

editDocumentProperty(T) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.AbstractPropertyEditingDocumentSupplierDecorator
 
editDocumentProperty(DocumentCategory) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.DocumentCategoryRenamingDocumentSupplierDecorator
 
EMPTY_TEXT - Static variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.NSFTextAndCategoryExtractingSupplierDecorator
 
entities - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.EntityTermMapping
 
ENTITY_TOKEN_GONE_THROUGH_POS_TAGGING_ID - Static variable in interface org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.EntityTokenSurfaceFormMappingSupplier
 
EntityBasedTokenizer - Class in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner
 
EntityBasedTokenizer() - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.EntityBasedTokenizer
 
EntityPropagator - Class in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner
 
EntityPropagator() - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.EntityPropagator
 
EntityTermMapping - Class in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner
 
EntityTermMapping() - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.EntityTermMapping
 
EntityTokenReplaceingPostprocessor - Class in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner
 
EntityTokenReplaceingPostprocessor(EntityTokenSurfaceFormMappingSupplier) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.EntityTokenReplaceingPostprocessor
 
EntityTokenSurfaceFormMappingSupplier - Interface in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner
Interface implemented by a NerPropagationPreprocessor that replaces named entities with a single token.
ESCAPE_CHAR - Static variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.DocumentTextWithTermInfoCreatingSupplierDecorator
 
escapeString(String) - Static method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.DocumentTextWithTermInfoCreatingSupplierDecorator
 
extractCategory(String) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.NSFTextAndCategoryExtractingSupplierDecorator
 
extractCharset(String) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.HtmlCharsetExtractingSupplierDecorator
 
extractCharsetFromMetaTag(String, int, int) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.HtmlCharsetExtractingSupplierDecorator
 
extractLowercasedHead(String) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.HtmlCharsetExtractingSupplierDecorator
 
extractText(String) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.NSFTextAndCategoryExtractingSupplierDecorator
 

F

filter - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.DocumentFilteringSupplierDecorator
 
filter - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.filter.NamedEntityFilteringSupplierDecorator
 
filter - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.TermFilteringSupplierDecorator
 
filter - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.TermListCreatingSupplierDecorator
 
FILTER_CATEGORY - Static variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.NSFTextAndCategoryExtractingSupplierDecorator
 
FILTER_TYPE - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.filter.StringContainingDocumentPropertyBasedFilter
 
filterEntities(NamedEntitiesInText, Document) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.filter.NamedEntityFilteringSupplierDecorator
 
filters - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.filter.AndConcatenatingDocumentFilter
 
filters - Variable in class org.dice_research.topicmodeling.preprocessing.shedule.DocumentFilterBasedSheduler
 
filterWords(List<Term>) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.TermFilteringSupplierDecorator
 
foundEntityPair(NamedEntityInText, NamedEntityInText) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.EntityTokenReplaceingPostprocessor
 
foundEntityPair(NamedEntityInText, NamedEntityInText) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.SimpleNerPropagationPostprocessor
 

G

generateCorpus() - Method in class org.dice_research.topicmodeling.preprocessing.AbstractPreprocessor
 
generateCorpus(int) - Method in class org.dice_research.topicmodeling.preprocessing.AbstractScaleablePreprocessor
 
getCharset() - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.HtmlCharsetExtractingSupplierDecorator.StringWithCharset
 
getCorpus() - Method in class org.dice_research.topicmodeling.preprocessing.AbstractPreprocessor
 
getCorpus(int) - Method in class org.dice_research.topicmodeling.preprocessing.AbstractScaleablePreprocessor
 
getCorpus(int) - Method in interface org.dice_research.topicmodeling.preprocessing.ScaleablePreprocessor
 
getCorpus(Document) - Method in class org.dice_research.topicmodeling.preprocessing.SingleDocumentPreprocessor
 
getCorpus() - Method in class org.dice_research.topicmodeling.preprocessing.SingleDocumentPreprocessor
 
getCounts() - Method in class org.dice_research.topicmodeling.preprocessing.consume.DocumentFrequencyDeterminer
 
getDecoratedDocumentSupplier() - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.AbstractDocumentSupplierDecorator
 
getDecoratedDocumentSupplier() - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.DocumentFilteringSupplierDecorator
 
getDecoratedDocumentSupplier() - Method in interface org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.DocumentSupplierDecorator
 
getDecoratedDocumentSupplier() - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.splitter.AbstractSplittingDocumentSupplierDecorator
 
getFilterType() - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.filter.StringContainingDocumentPropertyBasedFilter
 
getId() - Method in class org.dice_research.topicmodeling.preprocessing.shedule.AbstractDocumentSheduler.PartialDocumentSupplier
 
getLastEntityTokenSurfaceFormMapping() - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.EntityBasedTokenizer
 
getLastEntityTokenSurfaceFormMapping() - Method in interface org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.EntityTokenSurfaceFormMappingSupplier
Returns the mapping created since the last call of this method.
getNewCorpus() - Method in class org.dice_research.topicmodeling.preprocessing.AbstractPreprocessor
 
getNewCorpus() - Method in class org.dice_research.topicmodeling.preprocessing.BagOfWordsCorpusCreator
Deprecated.
 
getNewCorpus() - Method in class org.dice_research.topicmodeling.preprocessing.ListCorpusCreator
 
getNextDocument() - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.CorpusWrappingDocumentSupplier
 
getNextDocument() - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.AbstractDocumentPropertyMapCreator
 
getNextDocument() - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.AbstractDocumentSupplierDecorator
 
getNextDocument() - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.DocumentFilteringSupplierDecorator
 
getNextDocument() - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.NerPropagatingSupplierDecorator
 
getNextDocument() - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.splitter.AbstractSplittingDocumentSupplierDecorator
 
getNextDocument() - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.QueueBasedCorpusWrappingDocumentSupplier
 
getNextDocument(int) - Method in class org.dice_research.topicmodeling.preprocessing.shedule.AbstractDocumentSheduler
 
getNextDocument() - Method in class org.dice_research.topicmodeling.preprocessing.shedule.AbstractDocumentSheduler.PartialDocumentSupplier
 
getNextDocument(int) - Method in class org.dice_research.topicmodeling.preprocessing.shedule.DeterministicPercentageDocumentSheduler
 
getNextDocument(int) - Method in class org.dice_research.topicmodeling.preprocessing.shedule.DocumentFilterBasedSheduler
 
getNextDocument(int) - Method in class org.dice_research.topicmodeling.preprocessing.shedule.ListBasedDocumentSheduler
 
getNextDocument(int) - Method in class org.dice_research.topicmodeling.preprocessing.shedule.NFoldCrossValidationSheduler
 
getNextDocument(int) - Method in class org.dice_research.topicmodeling.preprocessing.shedule.RandomDocumentSheduler
 
getNextDocument() - Method in class org.dice_research.topicmodeling.preprocessing.SingleDocumentPreprocessor
 
getNextPartId() - Method in class org.dice_research.topicmodeling.preprocessing.shedule.DeterministicPercentageDocumentSheduler
 
getNextPartId() - Method in class org.dice_research.topicmodeling.preprocessing.shedule.NFoldCrossValidationSheduler
 
getNumberOfParts() - Method in class org.dice_research.topicmodeling.preprocessing.shedule.AbstractDocumentSheduler
 
getNumberOfParts() - Method in interface org.dice_research.topicmodeling.preprocessing.shedule.DocumentSheduler
 
getPartialDocumentSupplier(int) - Method in class org.dice_research.topicmodeling.preprocessing.shedule.AbstractDocumentSheduler
 
getPartialDocumentSupplier(int) - Method in interface org.dice_research.topicmodeling.preprocessing.shedule.DocumentSheduler
 
getPattern() - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.filter.StringContainingDocumentPropertyBasedFilter
 
getProcessedDocument() - Method in class org.dice_research.topicmodeling.preprocessing.SingleDocumentPreprocessor
 
getRemovableProperties() - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.PropertyRemovingSupplierDecorator
 
getString() - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.HtmlCharsetExtractingSupplierDecorator.StringWithCharset
 
getSupplier() - Method in class org.dice_research.topicmodeling.preprocessing.AbstractPreprocessor
 
getTerms() - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.TermListCreatingSupplierDecorator
 
getTextStartIndex(String) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.UseNetTextExtractingSupplierDecorator
 
getTokensAfterPosTagging(NamedEntityInText, String) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.EntityBasedTokenizer
 
getTokensAfterPosTagging(NamedEntityInText, String) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.EntityPropagator
 
getTrimmedLinesOfValue(String) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.NSFTextAndCategoryExtractingSupplierDecorator
 
getValueForKey(String, String) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.NSFTextAndCategoryExtractingSupplierDecorator
 
getVocabulary() - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.WordIndexingSupplierDecorator
 
GZipExtractingSupplierDecorator - Class in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator
 
GZipExtractingSupplierDecorator(DocumentSupplier) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.GZipExtractingSupplierDecorator
 

H

handleHtmlEncodedChar(StringBuilder, String, int, int) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.NewsDeMarkupRemovingSupplierDecorator
 
handleHtmlTag(StringBuilder, String, int, int) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.NewsDeMarkupRemovingSupplierDecorator
 
handleToolTip(StringBuilder, StringBuilder, String, int, int) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.NewsDeMarkupRemovingSupplierDecorator
 
hasCorpus() - Method in class org.dice_research.topicmodeling.preprocessing.AbstractPreprocessor
 
hasCorpus() - Method in class org.dice_research.topicmodeling.preprocessing.CorpusWrappingPreprocessor
 
hasCorpus() - Method in class org.dice_research.topicmodeling.preprocessing.SingleDocumentPreprocessor
 
HtmlCharsetExtractingSupplierDecorator - Class in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator
 
HtmlCharsetExtractingSupplierDecorator(DocumentSupplier) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.HtmlCharsetExtractingSupplierDecorator
 
HtmlCharsetExtractingSupplierDecorator.StringWithCharset - Class in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator
 

I

id - Variable in class org.dice_research.topicmodeling.preprocessing.shedule.AbstractDocumentSheduler.PartialDocumentSupplier
 
identifyTextUsingPgpStatements(String) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.UseNetTextExtractingSupplierDecorator
 
IGNORE_CASE - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.filter.StringContainingDocumentPropertyBasedFilter
 
isDocumentGood(Document) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.filter.AbstractDocumentPropertyBasedFilter
 
isDocumentGood(Document) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.filter.AndConcatenatingDocumentFilter
 
isDocumentGood(Document) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.filter.CategoryBasedDocumentFilter
 
isDocumentGood(Document) - Method in interface org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.filter.DocumentFilter
 
isDocumentGood(Document) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.filter.NamedEntitiesInTextBasedDocumentFilter
 
isDocumentPropertyGood(T) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.filter.AbstractDocumentPropertyBasedFilter
 
isDocumentPropertyGood(T) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.filter.StringContainingDocumentPropertyBasedFilter
 
isIgnoreCase() - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.filter.StringContainingDocumentPropertyBasedFilter
 
isNamedEntityGood(Document, NamedEntityInText) - Method in interface org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.filter.NamedEntitiesFilter
 
isNamedEntityGood(Document, NamedEntityInText) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.filter.StopwordBasedEntityFilter
 
isSpaceOrDash(char) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.AbstractNerPropagationPreprocessor
Deprecated.
isSynchronized - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.DocumentConsumerAdaptingSupplierDecorator
 
isTermGood(Term) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.NerPropagatingSupplierDecorator
 
isTrimSplittedParts() - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.splitter.PatternBasedDocumentTextSplitter
 

L

lastPartThatGotDocument - Variable in class org.dice_research.topicmodeling.preprocessing.shedule.DeterministicPercentageDocumentSheduler
 
lastSetBit(BitSet, int) - Static method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.VocabularyReductionMappingApplyingSupplierDecorator
Returns the last 1 bit in the given bit set before the given position (excluding) or -1 if there is no such bit.
ListBasedDocumentSheduler - Class in org.dice_research.topicmodeling.preprocessing.shedule
This simple sheduler can shedule the documents based on the values of a single StringContainingDocumentProperty.
ListBasedDocumentSheduler(DocumentSupplier, Class<? extends StringContainingDocumentProperty>, Set<String>[]) - Constructor for class org.dice_research.topicmodeling.preprocessing.shedule.ListBasedDocumentSheduler
 
ListCorpusCreator<T extends List<org.dice_research.topicmodeling.utils.doc.Document>> - Class in org.dice_research.topicmodeling.preprocessing
 
ListCorpusCreator(DocumentSupplier, DocumentListCorpus<T>) - Constructor for class org.dice_research.topicmodeling.preprocessing.ListCorpusCreator
 
listOfParts - Variable in class org.dice_research.topicmodeling.preprocessing.shedule.AbstractDocumentSheduler
 
listTerms(List<Term>) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.TermListCreatingSupplierDecorator
 
localEntityTokenMapping - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.EntityTokenReplaceingPostprocessor
 
LOGGER - Static variable in class org.dice_research.topicmodeling.preprocessing.AbstractPreprocessor
 
LOGGER - Static variable in class org.dice_research.topicmodeling.preprocessing.AbstractScaleablePreprocessor
 
LOGGER - Static variable in class org.dice_research.topicmodeling.preprocessing.consume.DocumentFrequencyDeterminer
 
logger - Static variable in class org.dice_research.topicmodeling.preprocessing.CorpusWrappingPreprocessor
 
LOGGER - Static variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.AbstractPropertyAppendingDocumentSupplierDecorator
 
LOGGER - Static variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.AbstractPropertyEditingDocumentSupplierDecorator
 
LOGGER - Static variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.CharsetDeterminingSupplierDecorator
 
LOGGER - Static variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.DocumentPropertyPrintingSupplierDecorator
 
LOGGER - Static variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.DocumentTextWithTermInfoCreatingSupplierDecorator
 
LOGGER - Static variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.DocumentTextWithTermInfoParsingSupplierDecorator
 
LOGGER - Static variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.DocumentWordCountingSupplierDecorator
 
logger - Static variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.filter.CategoryBasedDocumentFilter
 
LOGGER - Static variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.GZipExtractingSupplierDecorator
 
LOGGER - Static variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.HtmlCharsetExtractingSupplierDecorator
 
LOGGER - Static variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.filter.StopwordBasedEntityFilter
 
LOGGER - Static variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.NerPropagatingSupplierDecorator
 
LOGGER - Static variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.SimpleNerPropagationPostprocessor
 
LOGGER - Static variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.NSFTextAndCategoryExtractingSupplierDecorator
 
LOGGER - Static variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.SlidingWindowCooccurrenceCounter
 
LOGGER - Static variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.splitter.PatternBasedDocumentTextSplitter
 
LOGGER - Static variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.splitter.SentenceBasedDocumentTextSplitter
 
LOGGER - Static variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.TextDeletingExceptNESurfaceFormsSupplierDecorator
 
LOGGER - Static variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.WikipediaMarkupDeletingDecorator
Deprecated.
 
LOGGER - Static variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.WordIndexingSupplierDecorator
 
logger - Static variable in class org.dice_research.topicmodeling.preprocessing.shedule.DeterministicPercentageDocumentSheduler
 
LOGGER - Static variable in class org.dice_research.topicmodeling.preprocessing.shedule.ListBasedDocumentSheduler
 
LOGGER - Static variable in class org.dice_research.topicmodeling.preprocessing.shedule.NFoldCrossValidationSheduler
 
logger - Static variable in class org.dice_research.topicmodeling.preprocessing.shedule.RandomDocumentSheduler
 
LOGGER - Static variable in class org.dice_research.topicmodeling.preprocessing.SingleDocumentPreprocessor
 

M

mapCreated(IntObjectOpenHashMap<T>) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.AbstractDocumentPropertyMapCreator
 
mapCreated(IntObjectOpenHashMap<T>) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.DocumentPropertyPrintingSupplierDecorator
 
mapping - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.VocabularyReductionMappingApplyingSupplierDecorator
 
maxDF - Variable in class org.dice_research.topicmodeling.preprocessing.corpus.DocumentFrequencyBasedCorpusPreprocessor
 
minDF - Variable in class org.dice_research.topicmodeling.preprocessing.corpus.DocumentFrequencyBasedCorpusPreprocessor
 
MINIMUM_LENGTH_OF_NE - Static variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.filter.StopwordBasedEntityFilter
 

N

NamedEntitiesFilter - Interface in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.filter
 
NamedEntitiesInTextBasedDocumentFilter - Class in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.filter
 
NamedEntitiesInTextBasedDocumentFilter() - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.filter.NamedEntitiesInTextBasedDocumentFilter
 
NamedEntityFilteringSupplierDecorator - Class in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.filter
 
NamedEntityFilteringSupplierDecorator(DocumentSupplier, NamedEntitiesFilter) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.filter.NamedEntityFilteringSupplierDecorator
 
NamedEntityFilteringSupplierDecorator(DocumentSupplier, NamedEntitiesFilter, boolean) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.filter.NamedEntityFilteringSupplierDecorator
 
NEGATIVE_CATEGORY_NAME - Static variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.DocumentCategoryRenamingDocumentSupplierDecorator
 
NerPropagatingSupplierDecorator - Class in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner
 
NerPropagatingSupplierDecorator(DocumentSupplier, PosTagger) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.NerPropagatingSupplierDecorator
 
NerPropagatingSupplierDecorator(DocumentSupplier, PosTagger, NerPropagationPreprocessor) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.NerPropagatingSupplierDecorator
 
NerPropagatingSupplierDecorator(DocumentSupplier, PosTagger, NerPropagationPreprocessor, NerPropagationPostprocessor) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.NerPropagatingSupplierDecorator
 
NerPropagationPostprocessor - Interface in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner
 
NerPropagationPreprocessor - Interface in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner
 
NewsDeMarkupRemovingSupplierDecorator - Class in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator
 
NewsDeMarkupRemovingSupplierDecorator(DocumentSupplier) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.NewsDeMarkupRemovingSupplierDecorator
 
nextDocumentId - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.CorpusWrappingDocumentSupplier
 
NFoldCrossValidationSheduler - Class in org.dice_research.topicmodeling.preprocessing.shedule
 
NFoldCrossValidationSheduler(DocumentSupplier, int, int) - Constructor for class org.dice_research.topicmodeling.preprocessing.shedule.NFoldCrossValidationSheduler
 
NSFTextAndCategoryExtractingSupplierDecorator - Class in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator
 
NSFTextAndCategoryExtractingSupplierDecorator(DocumentSupplier) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.NSFTextAndCategoryExtractingSupplierDecorator
 
numberOfFolds - Variable in class org.dice_research.topicmodeling.preprocessing.shedule.NFoldCrossValidationSheduler
 

O

org.dice_research.topicmodeling.preprocessing - package org.dice_research.topicmodeling.preprocessing
 
org.dice_research.topicmodeling.preprocessing.consume - package org.dice_research.topicmodeling.preprocessing.consume
 
org.dice_research.topicmodeling.preprocessing.corpus - package org.dice_research.topicmodeling.preprocessing.corpus
 
org.dice_research.topicmodeling.preprocessing.docsupplier - package org.dice_research.topicmodeling.preprocessing.docsupplier
 
org.dice_research.topicmodeling.preprocessing.docsupplier.decorator - package org.dice_research.topicmodeling.preprocessing.docsupplier.decorator
 
org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.filter - package org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.filter
 
org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner - package org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner
 
org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.filter - package org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.filter
 
org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.splitter - package org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.splitter
 
org.dice_research.topicmodeling.preprocessing.shedule - package org.dice_research.topicmodeling.preprocessing.shedule
 
OUTPUT_FILE - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.DocumentPropertyPrintingSupplierDecorator
 

P

PartialDocumentSupplier(int) - Constructor for class org.dice_research.topicmodeling.preprocessing.shedule.AbstractDocumentSheduler.PartialDocumentSupplier
 
pattern - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.filter.StringContainingDocumentPropertyBasedFilter
 
PatternBasedDocumentTextSplitter<T extends org.dice_research.topicmodeling.utils.doc.StringContainingDocumentProperty> - Class in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.splitter
 
PatternBasedDocumentTextSplitter(DocumentSupplier, Class<T>, String) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.splitter.PatternBasedDocumentTextSplitter
 
PatternBasedDocumentTextSplitter(DocumentSupplier, Class<T>, String, boolean) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.splitter.PatternBasedDocumentTextSplitter
 
percentages - Variable in class org.dice_research.topicmodeling.preprocessing.shedule.DeterministicPercentageDocumentSheduler
 
PGP_MESSAGE_END - Static variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.UseNetTextExtractingSupplierDecorator
 
PGP_MESSAGE_START - Static variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.UseNetTextExtractingSupplierDecorator
 
portions - Variable in class org.dice_research.topicmodeling.preprocessing.shedule.RandomDocumentSheduler
 
POSITIVE_CATEGORY_NAME - Static variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.DocumentCategoryRenamingDocumentSupplierDecorator
 
positiveNames - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.DocumentCategoryRenamingDocumentSupplierDecorator
 
postagger - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.PosTaggingSupplierDecorator
 
PosTaggingSupplierDecorator - Class in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator
 
PosTaggingSupplierDecorator(DocumentSupplier, PosTagger) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.PosTaggingSupplierDecorator
 
postprocessNamedEntities(NamedEntitiesInText, DocumentText, TermTokenizedText) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.EntityTokenReplaceingPostprocessor
 
postprocessNamedEntities(NamedEntitiesInText, DocumentText, TermTokenizedText) - Method in interface org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.NerPropagationPostprocessor
 
postprocessNamedEntities(NamedEntitiesInText, DocumentText, TermTokenizedText) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.SimpleNerPropagationPostprocessor
 
postprocessor - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.NerPropagatingSupplierDecorator
 
prepareDocument(Document) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.AbstractDocumentPropertyMapCreator
 
prepareDocument(Document) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.AbstractDocumentSupplierDecorator
 
prepareDocument(Document) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.AbstractPropertyAppendingDocumentSupplierDecorator
 
prepareDocument(Document) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.AbstractPropertyEditingDocumentSupplierDecorator
 
prepareDocument(Document) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.CharsetDeterminingSupplierDecorator
 
prepareDocument(Document) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.DocumentConsumerAdaptingSupplierDecorator
 
prepareDocument(Document) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.DocumentTextCreatingSupplierDecorator
 
prepareDocument(Document) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.DocumentTextWithTermInfoParsingSupplierDecorator
 
prepareDocument(Document) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.GZipExtractingSupplierDecorator
 
prepareDocument(Document) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.HtmlCharsetExtractingSupplierDecorator
 
prepareDocument(Document) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.filter.NamedEntityFilteringSupplierDecorator
 
prepareDocument(Document) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.NewsDeMarkupRemovingSupplierDecorator
 
prepareDocument(Document) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.NSFTextAndCategoryExtractingSupplierDecorator
 
prepareDocument(Document) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.PosTaggingSupplierDecorator
 
prepareDocument(Document) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.PropertyRemovingSupplierDecorator
 
prepareDocument(Document) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.SimpleHtmlCleaner
 
prepareDocument(Document) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.SlidingWindowCooccurrenceCounter
 
prepareDocument(Document) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.StemmedTextCreatorSupplierDecorator
 
prepareDocument(Document) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.TermFilteringSupplierDecorator
 
prepareDocument(Document) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.TermListCreatingSupplierDecorator
 
prepareDocument(Document) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.TextDeletingExceptNESurfaceFormsSupplierDecorator
 
prepareDocument(Document) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.UseNetTextExtractingSupplierDecorator
 
prepareDocument(Document) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.VocabularyReductionMappingApplyingSupplierDecorator
 
prepareDocument(Document) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.WikipediaMarkupDeletingDecorator
Deprecated.
 
preprocEntityTokenMapping - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.EntityTokenReplaceingPostprocessor
 
preprocess(Corpus) - Method in interface org.dice_research.topicmodeling.preprocessing.corpus.CorpusPreprocessor
 
preprocess(Corpus) - Method in class org.dice_research.topicmodeling.preprocessing.corpus.DocumentFrequencyBasedCorpusPreprocessor
 
preprocessNamedEntities(DocumentText, NamedEntitiesInText) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.AbstractNerPropagationPreprocessor
 
preprocessNamedEntities(DocumentText, NamedEntitiesInText) - Method in interface org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.NerPropagationPreprocessor
 
preprocessor - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.EntityTokenReplaceingPostprocessor
 
preprocessor - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.NerPropagatingSupplierDecorator
 
processDocument(Document, Corpus) - Method in class org.dice_research.topicmodeling.preprocessing.ListCorpusCreator
 
processDocument(Document) - Method in class org.dice_research.topicmodeling.preprocessing.SingleDocumentPreprocessor
 
processEntity(NamedEntityInText, String, Set<String>) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.AbstractNerPropagationPreprocessor
 
processEntity(NamedEntityInText, String, Set<String>) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.EntityBasedTokenizer
 
processEntity(NamedEntityInText, String, Set<String>) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.SimpleNerPropagationPreprocessor
 
processEntity(NamedEntityInText, String, Set<String>) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.WordSensePropagator
 
properties - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.AbstractDocumentPropertyMapCreator
 
propertyClass - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.AbstractPropertyEditingDocumentSupplierDecorator
 
propertyClass - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.splitter.PatternBasedDocumentTextSplitter
 
propertyClass - Variable in class org.dice_research.topicmodeling.preprocessing.shedule.ListBasedDocumentSheduler
 
propertyConstructor - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.splitter.PatternBasedDocumentTextSplitter
 
PropertyRemovingSupplierDecorator - Class in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator
 
PropertyRemovingSupplierDecorator(DocumentSupplier) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.PropertyRemovingSupplierDecorator
 
PropertyRemovingSupplierDecorator(DocumentSupplier, Class<? extends DocumentProperty>) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.PropertyRemovingSupplierDecorator
 
PropertyRemovingSupplierDecorator(DocumentSupplier, Collection<? extends Class<? extends DocumentProperty>>) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.PropertyRemovingSupplierDecorator
 
propertyValuesTestPartition - Variable in class org.dice_research.topicmodeling.preprocessing.shedule.ListBasedDocumentSheduler
 

Q

queue - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.splitter.AbstractSplittingDocumentSupplierDecorator
 
queue - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.QueueBasedCorpusWrappingDocumentSupplier
 
queue - Variable in class org.dice_research.topicmodeling.preprocessing.shedule.AbstractDocumentSheduler.PartialDocumentSupplier
 
QueueBasedCorpusWrappingDocumentSupplier - Class in org.dice_research.topicmodeling.preprocessing.docsupplier
 
QueueBasedCorpusWrappingDocumentSupplier(Corpus) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.QueueBasedCorpusWrappingDocumentSupplier
 
QueueBasedCorpusWrappingDocumentSupplier(Corpus, int) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.QueueBasedCorpusWrappingDocumentSupplier
 
QueueBasedCorpusWrappingDocumentSupplier(Corpus, int, int) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.QueueBasedCorpusWrappingDocumentSupplier
 

R

random - Variable in class org.dice_research.topicmodeling.preprocessing.shedule.RandomDocumentSheduler
 
RandomDocumentSheduler - Class in org.dice_research.topicmodeling.preprocessing.shedule
 
RandomDocumentSheduler(DocumentSupplier, int) - Constructor for class org.dice_research.topicmodeling.preprocessing.shedule.RandomDocumentSheduler
 
RandomDocumentSheduler(DocumentSupplier, int, long) - Constructor for class org.dice_research.topicmodeling.preprocessing.shedule.RandomDocumentSheduler
 
remainingPercentages - Variable in class org.dice_research.topicmodeling.preprocessing.shedule.DeterministicPercentageDocumentSheduler
 
removableProperties - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.PropertyRemovingSupplierDecorator
 
REMOVE_DOCUMENTS_WITHOUT_CATEGORY - Static variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.NSFTextAndCategoryExtractingSupplierDecorator
 
removeAddresses(String) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.UseNetTextExtractingSupplierDecorator
 
removeAuthorLine(String, int) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.UseNetTextExtractingSupplierDecorator
 
REMOVED_WORD - Static variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.VocabularyReductionMappingApplyingSupplierDecorator
 
removeWikiMarkup(String) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.WikipediaMarkupDeletingDecorator
Deprecated.
 

S

ScaleablePreprocessor - Interface in org.dice_research.topicmodeling.preprocessing
 
SentenceBasedDocumentTextSplitter - Class in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.splitter
 
SentenceBasedDocumentTextSplitter(DocumentSupplier, SentenceDetectorME) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.splitter.SentenceBasedDocumentTextSplitter
 
sentenceDetector - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.splitter.SentenceBasedDocumentTextSplitter
 
SEPARATION_CHAR - Static variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.DocumentTextWithTermInfoCreatingSupplierDecorator
 
serialVersionUID - Static variable in class org.dice_research.topicmodeling.preprocessing.CorpusWrappingPreprocessor
 
serialVersionUID - Static variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.CorpusWrappingDocumentSupplier
 
setDecoratedDocumentSupplier(DocumentSupplier) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.AbstractDocumentSupplierDecorator
 
setDecoratedDocumentSupplier(DocumentSupplier) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.DocumentFilteringSupplierDecorator
 
setDecoratedDocumentSupplier(DocumentSupplier) - Method in interface org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.DocumentSupplierDecorator
 
setDecoratedDocumentSupplier(DocumentSupplier) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.splitter.AbstractSplittingDocumentSupplierDecorator
 
setDocument(Document) - Method in class org.dice_research.topicmodeling.preprocessing.SingleDocumentPreprocessor
 
setDocumentStartId(int) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.CorpusWrappingDocumentSupplier
 
setDocumentStartId(int) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.AbstractDocumentSupplierDecorator
 
setDocumentStartId(int) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.DocumentFilteringSupplierDecorator
 
setDocumentStartId(int) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.QueueBasedCorpusWrappingDocumentSupplier
 
setDocumentSupplier(DocumentSupplier) - Method in class org.dice_research.topicmodeling.preprocessing.SingleDocumentPreprocessor
 
setPattern(String) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.filter.StringContainingDocumentPropertyBasedFilter
 
setPercentageOfPart(int, int) - Method in class org.dice_research.topicmodeling.preprocessing.shedule.DeterministicPercentageDocumentSheduler
 
setPortionOfPart(int, double) - Method in class org.dice_research.topicmodeling.preprocessing.shedule.RandomDocumentSheduler
 
setPosTaggerFilter(PosTaggingTermFilter) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.NerPropagatingSupplierDecorator
 
setPosTaggerFilter(PosTaggingTermFilter) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.PosTaggingSupplierDecorator
 
setRemovableProperties(ArrayList<Class<? extends DocumentProperty>>) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.PropertyRemovingSupplierDecorator
 
setString(String) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.HtmlCharsetExtractingSupplierDecorator.StringWithCharset
 
setTrimSplittedParts(boolean) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.splitter.PatternBasedDocumentTextSplitter
 
SimpleHtmlCleaner - Class in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator
 
SimpleHtmlCleaner(DocumentSupplier) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.SimpleHtmlCleaner
 
SimpleHtmlCleaner.States - Enum in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator
 
SimpleNerPropagationPostprocessor - Class in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner
 
SimpleNerPropagationPostprocessor() - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.SimpleNerPropagationPostprocessor
 
SimpleNerPropagationPreprocessor - Class in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner
 
SimpleNerPropagationPreprocessor() - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.SimpleNerPropagationPreprocessor
 
SingleDocumentPreprocessor - Class in org.dice_research.topicmodeling.preprocessing
 
SingleDocumentPreprocessor() - Constructor for class org.dice_research.topicmodeling.preprocessing.SingleDocumentPreprocessor
 
SingleDocumentPreprocessor(boolean) - Constructor for class org.dice_research.topicmodeling.preprocessing.SingleDocumentPreprocessor
 
SlidingWindowCooccurrenceCounter - Class in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator
 
SlidingWindowCooccurrenceCounter(DocumentSupplier, int) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.SlidingWindowCooccurrenceCounter
 
SPACE - Static variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.AbstractNerPropagationPreprocessor
 
splitDocument(Document) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.splitter.AbstractSplittingDocumentSupplierDecorator
In this method the splitter should split up the given document and add all new documents to the AbstractSplittingDocumentSupplierDecorator.queue.
splitDocument(Document) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.splitter.PatternBasedDocumentTextSplitter
 
splitDocument(Document) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.splitter.SentenceBasedDocumentTextSplitter
 
splitDocument(Document, String, List<Term>) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.splitter.SentenceBasedDocumentTextSplitter
 
splitDocument(Document, String) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.splitter.SentenceBasedDocumentTextSplitter
 
splitPattern - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.splitter.PatternBasedDocumentTextSplitter
 
States() - Constructor for enum org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.SimpleHtmlCleaner.States
 
StemmedTextCreatorSupplierDecorator - Class in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator
 
StemmedTextCreatorSupplierDecorator(DocumentSupplier) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.StemmedTextCreatorSupplierDecorator
 
StopwordBasedEntityFilter - Class in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.filter
 
StopwordBasedEntityFilter() - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.filter.StopwordBasedEntityFilter
 
stopwordlist - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.filter.StopwordBasedEntityFilter
 
string - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.HtmlCharsetExtractingSupplierDecorator.StringWithCharset
 
StringContainingDocumentPropertyBasedFilter<T extends org.dice_research.topicmodeling.utils.doc.StringContainingDocumentProperty> - Class in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.filter
 
StringContainingDocumentPropertyBasedFilter(StringContainingDocumentPropertyBasedFilter.StringContainingDocumentPropertyBasedFilterType, Class<T>, String) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.filter.StringContainingDocumentPropertyBasedFilter
 
StringContainingDocumentPropertyBasedFilter(StringContainingDocumentPropertyBasedFilter.StringContainingDocumentPropertyBasedFilterType, Class<T>, String, boolean) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.filter.StringContainingDocumentPropertyBasedFilter
 
StringContainingDocumentPropertyBasedFilter.StringContainingDocumentPropertyBasedFilterType - Enum in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.filter
 
StringContainingDocumentPropertyBasedFilterType() - Constructor for enum org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.filter.StringContainingDocumentPropertyBasedFilter.StringContainingDocumentPropertyBasedFilterType
 
StringWithCharset(String, Charset) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.HtmlCharsetExtractingSupplierDecorator.StringWithCharset
 
supplier - Variable in class org.dice_research.topicmodeling.preprocessing.AbstractPreprocessor
 
supplier - Variable in class org.dice_research.topicmodeling.preprocessing.SingleDocumentPreprocessor
 
surfaceFormsMapping - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.EntityBasedTokenizer
 

T

tagAutomaton - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.NewsDeMarkupRemovingSupplierDecorator
 
TERM_END_CHAR - Static variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.DocumentTextWithTermInfoCreatingSupplierDecorator
 
TERM_START_CHAR - Static variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.DocumentTextWithTermInfoCreatingSupplierDecorator
 
TermFilteringSupplierDecorator - Class in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator
 
TermFilteringSupplierDecorator(DocumentSupplier, PosTaggingTermFilter) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.TermFilteringSupplierDecorator
 
TermListCreatingSupplierDecorator - Class in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator
 
TermListCreatingSupplierDecorator(DocumentSupplier, PosTaggingTermFilter) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.TermListCreatingSupplierDecorator
 
terms - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.EntityTermMapping
 
terms - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.TermListCreatingSupplierDecorator
 
test(Document) - Method in interface org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.filter.DocumentFilter
 
TEST_DOCUMENTS_SUPPLIER_ID - Static variable in class org.dice_research.topicmodeling.preprocessing.shedule.NFoldCrossValidationSheduler
 
TEXT_KEY - Static variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.NSFTextAndCategoryExtractingSupplierDecorator
 
TextDeletingExceptNESurfaceFormsSupplierDecorator - Class in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator
 
TextDeletingExceptNESurfaceFormsSupplierDecorator(DocumentSupplier) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.TextDeletingExceptNESurfaceFormsSupplierDecorator
 
tokens - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.NerPropagatingSupplierDecorator
 
tooltipAutomaton - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.NewsDeMarkupRemovingSupplierDecorator
 
TRAIN_DOCUMENTS_SUPPLIER_ID - Static variable in class org.dice_research.topicmodeling.preprocessing.shedule.NFoldCrossValidationSheduler
 
trimSplittedParts - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.splitter.PatternBasedDocumentTextSplitter
 
trimText(String) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.AbstractNerPropagationPreprocessor
 

U

unescapeString(String) - Static method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.DocumentTextWithTermInfoParsingSupplierDecorator
 
unescapeSymbols(String) - Static method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.WikipediaMarkupDeletingDecorator
Deprecated.
 
updateVocabulary(Vocabulary, int[]) - Static method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.VocabularyReductionMappingApplyingSupplierDecorator
 
updateWordCounts(DocumentWordCounts) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.VocabularyReductionMappingApplyingSupplierDecorator
 
updateWordIds(DocumentTextWordIds) - Method in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.VocabularyReductionMappingApplyingSupplierDecorator
 
USE_NET_HEADING_KEYS - Static variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.UseNetTextExtractingSupplierDecorator
 
useConsecutiveNumbering - Variable in class org.dice_research.topicmodeling.preprocessing.SingleDocumentPreprocessor
 
UseNetTextExtractingSupplierDecorator - Class in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator
 
UseNetTextExtractingSupplierDecorator(DocumentSupplier) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.UseNetTextExtractingSupplierDecorator
 

V

valueOf(String) - Static method in enum org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.filter.StringContainingDocumentPropertyBasedFilter.StringContainingDocumentPropertyBasedFilterType
Returns the enum constant of this type with the specified name.
valueOf(String) - Static method in enum org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.SimpleHtmlCleaner.States
Returns the enum constant of this type with the specified name.
values() - Static method in enum org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.filter.StringContainingDocumentPropertyBasedFilter.StringContainingDocumentPropertyBasedFilterType
Returns an array containing the constants of this enum type, in the order they are declared.
values() - Static method in enum org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.SimpleHtmlCleaner.States
Returns an array containing the constants of this enum type, in the order they are declared.
vocabulary - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.WordIndexingSupplierDecorator
 
VocabularyReductionMappingApplyingSupplierDecorator - Class in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator
 
VocabularyReductionMappingApplyingSupplierDecorator(DocumentSupplier, int[]) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.VocabularyReductionMappingApplyingSupplierDecorator
 

W

WikipediaMarkupDeletingDecorator - Class in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator
Deprecated.
WikipediaMarkupDeletingDecorator(DocumentSupplier) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.WikipediaMarkupDeletingDecorator
Deprecated.
 
windowSize - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.SlidingWindowCooccurrenceCounter
 
word - Variable in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.WordSensePropagator
 
WordIndexingSupplierDecorator - Class in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator
 
WordIndexingSupplierDecorator(DocumentSupplier, Vocabulary) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.WordIndexingSupplierDecorator
 
WordSensePropagator - Class in org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner
 
WordSensePropagator(String) - Constructor for class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.ner.WordSensePropagator
 
A B C D E F G H I L M N O P Q R S T U V W 
Skip navigation links

Copyright © 2015–2020. All rights reserved.