public class NSFTextAndCategoryExtractingSupplierDecorator extends AbstractDocumentSupplierDecorator
| Modifier and Type | Field and Description |
|---|---|
private com.carrotsearch.hppc.ObjectIntOpenHashMap<String> |
categories |
private static String |
CATEGORY_KEY |
private static String |
EMPTY_TEXT |
private static String |
FILTER_CATEGORY |
private static org.slf4j.Logger |
LOGGER |
private static boolean |
REMOVE_DOCUMENTS_WITHOUT_CATEGORY |
private static String |
TEXT_KEY |
documentSource| Constructor and Description |
|---|
NSFTextAndCategoryExtractingSupplierDecorator(org.dice_research.topicmodeling.preprocessing.docsupplier.DocumentSupplier documentSource) |
| Modifier and Type | Method and Description |
|---|---|
private String |
extractCategory(String text) |
private String |
extractText(String text) |
private String[] |
getTrimmedLinesOfValue(String value) |
private String |
getValueForKey(String text,
String key) |
org.dice_research.topicmodeling.utils.doc.Document |
prepareDocument(org.dice_research.topicmodeling.utils.doc.Document document) |
apply, getDecoratedDocumentSupplier, getNextDocument, setDecoratedDocumentSupplier, setDocumentStartIdclone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, waitprivate static final org.slf4j.Logger LOGGER
private static final boolean REMOVE_DOCUMENTS_WITHOUT_CATEGORY
private static final String CATEGORY_KEY
private static final String TEXT_KEY
private static final String EMPTY_TEXT
private static final String FILTER_CATEGORY
private com.carrotsearch.hppc.ObjectIntOpenHashMap<String> categories
public NSFTextAndCategoryExtractingSupplierDecorator(org.dice_research.topicmodeling.preprocessing.docsupplier.DocumentSupplier documentSource)
public org.dice_research.topicmodeling.utils.doc.Document prepareDocument(org.dice_research.topicmodeling.utils.doc.Document document)
prepareDocument in class AbstractDocumentSupplierDecoratorCopyright © 2015–2020. All rights reserved.