public class PatternBasedDocumentTextSplitter<T extends org.dice_research.topicmodeling.utils.doc.StringContainingDocumentProperty> extends AbstractSplittingDocumentSupplierDecorator
| Modifier and Type | Field and Description |
|---|---|
private static org.slf4j.Logger |
LOGGER |
private Class<T> |
propertyClass |
private Constructor<T> |
propertyConstructor |
private String |
splitPattern |
private boolean |
trimSplittedParts |
documentSource, queue| Constructor and Description |
|---|
PatternBasedDocumentTextSplitter(org.dice_research.topicmodeling.preprocessing.docsupplier.DocumentSupplier documentSource,
Class<T> propertyClass,
String splitPattern) |
PatternBasedDocumentTextSplitter(org.dice_research.topicmodeling.preprocessing.docsupplier.DocumentSupplier documentSource,
Class<T> propertyClass,
String splitPattern,
boolean trimSplittedParts) |
| Modifier and Type | Method and Description |
|---|---|
private org.dice_research.topicmodeling.utils.doc.Document |
createDocument(String value,
org.dice_research.topicmodeling.utils.doc.Document document) |
boolean |
isTrimSplittedParts() |
void |
setTrimSplittedParts(boolean trimSplittedParts) |
protected void |
splitDocument(org.dice_research.topicmodeling.utils.doc.Document document)
In this method the splitter should split up the given document and add all new documents to the
AbstractSplittingDocumentSupplierDecorator.queue. |
addToQueue, getDecoratedDocumentSupplier, getNextDocument, setDecoratedDocumentSuppliergetNextDocumentId, setDocumentStartIdprivate static final org.slf4j.Logger LOGGER
private String splitPattern
private Class<T extends org.dice_research.topicmodeling.utils.doc.StringContainingDocumentProperty> propertyClass
private Constructor<T extends org.dice_research.topicmodeling.utils.doc.StringContainingDocumentProperty> propertyConstructor
private boolean trimSplittedParts
public PatternBasedDocumentTextSplitter(org.dice_research.topicmodeling.preprocessing.docsupplier.DocumentSupplier documentSource,
Class<T> propertyClass,
String splitPattern)
public boolean isTrimSplittedParts()
public void setTrimSplittedParts(boolean trimSplittedParts)
protected void splitDocument(org.dice_research.topicmodeling.utils.doc.Document document)
AbstractSplittingDocumentSupplierDecoratorAbstractSplittingDocumentSupplierDecorator.queue.splitDocument in class AbstractSplittingDocumentSupplierDecoratorprivate org.dice_research.topicmodeling.utils.doc.Document createDocument(String value, org.dice_research.topicmodeling.utils.doc.Document document)
Copyright © 2015–2020. All rights reserved.