public class WikipediaMarkupDeletingDocumentSupplierDecorator
extends org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.AbstractDocumentSupplierDecorator
| Modifier and Type | Field and Description |
|---|---|
private boolean |
keepTableContents |
private static org.slf4j.Logger |
LOGGER |
private boolean |
removeCategoryLinks |
| Constructor and Description |
|---|
WikipediaMarkupDeletingDocumentSupplierDecorator(org.dice_research.topicmodeling.preprocessing.docsupplier.DocumentSupplier documentSource) |
WikipediaMarkupDeletingDocumentSupplierDecorator(org.dice_research.topicmodeling.preprocessing.docsupplier.DocumentSupplier documentSource,
boolean removeCategoryLinks) |
WikipediaMarkupDeletingDocumentSupplierDecorator(org.dice_research.topicmodeling.preprocessing.docsupplier.DocumentSupplier documentSource,
boolean removeCategoryLinks,
boolean keepTableContents) |
| Modifier and Type | Method and Description |
|---|---|
StackBasedMarkupDeletingMachine |
createDeleter() |
org.dice_research.topicmodeling.utils.doc.Document |
prepareDocument(org.dice_research.topicmodeling.utils.doc.Document document) |
void |
setKeepTableContents(boolean keepTableContents) |
void |
setRemoveCategoryLinks(boolean removeCategoryLinks) |
apply, getDecoratedDocumentSupplier, getNextDocument, setDecoratedDocumentSupplier, setDocumentStartIdclone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, waitprivate static final org.slf4j.Logger LOGGER
private boolean removeCategoryLinks
private boolean keepTableContents
public WikipediaMarkupDeletingDocumentSupplierDecorator(org.dice_research.topicmodeling.preprocessing.docsupplier.DocumentSupplier documentSource)
public WikipediaMarkupDeletingDocumentSupplierDecorator(org.dice_research.topicmodeling.preprocessing.docsupplier.DocumentSupplier documentSource,
boolean removeCategoryLinks)
public WikipediaMarkupDeletingDocumentSupplierDecorator(org.dice_research.topicmodeling.preprocessing.docsupplier.DocumentSupplier documentSource,
boolean removeCategoryLinks,
boolean keepTableContents)
public org.dice_research.topicmodeling.utils.doc.Document prepareDocument(org.dice_research.topicmodeling.utils.doc.Document document)
prepareDocument in class org.dice_research.topicmodeling.preprocessing.docsupplier.decorator.AbstractDocumentSupplierDecoratorpublic StackBasedMarkupDeletingMachine createDeleter()
public void setRemoveCategoryLinks(boolean removeCategoryLinks)
public void setKeepTableContents(boolean keepTableContents)
Copyright © 2015–2020. All rights reserved.