public class UseNetTextExtractingSupplierDecorator extends AbstractDocumentSupplierDecorator
| Modifier and Type | Field and Description |
|---|---|
| private static String | PGP_MESSAGE_END |
| private static String | PGP_MESSAGE_START |
| private static Set<String> | USE_NET_HEADING_KEYS |
Inherited fields: documentSource

| Constructor and Description |
|---|
| UseNetTextExtractingSupplierDecorator(org.dice_research.topicmodeling.preprocessing.docsupplier.DocumentSupplier documentSource) |
| Modifier and Type | Method and Description |
|---|---|
| private int | getTextStartIndex(String originalText) |
| private String | identifyTextUsingPgpStatements(String text) |
| org.dice_research.topicmodeling.utils.doc.Document | prepareDocument(org.dice_research.topicmodeling.utils.doc.Document document) |
| private String | removeAddresses(String originalText) |
| private int | removeAuthorLine(String originalText, int lineStartIndex) |
Inherited methods: apply, getDecoratedDocumentSupplier, getNextDocument, setDecoratedDocumentSupplier, setDocumentStartId

Methods inherited from java.lang.Object: clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait

private static final String PGP_MESSAGE_START
private static final String PGP_MESSAGE_END
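
This page does not show the values of the two PGP constants or the body of identifyTextUsingPgpStatements. As a rough illustration of what such a method might do, the following sketch cuts the text between a start and an end marker. The marker strings, the class name PgpBodySketch, and the fallback behavior are all assumptions made for this example, not the library's actual implementation.

```java
class PgpBodySketch {
    // Assumed marker values; the real constants are private and not documented on this page.
    static final String PGP_MESSAGE_START = "-----BEGIN PGP SIGNED MESSAGE-----";
    static final String PGP_MESSAGE_END = "-----BEGIN PGP SIGNATURE-----";

    /**
     * Returns the trimmed text between the two markers.
     * If either marker is missing, the input is returned unchanged (an assumed fallback).
     */
    static String identifyTextUsingPgpStatements(String text) {
        int start = text.indexOf(PGP_MESSAGE_START);
        if (start < 0) {
            return text;
        }
        int bodyStart = start + PGP_MESSAGE_START.length();
        int end = text.indexOf(PGP_MESSAGE_END, bodyStart);
        if (end < 0) {
            return text;
        }
        return text.substring(bodyStart, end).trim();
    }

    public static void main(String[] args) {
        String post = "From: someone@example.org\n"
                + "-----BEGIN PGP SIGNED MESSAGE-----\n"
                + "Hello world\n"
                + "-----BEGIN PGP SIGNATURE-----\n"
                + "abc123\n";
        System.out.println(identifyTextUsingPgpStatements(post));
    }
}
```

Returning the input unchanged when a marker is missing keeps unsigned posts intact, which seems a reasonable default for a preprocessing step; the real method may behave differently.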
public UseNetTextExtractingSupplierDecorator(org.dice_research.topicmodeling.preprocessing.docsupplier.DocumentSupplier documentSource)
public org.dice_research.topicmodeling.utils.doc.Document prepareDocument(org.dice_research.topicmodeling.utils.doc.Document document)
prepareDocument in class AbstractDocumentSupplierDecorator
private int getTextStartIndex(String originalText)
private int removeAuthorLine(String originalText, int lineStartIndex)
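
The signature of getTextStartIndex together with the USE_NET_HEADING_KEYS set suggests that the class skips leading Usenet header lines before the message body. The sketch below shows one way this could work: it walks the text line by line and returns the index of the first non-empty line that does not start with a known header key. The header keys, the class name HeaderSkipSketch, and the exact matching rule are hypothetical and chosen only for illustration.

```java
import java.util.Locale;
import java.util.Set;

class HeaderSkipSketch {
    // Hypothetical header keys; the real set is private and not listed on this page.
    static final Set<String> USE_NET_HEADING_KEYS =
            Set.of("from", "subject", "organization", "lines");

    /** Returns the index of the first non-empty line that is not a known header line. */
    static int getTextStartIndex(String originalText) {
        int pos = 0;
        while (pos < originalText.length()) {
            int lineEnd = originalText.indexOf('\n', pos);
            if (lineEnd < 0) {
                lineEnd = originalText.length();
            }
            String line = originalText.substring(pos, lineEnd);
            int colon = line.indexOf(':');
            // A header line has the form "Key: value" with a key from the known set.
            boolean isHeader = colon > 0
                    && USE_NET_HEADING_KEYS.contains(
                            line.substring(0, colon).trim().toLowerCase(Locale.ROOT));
            if (!isHeader && !line.trim().isEmpty()) {
                return pos;
            }
            // Move past the line break, but never beyond the end of the text.
            pos = Math.min(lineEnd + 1, originalText.length());
        }
        // Only headers and blank lines were found.
        return originalText.length();
    }

    public static void main(String[] args) {
        String post = "From: x\nSubject: y\n\nBody text\n";
        System.out.print(post.substring(getTextStartIndex(post)));
    }
}
```

Returning an index rather than a substring mirrors the int return type in the signature above and lets a caller such as prepareDocument decide how to slice the text.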
Copyright © 2015–2020. All rights reserved.