Index
All Classes and Interfaces|All Packages|Constant Field Values|Serialized Form
A
- abbreviate(InputStreamReader, int, String) - Static method in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- abbreviate(InputStream, Charset, int, String) - Static method in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- abbreviateAsUTF8(InputStream, int, String) - Static method in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- absPosToBlockOffset - Variable in class net.sansa_stack.hadoop.core.SeekableSourceOverSplit
- accumulate(A, T) - Method in interface net.sansa_stack.hadoop.core.Accumulating
- accumulate(DatasetOneNg, Quad) - Method in class net.sansa_stack.hadoop.format.jena.trig.RecordReaderRdfTrigDataset.AccumulatingDataset
- accumulatedValue(A) - Method in interface net.sansa_stack.hadoop.core.Accumulating
- accumulatedValue(DatasetOneNg) - Method in class net.sansa_stack.hadoop.format.jena.trig.RecordReaderRdfTrigDataset.AccumulatingDataset
- accumulating - Variable in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- Accumulating<T,
G, A, U> - Interface in net.sansa_stack.hadoop.core - AccumulatingDataset() - Constructor for class net.sansa_stack.hadoop.format.jena.trig.RecordReaderRdfTrigDataset.AccumulatingDataset
- adapt(Dialect, int) - Static method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsvFromCsvwOld
- afterSeek() - Method in class net.sansa_stack.hadoop.util.DeferredSeekablePushbackInputStream
- afterSeek() - Method in class net.sansa_stack.hadoop.util.SeekablePushbackInputStream
- aggregate(boolean, Stream<U>, Stream<U>) - Method in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
-
Modify a flow to perform aggregation of items into records according to specification The complex part here is to correctly combine the two flows: - The first group of the splitAggregateFlow needs to be skipped as this in handled by the previous split's processor - If there are no further groups in splitFlow then no items are emitted at all (because all items belong to s previous split) - ONLY if the splitFlow owned at least one group: The first group in the tailFlow needs to be emitted
- aggregate(Stream<T>, Accumulating<T, G, A, U>) - Static method in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
-
Direct aggregation of a stream via an accumulating instance
- autoDetectStartInQuotedField(CustomPatternCsv.CustomMatcherCsv2, int) - Static method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv
-
Attempt to auto-detect whether the csv row matcher is positioned inside of a quoted field.
B
- base - Variable in class net.sansa_stack.hadoop.core.pattern.CharSequenceReverse
- BASE_IRI_KEY - Static variable in class net.sansa_stack.hadoop.format.jena.base.FileInputFormatRdfBase
- baseIri - Variable in class net.sansa_stack.hadoop.format.jena.base.RecordReaderGenericRdfBase
- baseIriKey - Variable in class net.sansa_stack.hadoop.format.jena.base.RecordReaderGenericRdfBase
- basePath - Variable in class net.sansa_stack.hadoop.util.JsonHadoopBridge
- baseStream - Variable in class net.sansa_stack.hadoop.core.SeekableSourceOverSplit
-
The total number of bytes that need to be read from base until the split boundary is reached.
- bwdPattern - Variable in class net.sansa_stack.hadoop.core.pattern.CustomPatternFiltered
C
- CanParseRdf - Interface in net.sansa_stack.hadoop.format.jena.base
- CELL_MAXLENGTH_DEFAULT_VALUE - Static variable in class net.sansa_stack.hadoop.format.commons_csv.csv.RecordReaderCsv
- CELL_MAXLENGTH_DEFAULT_VALUE - Static variable in class net.sansa_stack.hadoop.format.univocity.csv.csv.RecordReaderCsvUnivocity
- CELL_MAXLENGTH_KEY - Static variable in class net.sansa_stack.hadoop.format.commons_csv.csv.RecordReaderCsv
-
The maximum length of a CSV cell containing new lines
- CELL_MAXLENGTH_KEY - Static variable in class net.sansa_stack.hadoop.format.univocity.csv.csv.RecordReaderCsvUnivocity
-
The maximum length of a CSV cell containing new lines
- CELL_MAXLINES_DEFAULT_VALUE - Static variable in class net.sansa_stack.hadoop.format.univocity.csv.csv.RecordReaderCsvUnivocity
- CELL_MAXLINES_KEY - Static variable in class net.sansa_stack.hadoop.format.univocity.csv.csv.RecordReaderCsvUnivocity
- cellMaxLength - Variable in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv
- charAt(int) - Method in class net.sansa_stack.hadoop.core.pattern.CharSequenceReverse
- charAt(CharSequence, int) - Static method in class net.sansa_stack.hadoop.core.pattern.CharSequences
-
Returns -1 if offset out of bounds
- chars() - Method in class net.sansa_stack.hadoop.core.pattern.CharSequenceReverse
- charSequence - Variable in class net.sansa_stack.hadoop.core.pattern.CustomMatcherBase
- CharSequenceReverse - Class in net.sansa_stack.hadoop.core.pattern
- CharSequenceReverse(CharSequence, int) - Constructor for class net.sansa_stack.hadoop.core.pattern.CharSequenceReverse
- CharSequenceReverse(CharSequence, int, int) - Constructor for class net.sansa_stack.hadoop.core.pattern.CharSequenceReverse
-
Create a char sequence from the start offset down to the end offset
- CharSequences - Class in net.sansa_stack.hadoop.core.pattern
- CharSequences() - Constructor for class net.sansa_stack.hadoop.core.pattern.CharSequences
- classify(Quad) - Method in class net.sansa_stack.hadoop.format.jena.trig.RecordReaderRdfTrigDataset.AccumulatingDataset
- classify(T) - Method in interface net.sansa_stack.hadoop.core.Accumulating
- close() - Method in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- close() - Method in class net.sansa_stack.hadoop.core.RecordReaderGenericBaseStatsWrapper
- close() - Method in class net.sansa_stack.hadoop.core.SeekableSourceOverSplit
- close() - Method in class net.sansa_stack.hadoop.format.gson.json.JsonElementArrayIterator
- close() - Method in class net.sansa_stack.hadoop.format.gson.json.JsonElementSequenceIterator
- close() - Method in class net.sansa_stack.hadoop.util.DeferredSeekablePushbackInputStream
- close() - Method in class net.sansa_stack.hadoop.util.InputStreamWithCloseLogging
- close(TaskAttemptContext) - Method in class net.sansa_stack.hadoop.output.jena.base.RecordWriterRowSetStream
- close(TaskAttemptContext) - Method in class net.sansa_stack.hadoop.output.jena.base.RecordWriterStreamRDF
- closeAction - Variable in class net.sansa_stack.hadoop.output.jena.base.RecordWriterStreamRDF
- codec - Variable in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- codePoints() - Method in class net.sansa_stack.hadoop.core.pattern.CharSequenceReverse
- columnMaxLength - Variable in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsvOld
-
Deprecated.
- columnMaxLength - Variable in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsvOld.Config
-
Deprecated.
- compile(String) - Static method in class net.sansa_stack.hadoop.core.pattern.CustomPatternJava
-
Convenience factory methods
- compile(String, int) - Static method in class net.sansa_stack.hadoop.core.pattern.CustomPatternJava
- computeNext() - Method in class net.sansa_stack.hadoop.format.gson.json.JsonElementArrayIterator
- computeNext() - Method in class net.sansa_stack.hadoop.format.gson.json.JsonElementSequenceIterator
- Config(char, char, String, String, int, int) - Constructor for class net.sansa_stack.hadoop.core.pattern.CustomPatternCsvOld.Config
-
Deprecated.
- ConfigurationUtils - Class in net.sansa_stack.hadoop.util
- ConfigurationUtils() - Constructor for class net.sansa_stack.hadoop.util.ConfigurationUtils
- content - Variable in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv.Field
- convert(Resource, ProbeResult) - Static method in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- create(char, char, int) - Static method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsvOld.Config
-
Deprecated.
- create(char, int) - Static method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsvOld.Config
-
Deprecated.
- create(int) - Static method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv
- create(int, int) - Static method in class net.sansa_stack.hadoop.output.jena.base.FragmentOutputSpec
- create(CustomPatternCsvOld.Config) - Static method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsvOld
-
Deprecated.
- create(ReadableChannel<byte[]>, ReadableChannel<byte[]>, byte[], NavigableMap<Long, Long>) - Static method in class net.sansa_stack.hadoop.core.SeekableSourceOverSplit
- create(Dialect, int, int) - Static method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv
- createAccumulator(G) - Method in interface net.sansa_stack.hadoop.core.Accumulating
- createAccumulator(Node) - Method in class net.sansa_stack.hadoop.format.jena.trig.RecordReaderRdfTrigDataset.AccumulatingDataset
- createBwdPatternClosingQuote(CustomPatternCsvOld.Config) - Static method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsvOld
-
Deprecated.
- createBwdPatternClosingQuote2(CustomPatternCsvOld.Config) - Static method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsvOld
-
Deprecated.
- createExcel(int) - Static method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsvOld.Config
-
Deprecated.
- createFlow(Job, InputFormat<?, T>, InputSplit) - Static method in class net.sansa_stack.hadoop.util.FileSplitUtils
-
Create a flow of records for a given input split w.r.t.
- createFlow2(Job, InputFormat<?, T>, InputSplit) - Static method in class net.sansa_stack.hadoop.util.FileSplitUtils
- createForBlockEncodedStream(SeekableInputStream, long, byte[]) - Static method in class net.sansa_stack.hadoop.core.SeekableSourceOverSplit
- createForNonEncodedStream(SeekableInputStream, long, byte[]) - Static method in class net.sansa_stack.hadoop.core.SeekableSourceOverSplit
- createFromPrototype(Object, String) - Static method in class net.sansa_stack.hadoop.util.JsonHadoopBridge
-
This method takes as argument an object that serves as a prototype: It is converted to json (null values retained) and the obtained (nested) keys are used as attributes
- createFwdQuotePattern(CustomPatternCsvOld.Config) - Static method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsvOld
-
Deprecated.Pattern for matching an effective quote in forward direction.
- createFwdQuotePatternOld(CustomPatternCsvOld.Config) - Static method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsvOld
-
Deprecated.
- createIsEscapedBwdPattern(CustomPatternCsvOld.Config) - Static method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsvOld
-
Deprecated.
- createMatcherFactory(CustomPattern, int, LongPredicate) - Static method in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- createPattern(Dialect) - Static method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv
- createRecordFlow() - Method in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- createRecordFlow() - Method in class net.sansa_stack.hadoop.format.commons_csv.csv.RecordReaderCsv
-
Override createRecordFlow to skip the first record if the requested format demands so.
- createRecordFlow() - Method in class net.sansa_stack.hadoop.format.univocity.csv.csv.RecordReaderCsvUnivocity
-
Override createRecordFlow to skip the first record if the requested format demands so.
- createRecordReader(InputSplit, TaskAttemptContext) - Method in class net.sansa_stack.hadoop.core.InputFormatStats
- createRecordReader(InputSplit, TaskAttemptContext) - Method in class net.sansa_stack.hadoop.format.commons_csv.csv.FileInputFormatCsv
- createRecordReader(InputSplit, TaskAttemptContext) - Method in class net.sansa_stack.hadoop.format.gson.json.FileInputFormatJsonArray
- createRecordReader(InputSplit, TaskAttemptContext) - Method in class net.sansa_stack.hadoop.format.gson.json.FileInputFormatJsonSequence
- createRecordReader(InputSplit, TaskAttemptContext) - Method in class net.sansa_stack.hadoop.format.jena.base.FileInputFormatRdfBase
- createRecordReader(InputSplit, TaskAttemptContext) - Method in class net.sansa_stack.hadoop.format.univocity.csv.csv.FileInputFormatCsvUnivocity
- createRecordReaderActual(InputSplit, TaskAttemptContext) - Method in class net.sansa_stack.hadoop.format.jena.base.FileInputFormatRdfBase
- createRecordReaderActual(InputSplit, TaskAttemptContext) - Method in class net.sansa_stack.hadoop.format.jena.nquads.FileInputFormatRdfNQuads
- createRecordReaderActual(InputSplit, TaskAttemptContext) - Method in class net.sansa_stack.hadoop.format.jena.ntriples.FileInputFormatRdfNTriples
- createRecordReaderActual(InputSplit, TaskAttemptContext) - Method in class net.sansa_stack.hadoop.format.jena.trig.FileInputFormatRdfTrigDataset
- createRecordReaderActual(InputSplit, TaskAttemptContext) - Method in class net.sansa_stack.hadoop.format.jena.trig.FileInputFormatRdfTrigQuad
- createRecordReaderActual(InputSplit, TaskAttemptContext) - Method in class net.sansa_stack.hadoop.format.jena.turtle.FileInputFormatRdfTurtleTriple
- createStartOfCsvRecordPattern(long) - Static method in class net.sansa_stack.hadoop.format.commons_csv.csv.RecordReaderCsv
-
Create a regex for matching csv record starts.
- createStreamRDFFactory(RDFFormat, boolean, PrefixMap, FragmentOutputSpec) - Static method in class net.sansa_stack.hadoop.output.jena.base.StreamRDFUtils
-
Create a function that can create a StreamRDF instance that is backed by the given OutputStream.
- createTestParameters(Map<String, Range<Integer>>) - Static method in class net.sansa_stack.hadoop.util.FileSplitUtils
-
Util method typically for use with split-related unit tests
- creationStackTrace - Variable in class net.sansa_stack.hadoop.util.InputStreamWithCloseLogging
- CSV_FORMAT_RAW_KEY - Static variable in class net.sansa_stack.hadoop.format.commons_csv.csv.RecordReaderCsv
-
Key for the serialized bytes of a
CSVFormatinstance - CSV_FORMAT_RAW_KEY - Static variable in class net.sansa_stack.hadoop.format.univocity.csv.csv.RecordReaderCsvUnivocity
-
Key for the serialized bytes of a
CSVFormatinstance - CsvUtils - Class in net.sansa_stack.hadoop.format.commons_csv.csv
- CsvUtils() - Constructor for class net.sansa_stack.hadoop.format.commons_csv.csv.CsvUtils
- currentFieldContentStart - Variable in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv.CustomMatcherCsv2
-
Start of field content (excluding a possible leading a quote)
- currentFieldContentStart - Variable in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv.MatchState
- currentKey - Variable in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- currentLineCount - Variable in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv.CustomMatcherCsv2
- currentValue - Variable in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- CustomMatcher - Interface in net.sansa_stack.hadoop.core.pattern
- CustomMatcherBase - Class in net.sansa_stack.hadoop.core.pattern
- CustomMatcherBase(CharSequence) - Constructor for class net.sansa_stack.hadoop.core.pattern.CustomMatcherBase
- CustomMatcherCsv(CharSequence) - Constructor for class net.sansa_stack.hadoop.core.pattern.CustomPatternCsvOld.CustomMatcherCsv
-
Deprecated.
- CustomMatcherCsv2(CharSequence) - Constructor for class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv.CustomMatcherCsv2
- CustomMatcherDecorator - Class in net.sansa_stack.hadoop.core.pattern
- CustomMatcherDecorator() - Constructor for class net.sansa_stack.hadoop.core.pattern.CustomMatcherDecorator
- CustomMatcherFilter(CharSequence) - Constructor for class net.sansa_stack.hadoop.core.pattern.CustomPatternFiltered.CustomMatcherFilter
- CustomMatcherJava - Class in net.sansa_stack.hadoop.core.pattern
- CustomMatcherJava(Matcher) - Constructor for class net.sansa_stack.hadoop.core.pattern.CustomMatcherJava
- CustomMatcherReplay(CustomMatcher) - Constructor for class net.sansa_stack.hadoop.core.pattern.CustomPatternReplay.CustomMatcherReplay
- CustomMatcherTrigGraph(CharSequence) - Constructor for class net.sansa_stack.hadoop.core.pattern.CustomPatternTrigGraph.CustomMatcherTrigGraph
- CustomPattern - Interface in net.sansa_stack.hadoop.core.pattern
- CustomPatternCsv - Class in net.sansa_stack.hadoop.core.pattern
- CustomPatternCsv(Dialect, int, int) - Constructor for class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv
- CustomPatternCsv.CustomMatcherCsv2 - Class in net.sansa_stack.hadoop.core.pattern
- CustomPatternCsv.Field - Class in net.sansa_stack.hadoop.core.pattern
- CustomPatternCsv.MatchRegion - Class in net.sansa_stack.hadoop.core.pattern
- CustomPatternCsv.MatchState - Class in net.sansa_stack.hadoop.core.pattern
- CustomPatternCsv.Row - Class in net.sansa_stack.hadoop.core.pattern
- CustomPatternCsvFromCsvwOld - Class in net.sansa_stack.hadoop.core.pattern
-
Adapter method to configure the CustomPatternCsv from a Csvw Dialect
- CustomPatternCsvFromCsvwOld() - Constructor for class net.sansa_stack.hadoop.core.pattern.CustomPatternCsvFromCsvwOld
- CustomPatternCsvOld - Class in net.sansa_stack.hadoop.core.pattern
-
Deprecated.
- CustomPatternCsvOld(int, CustomPattern, CustomPattern, CustomPattern) - Constructor for class net.sansa_stack.hadoop.core.pattern.CustomPatternCsvOld
-
Deprecated.
- CustomPatternCsvOld.Config - Class in net.sansa_stack.hadoop.core.pattern
-
Deprecated.
- CustomPatternCsvOld.CustomMatcherCsv - Class in net.sansa_stack.hadoop.core.pattern
-
Deprecated.
- CustomPatternDecoratorBase - Class in net.sansa_stack.hadoop.core.pattern
- CustomPatternDecoratorBase(CustomPattern) - Constructor for class net.sansa_stack.hadoop.core.pattern.CustomPatternDecoratorBase
- CustomPatternFiltered - Class in net.sansa_stack.hadoop.core.pattern
- CustomPatternFiltered(CustomPattern, CustomPattern) - Constructor for class net.sansa_stack.hadoop.core.pattern.CustomPatternFiltered
- CustomPatternFiltered.CustomMatcherFilter - Class in net.sansa_stack.hadoop.core.pattern
- CustomPatternJava - Class in net.sansa_stack.hadoop.core.pattern
- CustomPatternJava(Pattern) - Constructor for class net.sansa_stack.hadoop.core.pattern.CustomPatternJava
- CustomPatternReplay - Class in net.sansa_stack.hadoop.core.pattern
-
A caching pattern whose matchers can be replay prior matches
- CustomPatternReplay(CustomPattern) - Constructor for class net.sansa_stack.hadoop.core.pattern.CustomPatternReplay
- CustomPatternReplay.CustomMatcherReplay - Class in net.sansa_stack.hadoop.core.pattern
- CustomPatternReplay.Match - Class in net.sansa_stack.hadoop.core.pattern
- CustomPatternReplay.Region - Class in net.sansa_stack.hadoop.core.pattern
- CustomPatternTrigGraph - Class in net.sansa_stack.hadoop.core.pattern
- CustomPatternTrigGraph() - Constructor for class net.sansa_stack.hadoop.core.pattern.CustomPatternTrigGraph
- CustomPatternTrigGraph.CustomMatcherTrigGraph - Class in net.sansa_stack.hadoop.core.pattern
D
- datasetFlow - Variable in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- debufferedHead - Variable in class net.sansa_stack.hadoop.core.SeekableSourceOverSplit
- decompressor - Variable in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- decoratee - Variable in class net.sansa_stack.hadoop.core.InputFormatStats
- decoratee - Variable in class net.sansa_stack.hadoop.core.pattern.CustomPatternReplay.CustomMatcherReplay
- decoratee - Variable in class net.sansa_stack.hadoop.core.RecordReaderGenericBaseStatsWrapper
- DeferredSeekablePushbackInputStream - Class in net.sansa_stack.hadoop.util
-
A wrapper for hadoop input streams created from codecs in ReadMode.BY_BLOCK: Defers reading by one byte such that position changes are advertised on the byte BEFORE the block boundary rather than on the byte AFTER it.
- DeferredSeekablePushbackInputStream(InputStream) - Constructor for class net.sansa_stack.hadoop.util.DeferredSeekablePushbackInputStream
- DeferredSeekablePushbackInputStream(InputStream, Seekable) - Constructor for class net.sansa_stack.hadoop.util.DeferredSeekablePushbackInputStream
- DELEGATE - Static variable in class net.sansa_stack.hadoop.core.SansaHadoopConstants
- delimiter - Variable in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsvOld.Config
-
Deprecated.
- detectTail(BufferOverReadableChannel<byte[]>) - Method in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
-
Try to detected the nth record offset in the subsequent split.
- dialect - Variable in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv
- didHitSplitBound(Seekable, long) - Method in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- disableSkipHeaderRecord(CSVFormat) - Method in class net.sansa_stack.hadoop.format.commons_csv.csv.RecordReaderCsv
E
- effectiveCsvFormat - Variable in class net.sansa_stack.hadoop.format.commons_csv.csv.RecordReaderCsv
- effectiveInputStream(InputStream) - Method in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
-
This method is meant for overriding the input stream
- effectiveInputStream(InputStream) - Method in class net.sansa_stack.hadoop.format.gson.json.RecordReaderJsonArray
-
Always replace the first character (which is either a comma or open bracket) with an open bracket in order to mimick a JSON array start.
- effectiveInputStreamSupp(ReadableChannel<byte[]>) - Method in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- ELEMENT_PROBECOUNT_KEY - Static variable in class net.sansa_stack.hadoop.format.jena.trig.RecordReaderRdfTrigDataset
- emitHead - Variable in class net.sansa_stack.hadoop.output.jena.base.FragmentOutputSpec
- emitTail - Variable in class net.sansa_stack.hadoop.output.jena.base.FragmentOutputSpec
- EMPTY_BYTE_ARRAY - Static variable in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- enableStats - Variable in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- end - Variable in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv.MatchRegion
- end() - Method in class net.sansa_stack.hadoop.core.pattern.CustomMatcherDecorator
- end() - Method in class net.sansa_stack.hadoop.core.pattern.CustomMatcherJava
- end() - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv.CustomMatcherCsv2
- end() - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsvOld.CustomMatcherCsv
-
Deprecated.
- end() - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternFiltered.CustomMatcherFilter
- end() - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternReplay.CustomMatcherReplay
- end() - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternTrigGraph.CustomMatcherTrigGraph
- end(int) - Method in class net.sansa_stack.hadoop.core.pattern.CustomMatcherBase
- end(int) - Method in class net.sansa_stack.hadoop.core.pattern.CustomMatcherDecorator
- end(int) - Method in class net.sansa_stack.hadoop.core.pattern.CustomMatcherJava
- end(int) - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternReplay.CustomMatcherReplay
- end(String) - Method in interface net.sansa_stack.hadoop.core.pattern.CustomMatcher
- end(String) - Method in class net.sansa_stack.hadoop.core.pattern.CustomMatcherBase
- end(String) - Method in class net.sansa_stack.hadoop.core.pattern.CustomMatcherJava
- end(String) - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv.CustomMatcherCsv2
- end(String) - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternReplay.CustomMatcherReplay
- endOfQuotedFieldFwdPattern - Variable in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsvOld
-
Deprecated.
- endPos - Variable in class net.sansa_stack.hadoop.core.OffsetSeekResult
- ensureInit(Configuration) - Method in class net.sansa_stack.hadoop.core.InputFormatStats
- escapeChar - Variable in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsvOld.Config
-
Deprecated.
F
- fallbackBuffer - Variable in class net.sansa_stack.hadoop.util.DeferredSeekablePushbackInputStream
- Field(long, long, CharSequence, CharSequence, CharSequence) - Constructor for class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv.Field
- fields - Variable in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv.Row
- fieldSeparatorAndNewlineMatcher - Variable in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv.CustomMatcherCsv2
-
Matcher to move to the next field separator or newline
- fieldSeparatorAndNewlinePattern - Variable in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv
- FileInputFormatCsv - Class in net.sansa_stack.hadoop.format.commons_csv.csv
- FileInputFormatCsv() - Constructor for class net.sansa_stack.hadoop.format.commons_csv.csv.FileInputFormatCsv
- FileInputFormatCsvUnivocity - Class in net.sansa_stack.hadoop.format.univocity.csv.csv
- FileInputFormatCsvUnivocity() - Constructor for class net.sansa_stack.hadoop.format.univocity.csv.csv.FileInputFormatCsvUnivocity
- FileInputFormatJsonArray - Class in net.sansa_stack.hadoop.format.gson.json
- FileInputFormatJsonArray() - Constructor for class net.sansa_stack.hadoop.format.gson.json.FileInputFormatJsonArray
- FileInputFormatJsonSequence - Class in net.sansa_stack.hadoop.format.gson.json
- FileInputFormatJsonSequence() - Constructor for class net.sansa_stack.hadoop.format.gson.json.FileInputFormatJsonSequence
- FileInputFormatRdfBase<T> - Class in net.sansa_stack.hadoop.format.jena.base
-
Base class for unit testing of reading an RDF file with an arbitrary number of splits.
- FileInputFormatRdfBase(Lang, String) - Constructor for class net.sansa_stack.hadoop.format.jena.base.FileInputFormatRdfBase
- FileInputFormatRdfNQuads - Class in net.sansa_stack.hadoop.format.jena.nquads
- FileInputFormatRdfNQuads() - Constructor for class net.sansa_stack.hadoop.format.jena.nquads.FileInputFormatRdfNQuads
- FileInputFormatRdfNTriples - Class in net.sansa_stack.hadoop.format.jena.ntriples
- FileInputFormatRdfNTriples() - Constructor for class net.sansa_stack.hadoop.format.jena.ntriples.FileInputFormatRdfNTriples
- FileInputFormatRdfTrigDataset - Class in net.sansa_stack.hadoop.format.jena.trig
- FileInputFormatRdfTrigDataset() - Constructor for class net.sansa_stack.hadoop.format.jena.trig.FileInputFormatRdfTrigDataset
- FileInputFormatRdfTrigQuad - Class in net.sansa_stack.hadoop.format.jena.trig
- FileInputFormatRdfTrigQuad() - Constructor for class net.sansa_stack.hadoop.format.jena.trig.FileInputFormatRdfTrigQuad
- FileInputFormatRdfTurtleTriple - Class in net.sansa_stack.hadoop.format.jena.turtle
- FileInputFormatRdfTurtleTriple() - Constructor for class net.sansa_stack.hadoop.format.jena.turtle.FileInputFormatRdfTurtleTriple
- FileSplitUtils - Class in net.sansa_stack.hadoop.util
- FileSplitUtils() - Constructor for class net.sansa_stack.hadoop.util.FileSplitUtils
- fileSystem - Variable in class net.sansa_stack.hadoop.jena.locator.LocatorHdfs
- FileSystemUtils - Class in net.sansa_stack.hadoop.util
- FileSystemUtils() - Constructor for class net.sansa_stack.hadoop.util.FileSystemUtils
- find() - Method in interface net.sansa_stack.hadoop.core.pattern.CustomMatcher
- find() - Method in class net.sansa_stack.hadoop.core.pattern.CustomMatcherDecorator
- find() - Method in class net.sansa_stack.hadoop.core.pattern.CustomMatcherJava
- find() - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv.CustomMatcherCsv2
- find() - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsvOld.CustomMatcherCsv
-
Deprecated.The find method operates as follows: 1.) set startPos to 0 2.) Find the first character after a newline after startPos - save this position and candidatePos 3.) Verify that the newline is a new row start: Check backwards up to the startPos (not exceeding it) - if there is no start of an escaped field then the newline is a row start and break 4.) Set startPos to candidatePos and got to 2 Notes: - An empty quoted field: Assume the following data: "","",hello "",""$ The issue is that a quoted empty field may look exactly like an escaped double quote.
- find() - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternFiltered.CustomMatcherFilter
- find() - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternReplay.CustomMatcherReplay
- find() - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternTrigGraph.CustomMatcherTrigGraph
- findFirstPositionWithProbeSuccess(SeekableReadableChannel<byte[]>, LongPredicate, MatcherFactory, boolean, BufferOverReadableChannel<U[]>, Prober<U>) - Static method in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
-
Uses the matcher to find candidate probing positions, and returns the first position where probing succeeds.
- findNext() - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv.CustomMatcherCsv2
- findNextRegion(CustomPattern, SeekableReadableChannel<byte[]>, long, long, long, long, LongPredicate, LongPredicate, BufferOverReadableChannel<U[]>, Prober<U>) - Static method in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- firstCharOnNewLinePattern - Variable in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsvOld
-
Deprecated.
- fragmentOutputSpec - Variable in class net.sansa_stack.hadoop.output.jena.base.RecordWriterRowSetStream
- FragmentOutputSpec - Class in net.sansa_stack.hadoop.output.jena.base
- FragmentOutputSpec(boolean, boolean) - Constructor for class net.sansa_stack.hadoop.output.jena.base.FragmentOutputSpec
- fwdMatcher - Variable in class net.sansa_stack.hadoop.core.pattern.CustomPatternFiltered.CustomMatcherFilter
- fwdPattern - Variable in class net.sansa_stack.hadoop.core.pattern.CustomPatternFiltered
G
- getAbsPosToBlockOffset() - Method in class net.sansa_stack.hadoop.core.SeekableSourceOverSplit
- getArrayOps() - Method in class net.sansa_stack.hadoop.core.SeekableSourceOverSplit
- getBlockForPos(long) - Method in class net.sansa_stack.hadoop.core.SeekableSourceOverSplit
- getBufferByBaseOffset(long) - Method in class net.sansa_stack.hadoop.core.SeekableSourceOverSplit
- getBufferByIndex(int) - Method in class net.sansa_stack.hadoop.core.SeekableSourceOverSplit
- getBufferByIndexUnsafe(int) - Method in class net.sansa_stack.hadoop.core.SeekableSourceOverSplit
- getCandidatePos() - Method in interface net.sansa_stack.hadoop.core.ProbeStats
- getColumnMaxLength() - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsvOld.Config
-
Deprecated.
- getContent() - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv.Field
- getCsvFormat(Configuration, CSVFormat) - Static method in class net.sansa_stack.hadoop.format.commons_csv.csv.FileInputFormatCsv
- getCurrentKey() - Method in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- getCurrentKey() - Method in class net.sansa_stack.hadoop.core.RecordReaderGenericBaseStatsWrapper
- getCurrentValue() - Method in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- getCurrentValue() - Method in class net.sansa_stack.hadoop.core.RecordReaderGenericBaseStatsWrapper
- getDecodedStreamFromSplit(FileSplit, Configuration) - Static method in class net.sansa_stack.hadoop.util.FileSplitUtils
-
Util method to open a decoded stream from a split.
- getDecoratee() - Method in class net.sansa_stack.hadoop.core.pattern.CustomMatcherDecorator
- getDecoratee() - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternReplay.CustomMatcherReplay
- getDefaultRdfFormat() - Method in class net.sansa_stack.hadoop.output.jena.base.OutputFormatStreamRdfBase
- getDefaultRdfFormat() - Method in class net.sansa_stack.hadoop.output.jena.base.OutputFormatStreamRdfQuad
- getDefaultRdfFormat() - Method in class net.sansa_stack.hadoop.output.jena.base.OutputFormatStreamRdfTriple
- getDefaultResultSetLang() - Method in class net.sansa_stack.hadoop.output.jena.base.OutputFormatRowSet
- getDelimiter() - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsvOld.Config
-
Deprecated.
- getEnd() - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv.MatchRegion
- getEndPos() - Method in class net.sansa_stack.hadoop.core.OffsetSeekResult
- getErrorMessage() - Method in interface net.sansa_stack.hadoop.core.Stats2
- getEscapeChar() - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsvOld.Config
-
Deprecated.
- getFirstBlock() - Method in interface net.sansa_stack.hadoop.core.Stats2
-
Null if the split is not backed by a blocked stream; -1 if the stream uses blocks but none was detected
- getHeadBuffer() - Method in class net.sansa_stack.hadoop.core.SeekableSourceOverSplit
- getHeadSize() - Method in class net.sansa_stack.hadoop.core.SeekableSourceOverSplit
- getJson(Configuration, Gson, String, Class<T>, T) - Static method in class net.sansa_stack.hadoop.util.ConfigurationUtils
- getKnownSize() - Method in class net.sansa_stack.hadoop.core.SeekableSourceOverSplit
- getLang() - Method in class net.sansa_stack.hadoop.format.jena.base.RecordReaderRdfConf
- getLang(Configuration, Lang) - Static method in class net.sansa_stack.hadoop.output.jena.base.RdfOutputUtils
- getLastMatchedFields() - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv.CustomMatcherCsv2
- getLineTerminatorPattern() - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsvOld.Config
-
Deprecated.
- getMapQuadsToTriplesForTripleLangs(Configuration) - Static method in class net.sansa_stack.hadoop.output.jena.base.RdfOutputUtils
- getMaxConsecutiveEscapeChars() - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsvOld.Config
-
Deprecated.
- getMaxRecordLengthKey() - Method in class net.sansa_stack.hadoop.format.jena.base.RecordReaderConf
- getMinRecordLengthKey() - Method in class net.sansa_stack.hadoop.format.jena.base.RecordReaderConf
- getModel(Configuration) - Static method in class net.sansa_stack.hadoop.format.jena.base.FileInputFormatRdfBase
-
Extract a Model from a hadoop conf using
FileInputFormatRdfBase.PREFIXES_KEY - getModel(Configuration, String) - Static method in class net.sansa_stack.hadoop.format.jena.base.FileInputFormatRdfBase
-
Extract a Model from a hadoop conf.
- getName() - Method in class net.sansa_stack.hadoop.jena.locator.LocatorHdfs
- getPos() - Method in class net.sansa_stack.hadoop.util.DeferredSeekablePushbackInputStream
- getPos() - Method in interface net.sansa_stack.hadoop.util.SeekableDecorator
- getPos(Seekable) - Static method in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- getPosition(SeekableReadableChannel<byte[]>) - Static method in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- getPrefix() - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv.Field
- getPrefixByteCount(Configuration) - Method in class net.sansa_stack.hadoop.format.jena.base.FileInputFormatRdfBase
- getPrefixes(Configuration) - Static method in class net.sansa_stack.hadoop.output.jena.base.RdfOutputUtils
- getPrefixesMaxLengthKey() - Method in class net.sansa_stack.hadoop.format.jena.base.RecordReaderRdfConf
- getProbeCount() - Method in interface net.sansa_stack.hadoop.core.ProbeStats
- getProbeElementCountKey() - Method in class net.sansa_stack.hadoop.format.jena.base.RecordReaderConf
- getProbeRecordCountKey() - Method in class net.sansa_stack.hadoop.format.jena.base.RecordReaderConf
- getProgress() - Method in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- getProgress() - Method in class net.sansa_stack.hadoop.core.RecordReaderGenericBaseStatsWrapper
- getQuoteChar() - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsvOld.Config
-
Deprecated.
- getQuoteErrorCount() - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv.CustomMatcherCsv2
- getRdfFormat(Configuration, RDFFormat) - Static method in class net.sansa_stack.hadoop.output.jena.base.RdfOutputUtils
- getRecordCount() - Method in class net.sansa_stack.hadoop.core.Stats
- getRecordCount() - Method in interface net.sansa_stack.hadoop.core.Stats2
- getRecordSearchPattern() - Method in class net.sansa_stack.hadoop.format.jena.base.RecordReaderConf
- getRecordWriter(Configuration, OutputStream, FragmentOutputSpec) - Method in class net.sansa_stack.hadoop.output.jena.base.OutputFormatBase
- getRecordWriter(Configuration, OutputStream, FragmentOutputSpec) - Method in class net.sansa_stack.hadoop.output.jena.base.OutputFormatRowSet
- getRecordWriter(Configuration, OutputStream, FragmentOutputSpec) - Method in class net.sansa_stack.hadoop.output.jena.base.OutputFormatStreamRdfBase
- getRecordWriter(TaskAttemptContext) - Method in class net.sansa_stack.hadoop.output.jena.base.OutputFormatBase
- getRegionEndProbeResult() - Method in class net.sansa_stack.hadoop.core.Stats
- getRegionEndProbeResult() - Method in interface net.sansa_stack.hadoop.core.Stats2
- getRegionStartProbeResult() - Method in class net.sansa_stack.hadoop.core.Stats
- getRegionStartProbeResult() - Method in interface net.sansa_stack.hadoop.core.Stats2
- getRows() - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv.MatchState
- getSeekable() - Method in class net.sansa_stack.hadoop.util.DeferredSeekablePushbackInputStream
- getSeekable() - Method in interface net.sansa_stack.hadoop.util.SeekableDecorator
- getSeekable() - Method in class net.sansa_stack.hadoop.util.SeekableInputStream
-
You should not change the position of the underlying seekable directly while this input stream is in use.
- getSeekable() - Method in class net.sansa_stack.hadoop.util.SeekablePushbackInputStream
- getSerializable(Configuration, String, T) - Static method in class net.sansa_stack.hadoop.util.ConfigurationUtils
-
Get a (non-null) string as a base64 url encoded serialized object Obviously results in non-human-readable configuration objects and should thus be avoided.
- getSplitCount(Configuration) - Static method in class net.sansa_stack.hadoop.output.jena.base.OutputUtils
- getSplitEnd() - Method in class net.sansa_stack.hadoop.core.Stats
- getSplits(JobContext) - Method in class net.sansa_stack.hadoop.core.InputFormatStats
- getSplits(JobContext) - Method in class net.sansa_stack.hadoop.format.jena.base.FileInputFormatRdfBase
- getSplitSize() - Method in interface net.sansa_stack.hadoop.core.Stats2
-
Note: The size is probably more helpful than its absolute end offset
- getSplitStart() - Method in class net.sansa_stack.hadoop.core.Stats
- getSplitStart() - Method in interface net.sansa_stack.hadoop.core.Stats2
- getStart() - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv.MatchRegion
- getStartPos() - Method in class net.sansa_stack.hadoop.core.OffsetSeekResult
- getStats() - Method in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- getSuffix() - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv.Field
- getTailBuffer() - Method in class net.sansa_stack.hadoop.core.SeekableSourceOverSplit
- getTailElementCount() - Method in class net.sansa_stack.hadoop.core.Stats
- getTailElementCount() - Method in interface net.sansa_stack.hadoop.core.Stats2
- getTotalBytesRead() - Method in class net.sansa_stack.hadoop.core.Stats
- getTotalBytesRead() - Method in interface net.sansa_stack.hadoop.core.Stats2
- getTotalDuration() - Method in interface net.sansa_stack.hadoop.core.ProbeStats
- getTotalElementCount() - Method in class net.sansa_stack.hadoop.core.Stats
- getTotalElementCount() - Method in interface net.sansa_stack.hadoop.core.Stats2
- getTotalRecordCount() - Method in class net.sansa_stack.hadoop.core.Stats
- getTotalRecordCount() - Method in interface net.sansa_stack.hadoop.core.Stats2
- getTotalTime() - Method in interface net.sansa_stack.hadoop.core.Stats2
- getUnivocityConfig(Configuration) - Static method in class net.sansa_stack.hadoop.format.univocity.csv.csv.FileInputFormatCsvUnivocity
- getVars(Configuration) - Static method in class net.sansa_stack.hadoop.output.jena.base.RdfOutputUtils
- group() - Method in class net.sansa_stack.hadoop.core.pattern.CustomMatcherDecorator
- group() - Method in class net.sansa_stack.hadoop.core.pattern.CustomMatcherJava
- group() - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv.CustomMatcherCsv2
- group() - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsvOld.CustomMatcherCsv
-
Deprecated.
- group() - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternFiltered.CustomMatcherFilter
- group() - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternTrigGraph.CustomMatcherTrigGraph
- group(int) - Method in class net.sansa_stack.hadoop.core.pattern.CustomMatcherBase
- group(int) - Method in class net.sansa_stack.hadoop.core.pattern.CustomMatcherDecorator
- group(int) - Method in class net.sansa_stack.hadoop.core.pattern.CustomMatcherJava
- groupCount() - Method in class net.sansa_stack.hadoop.core.pattern.CustomMatcherBase
- groupCount() - Method in class net.sansa_stack.hadoop.core.pattern.CustomMatcherDecorator
- groupCount() - Method in class net.sansa_stack.hadoop.core.pattern.CustomMatcherJava
- groupCount() - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternReplay.CustomMatcherReplay
- groups - Variable in class net.sansa_stack.hadoop.core.pattern.CustomPatternReplay.Match
- gson - Variable in class net.sansa_stack.hadoop.format.gson.json.JsonElementArrayIterator
- gson - Variable in class net.sansa_stack.hadoop.format.gson.json.JsonElementSequenceIterator
- gson - Variable in class net.sansa_stack.hadoop.format.gson.json.RecordReaderJsonArray
- gson - Variable in class net.sansa_stack.hadoop.format.gson.json.RecordReaderJsonSequence
H
- hashCode() - Method in class net.sansa_stack.hadoop.jena.locator.LocatorHdfs
- headBuffer - Variable in class net.sansa_stack.hadoop.core.SeekableSourceOverSplit
- headerBytesKey - Variable in class net.sansa_stack.hadoop.format.jena.base.RecordReaderGenericRdfBase
I
- identity() - Static method in interface net.sansa_stack.hadoop.core.Accumulating
-
Identity accumulator - turns each item into a group that contains only the item and whose value is the item
- init(RowSetStreamWriter, FragmentOutputSpec) - Static method in class net.sansa_stack.hadoop.output.jena.base.RowSetStreamWriterUtils
-
Based on fragmentOutputSpec does the following: If this is the first partition then writes the header.
- initialize(InputSplit, TaskAttemptContext) - Method in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
-
Read out config paramaters (prefixes, length thresholds, ...) and examine the codec in order to set an internal flag whether the stream will be encoded or not.
- initialize(InputSplit, TaskAttemptContext) - Method in class net.sansa_stack.hadoop.core.RecordReaderGenericBaseStatsWrapper
- initialize(InputSplit, TaskAttemptContext) - Method in class net.sansa_stack.hadoop.format.commons_csv.csv.RecordReaderCsv
- initialize(InputSplit, TaskAttemptContext) - Method in class net.sansa_stack.hadoop.format.gson.json.RecordReaderJsonArray
- initialize(InputSplit, TaskAttemptContext) - Method in class net.sansa_stack.hadoop.format.gson.json.RecordReaderJsonSequence
- initialize(InputSplit, TaskAttemptContext) - Method in class net.sansa_stack.hadoop.format.jena.base.RecordReaderGenericRdfBase
- initialize(InputSplit, TaskAttemptContext) - Method in class net.sansa_stack.hadoop.format.univocity.csv.csv.RecordReaderCsvUnivocity
- initRecordFlow() - Method in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- InputFormatStats - Class in net.sansa_stack.hadoop.core
- InputFormatStats() - Constructor for class net.sansa_stack.hadoop.core.InputFormatStats
- InputFormatStats(InputFormat<?, ?>) - Constructor for class net.sansa_stack.hadoop.core.InputFormatStats
- InputStreamWithCloseLogging - Class in net.sansa_stack.hadoop.util
-
Util class to debug a stream already closed exception
- InputStreamWithCloseLogging(InputStream, BiConsumer<? super Throwable, ? super Throwable>) - Constructor for class net.sansa_stack.hadoop.util.InputStreamWithCloseLogging
- InterruptingSeekableByteChannel - Class in net.sansa_stack.nio.util
- InterruptingSeekableByteChannel(SeekableByteChannel, long) - Constructor for class net.sansa_stack.nio.util.InterruptingSeekableByteChannel
- interruptPos - Variable in class net.sansa_stack.nio.util.InterruptingSeekableByteChannel
- isEffectiveQuoteBwd(CharSequence, int, int, char, char) - Static method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv
- isEffectiveQuoteFwd(CharSequence, int, char, char) - Static method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv
- isEmitHead() - Method in class net.sansa_stack.hadoop.output.jena.base.FragmentOutputSpec
- isEmitTail() - Method in class net.sansa_stack.hadoop.output.jena.base.FragmentOutputSpec
- isEncoded - Variable in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- isFinished - Variable in class net.sansa_stack.hadoop.core.pattern.CustomPatternReplay.CustomMatcherReplay
- isFirstElt - Variable in class net.sansa_stack.hadoop.format.gson.json.JsonElementArrayIterator
- isFirstSplit - Variable in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- isFollowedByEffectiveQuote(CharSequence, int, char, char) - Static method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv
-
Determine whether the next character is an effective quote.
- isInQuotedField - Variable in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv.CustomMatcherCsv2
- isInQuotedField - Variable in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv.MatchState
- isInQuotedField() - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv.MatchState
- IsInQuotedField() - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv.CustomMatcherCsv2
- isOpen() - Method in class net.sansa_stack.hadoop.util.SeekableByteChannelFromSeekableInputStream
- isPrecededByEffectiveQuote(CharSequence, int, int, char, char) - Static method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv
-
Checks whether the previous position has a quote that is not escaped
- isRegionStartSearchReadOverRegionEnd() - Method in class net.sansa_stack.hadoop.core.Stats
- isRegionStartSearchReadOverRegionEnd() - Method in interface net.sansa_stack.hadoop.core.Stats2
- isRegionStartSearchReadOverSplitEnd() - Method in class net.sansa_stack.hadoop.core.Stats
- isRegionStartSearchReadOverSplitEnd() - Method in interface net.sansa_stack.hadoop.core.Stats2
- isSplitable(JobContext, Path) - Method in class net.sansa_stack.hadoop.format.commons_csv.csv.FileInputFormatCsv
- isSplitable(JobContext, Path) - Method in class net.sansa_stack.hadoop.format.gson.json.FileInputFormatJsonArray
- isSplitable(JobContext, Path) - Method in class net.sansa_stack.hadoop.format.gson.json.FileInputFormatJsonSequence
- isSplitable(JobContext, Path) - Method in class net.sansa_stack.hadoop.format.jena.base.FileInputFormatRdfBase
- isSplitable(JobContext, Path) - Method in class net.sansa_stack.hadoop.format.univocity.csv.csv.FileInputFormatCsvUnivocity
- isSuccess() - Method in class net.sansa_stack.hadoop.core.OffsetSeekResult
- isValidPos(CharSequence, int) - Static method in class net.sansa_stack.hadoop.core.pattern.CharSequences
J
- JenaPluginSansaHadoop - Class in net.sansa_stack.hadoop.core.plugin
- JenaPluginSansaHadoop() - Constructor for class net.sansa_stack.hadoop.core.plugin.JenaPluginSansaHadoop
- JsonElementArrayIterator - Class in net.sansa_stack.hadoop.format.gson.json
- JsonElementArrayIterator(Gson, JsonReader) - Constructor for class net.sansa_stack.hadoop.format.gson.json.JsonElementArrayIterator
- JsonElementSequenceIterator - Class in net.sansa_stack.hadoop.format.gson.json
-
Read a sequence of JSON elements without separator.
- JsonElementSequenceIterator(Gson, JsonReader) - Constructor for class net.sansa_stack.hadoop.format.gson.json.JsonElementSequenceIterator
- jsonFwdPattern - Static variable in class net.sansa_stack.hadoop.format.gson.json.RecordReaderJsonArray
- jsonFwdPattern - Static variable in class net.sansa_stack.hadoop.format.gson.json.RecordReaderJsonSequence
- JsonHadoopBridge - Class in net.sansa_stack.hadoop.util
-
Jackson-based mapper that can read/write java beans from/to a hadoop configuration object (essentially a Map<String, String>).
- JsonHadoopBridge(Path<String>, JsonNode) - Constructor for class net.sansa_stack.hadoop.util.JsonHadoopBridge
L
- lang - Variable in class net.sansa_stack.hadoop.format.jena.base.FileInputFormatRdfBase
-
Input language
- lang - Variable in class net.sansa_stack.hadoop.format.jena.base.RecordReaderGenericRdfBase
- lang - Variable in class net.sansa_stack.hadoop.format.jena.base.RecordReaderRdfConf
- lastMatchedFields - Variable in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv.CustomMatcherCsv2
- lastMatchPosEnd - Variable in class net.sansa_stack.hadoop.core.pattern.CustomPatternTrigGraph.CustomMatcherTrigGraph
- lastMatchPosStart - Variable in class net.sansa_stack.hadoop.core.pattern.CustomPatternTrigGraph.CustomMatcherTrigGraph
- lastRowEnd - Variable in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv.CustomMatcherCsv2
- lastRowStart - Variable in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv.CustomMatcherCsv2
- length() - Method in class net.sansa_stack.hadoop.core.pattern.CharSequenceReverse
- lines(Seekable) - Static method in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- lineTerminatorPattern - Variable in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsvOld.Config
-
Deprecated.
- listFileSplits(Path, long, long) - Static method in class net.sansa_stack.hadoop.util.FileSplitUtils
- LocatorHdfs - Class in net.sansa_stack.hadoop.jena.locator
-
Support for resources using the "hdfs:" scheme.
- LocatorHdfs(FileSystem) - Constructor for class net.sansa_stack.hadoop.jena.locator.LocatorHdfs
- LocatorHdfs(FileSystem, String[]) - Constructor for class net.sansa_stack.hadoop.jena.locator.LocatorHdfs
- log() - Method in class net.sansa_stack.hadoop.jena.locator.LocatorHdfs
- logClose(String) - Static method in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- logUnexpectedClose(String) - Static method in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
M
- main(String[]) - Static method in class net.sansa_stack.hadoop.util.JsonHadoopBridge
- mainX(String[]) - Static method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv
- Match() - Constructor for class net.sansa_stack.hadoop.core.pattern.CustomPatternReplay.Match
- matcher - Variable in class net.sansa_stack.hadoop.core.pattern.CustomMatcherJava
- matcher(CharSequence) - Method in interface net.sansa_stack.hadoop.core.pattern.CustomPattern
- matcher(CharSequence) - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv
- matcher(CharSequence) - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsvOld
-
Deprecated.
- matcher(CharSequence) - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternFiltered
- matcher(CharSequence) - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternJava
- matcher(CharSequence) - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternReplay
- matcher(CharSequence) - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternTrigGraph
- MatcherFactory - Interface in net.sansa_stack.hadoop.core
- matches - Variable in class net.sansa_stack.hadoop.core.pattern.CustomPatternReplay.CustomMatcherReplay
- matchId - Variable in class net.sansa_stack.hadoop.core.pattern.CustomPatternReplay.CustomMatcherReplay
- MatchRegion(long, long) - Constructor for class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv.MatchRegion
- MatchState(boolean, List<CustomPatternCsv.Row>) - Constructor for class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv.MatchState
- maxConsecutiveEscapeChars - Variable in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsvOld.Config
-
Deprecated.
- maxExtraByteCount - Variable in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- maxRecordLength - Variable in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- maxRecordLengthKey - Variable in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- maxRecordLengthKey - Variable in class net.sansa_stack.hadoop.format.jena.base.RecordReaderConf
- merge(T, Object) - Static method in class net.sansa_stack.hadoop.util.JsonHadoopBridge
-
Merge the state of src into dst via json serialization
- minRecordLength - Variable in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- minRecordLengthKey - Variable in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- minRecordLengthKey - Variable in class net.sansa_stack.hadoop.format.jena.base.RecordReaderConf
- multilineFieldMaxLines - Variable in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv
N
- namedGroups - Variable in class net.sansa_stack.hadoop.core.pattern.CustomPatternReplay.Match
- namedGroups() - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternReplay.Match
- net.sansa_stack.hadoop.core - package net.sansa_stack.hadoop.core
- net.sansa_stack.hadoop.core.pattern - package net.sansa_stack.hadoop.core.pattern
- net.sansa_stack.hadoop.core.plugin - package net.sansa_stack.hadoop.core.plugin
- net.sansa_stack.hadoop.format.commons_csv.csv - package net.sansa_stack.hadoop.format.commons_csv.csv
- net.sansa_stack.hadoop.format.gson.json - package net.sansa_stack.hadoop.format.gson.json
- net.sansa_stack.hadoop.format.jena.base - package net.sansa_stack.hadoop.format.jena.base
- net.sansa_stack.hadoop.format.jena.nquads - package net.sansa_stack.hadoop.format.jena.nquads
- net.sansa_stack.hadoop.format.jena.ntriples - package net.sansa_stack.hadoop.format.jena.ntriples
- net.sansa_stack.hadoop.format.jena.trig - package net.sansa_stack.hadoop.format.jena.trig
- net.sansa_stack.hadoop.format.jena.turtle - package net.sansa_stack.hadoop.format.jena.turtle
- net.sansa_stack.hadoop.format.univocity.csv.csv - package net.sansa_stack.hadoop.format.univocity.csv.csv
- net.sansa_stack.hadoop.jena.locator - package net.sansa_stack.hadoop.jena.locator
- net.sansa_stack.hadoop.output.jena.base - package net.sansa_stack.hadoop.output.jena.base
- net.sansa_stack.hadoop.util - package net.sansa_stack.hadoop.util
- net.sansa_stack.nio.util - package net.sansa_stack.nio.util
- newCsvParser(Reader) - Method in class net.sansa_stack.hadoop.format.commons_csv.csv.RecordReaderCsv
- newInputStream(Path, Configuration) - Static method in class net.sansa_stack.hadoop.util.FileSystemUtils
- newInputStream(Path, FileSystem, Configuration) - Static method in class net.sansa_stack.hadoop.util.FileSystemUtils
- newInputStream(Path, FileSystem, CompressionCodecFactory) - Static method in class net.sansa_stack.hadoop.util.FileSystemUtils
- newlineMatchStart - Variable in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsvOld.CustomMatcherCsv
-
Deprecated.
- newReadableChannel() - Method in class net.sansa_stack.hadoop.core.SeekableSourceOverSplit
- nextKeyValue() - Method in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- nextKeyValue() - Method in class net.sansa_stack.hadoop.core.RecordReaderGenericBaseStatsWrapper
- nextQuoteEnd - Variable in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsvOld.CustomMatcherCsv
-
Deprecated.
- nextQuoteExamined - Variable in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsvOld.CustomMatcherCsv
-
Deprecated.
- nextQuoteStart - Variable in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsvOld.CustomMatcherCsv
-
Deprecated.
- nQuadsRecordStartPattern - Static variable in class net.sansa_stack.hadoop.format.jena.nquads.RecordReaderRdfNQuads
-
Match the first character after a newline
- NS_CSV_FORMAT - Static variable in class net.sansa_stack.hadoop.format.univocity.csv.csv.FileInputFormatCsvUnivocity
- nTriplesRecordStartPattern - Static variable in class net.sansa_stack.hadoop.format.jena.ntriples.RecordReaderRdfNTriples
-
Match the first character after a newline
- NUM_SPLITS - Static variable in class net.sansa_stack.hadoop.output.jena.base.OutputUtils
O
- OffsetSeekResult - Class in net.sansa_stack.hadoop.core
- OffsetSeekResult(boolean, long, long) - Constructor for class net.sansa_stack.hadoop.core.OffsetSeekResult
- OutputFormatBase<T> - Class in net.sansa_stack.hadoop.output.jena.base
- OutputFormatBase() - Constructor for class net.sansa_stack.hadoop.output.jena.base.OutputFormatBase
- OutputFormatRowSet - Class in net.sansa_stack.hadoop.output.jena.base
- OutputFormatRowSet() - Constructor for class net.sansa_stack.hadoop.output.jena.base.OutputFormatRowSet
- OutputFormatStreamRdfBase<T> - Class in net.sansa_stack.hadoop.output.jena.base
- OutputFormatStreamRdfBase() - Constructor for class net.sansa_stack.hadoop.output.jena.base.OutputFormatStreamRdfBase
- OutputFormatStreamRdfQuad - Class in net.sansa_stack.hadoop.output.jena.base
- OutputFormatStreamRdfQuad() - Constructor for class net.sansa_stack.hadoop.output.jena.base.OutputFormatStreamRdfQuad
- OutputFormatStreamRdfTriple - Class in net.sansa_stack.hadoop.output.jena.base
- OutputFormatStreamRdfTriple() - Constructor for class net.sansa_stack.hadoop.output.jena.base.OutputFormatStreamRdfTriple
- OutputUtils - Class in net.sansa_stack.hadoop.output.jena.base
- OutputUtils() - Constructor for class net.sansa_stack.hadoop.output.jena.base.OutputUtils
P
- parse(InputStream, boolean) - Method in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
-
Create a flowable from the input stream.
- parse(InputStream, boolean) - Method in class net.sansa_stack.hadoop.format.commons_csv.csv.RecordReaderCsv
- parse(InputStream, boolean) - Method in class net.sansa_stack.hadoop.format.gson.json.RecordReaderJsonArray
- parse(InputStream, boolean) - Method in class net.sansa_stack.hadoop.format.gson.json.RecordReaderJsonSequence
- parse(InputStream, boolean) - Method in class net.sansa_stack.hadoop.format.jena.base.RecordReaderGenericRdfQuadBase
- parse(InputStream, boolean) - Method in class net.sansa_stack.hadoop.format.jena.base.RecordReaderGenericRdfTripleBase
- parse(InputStream, boolean) - Method in class net.sansa_stack.hadoop.format.jena.trig.RecordReaderRdfTrigDataset
- parse(InputStream, boolean) - Method in class net.sansa_stack.hadoop.format.univocity.csv.csv.RecordReaderCsvUnivocity
- parse(Callable<InputStream>) - Method in class net.sansa_stack.hadoop.format.gson.json.RecordReaderJsonArray
- parse(Callable<InputStream>) - Method in class net.sansa_stack.hadoop.format.jena.nquads.RecordReaderRdfNQuads
- parse(Callable<InputStream>) - Method in class net.sansa_stack.hadoop.format.jena.ntriples.RecordReaderRdfNTriples
- parse(Callable<InputStream>) - Method in class net.sansa_stack.hadoop.format.jena.trig.RecordReaderRdfTrigDataset
- parse(Callable<InputStream>) - Method in class net.sansa_stack.hadoop.format.jena.trig.RecordReaderRdfTrigQuad
- parse(Callable<InputStream>) - Method in class net.sansa_stack.hadoop.format.jena.turtle.RecordReaderRdfTurtleTriple
- parse(Callable<InputStream>) - Method in class net.sansa_stack.hadoop.format.univocity.csv.csv.RecordReaderCsvUnivocity
- PARSED_PREFIXES_LENGTH_DEFAULT - Static variable in class net.sansa_stack.hadoop.format.jena.base.FileInputFormatRdfBase
- parseFromSeekable(ReadableChannel<byte[]>, boolean) - Method in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- parsePrefixes(InputStream, Configuration) - Method in interface net.sansa_stack.hadoop.format.jena.base.CanParseRdf
-
This is currently a stub method - the final method should return an Iterator of parse events, however this API only recently became public in Jena
- parsePrefixes(InputStream, Configuration) - Method in class net.sansa_stack.hadoop.format.jena.base.FileInputFormatRdfBase
-
Public method to parse prefixes w.r.t.
- parsePrefixes(FileSystem, Path, Configuration) - Method in interface net.sansa_stack.hadoop.format.jena.base.CanParseRdf
- parserFactory - Variable in class net.sansa_stack.hadoop.format.univocity.csv.csv.RecordReaderCsvUnivocity
- pattern - Variable in class net.sansa_stack.hadoop.core.pattern.CustomPatternDecoratorBase
- pattern - Variable in class net.sansa_stack.hadoop.core.pattern.CustomPatternJava
- performOpen(String) - Method in class net.sansa_stack.hadoop.jena.locator.LocatorHdfs
- pos - Variable in class net.sansa_stack.hadoop.core.pattern.CustomMatcherBase
- position() - Method in class net.sansa_stack.hadoop.util.SeekableByteChannelFromSeekableInputStream
- position(long) - Method in class net.sansa_stack.hadoop.util.SeekableByteChannelFromSeekableInputStream
- postambleBuffer - Variable in class net.sansa_stack.hadoop.core.SeekableSourceOverSplit
-
The postamble buffer is only served if a limit is set via
SeekableSourceOverSplit.Channel.setLimit(long)If no limit is set then the remainder of the stream is consumed which is assumed to include the postamble - postambleBytes - Variable in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- posToIndex - Variable in class net.sansa_stack.hadoop.core.SeekableSourceOverSplit
- postProcessRowSkipCount - Variable in class net.sansa_stack.hadoop.format.univocity.csv.csv.RecordReaderCsvUnivocity
- preambleBytes - Variable in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
-
Subclasses may initialize the pre/post-amble bytes in the
RecordReaderGenericBase.initialize(InputSplit, TaskAttemptContext)method rather than the ctor! A (possibly empty) sequence of bytes to prepended to any stream passed to the parser. - prefix - Variable in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv.Field
- PREFIXES_KEY - Static variable in class net.sansa_stack.hadoop.format.jena.base.FileInputFormatRdfBase
- PREFIXES_MAXLENGTH_KEY - Static variable in class net.sansa_stack.hadoop.format.jena.trig.RecordReaderRdfTrigDataset
- PREFIXES_MAXLENGTH_KEY - Static variable in class net.sansa_stack.hadoop.format.jena.trig.RecordReaderRdfTrigQuad
- PREFIXES_MAXLENGTH_KEY - Static variable in class net.sansa_stack.hadoop.format.jena.turtle.RecordReaderRdfTurtleTriple
- prefixesLengthMaxKey - Variable in class net.sansa_stack.hadoop.format.jena.base.FileInputFormatRdfBase
- prefixesMaxLengthKey - Variable in class net.sansa_stack.hadoop.format.jena.base.RecordReaderGenericRdfBase
- prefixesMaxLengthKey - Variable in class net.sansa_stack.hadoop.format.jena.base.RecordReaderRdfConf
- prefixMap - Variable in class net.sansa_stack.hadoop.format.jena.base.RecordReaderGenericRdfBase
- probeElementCount - Variable in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- probeElementCountKey - Variable in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- probeElementCountKey - Variable in class net.sansa_stack.hadoop.format.jena.base.RecordReaderConf
-
The maximum number of elements to parse during probing.
- prober(SeekableReadableChannel<byte[]>, BufferOverReadableChannel<U[]>) - Method in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
-
Turn a sequence of bytes into one of records.
- Prober<U> - Interface in net.sansa_stack.hadoop.core
-
Tests whether parsing a certain amount of records from the channel at a given offset returns successfully.
- probeRecordCount - Variable in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- probeRecordCountKey - Variable in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- probeRecordCountKey - Variable in class net.sansa_stack.hadoop.format.jena.base.RecordReaderConf
- ProbeStats - Interface in net.sansa_stack.hadoop.core
- prototype - Variable in class net.sansa_stack.hadoop.util.JsonHadoopBridge
Q
- quoteChar - Variable in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsvOld.Config
-
Deprecated.
- quoteErrorCount - Variable in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv.CustomMatcherCsv2
-
Number of unexpected quotations
R
- rawStream - Variable in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- RDF_DOWNGRADE_QUADS - Static variable in class net.sansa_stack.hadoop.output.jena.base.RdfOutputUtils
- RDF_FORMAT - Static variable in class net.sansa_stack.hadoop.output.jena.base.RdfOutputUtils
- RDF_PREFIXES - Static variable in class net.sansa_stack.hadoop.output.jena.base.RdfOutputUtils
- RdfOutputUtils<T> - Class in net.sansa_stack.hadoop.output.jena.base
- RdfOutputUtils() - Constructor for class net.sansa_stack.hadoop.output.jena.base.RdfOutputUtils
- read() - Method in class net.sansa_stack.hadoop.util.DeferredSeekablePushbackInputStream
- read(byte[], int, int) - Method in class net.sansa_stack.hadoop.util.DeferredSeekablePushbackInputStream
-
This method essentially delays reads by one byte.
- read(JsonNode, T) - Static method in class net.sansa_stack.hadoop.util.JsonHadoopBridge
- read(ByteBuffer) - Method in class net.sansa_stack.nio.util.InterruptingSeekableByteChannel
- read(Configuration) - Method in class net.sansa_stack.hadoop.util.JsonHadoopBridge
- read(Configuration, T) - Method in class net.sansa_stack.hadoop.util.JsonHadoopBridge
- readCsvRecords(String, FileSystem, UnivocityParserFactory) - Static method in class net.sansa_stack.hadoop.format.univocity.csv.csv.UnivocityRxUtils
-
Create a flowable to a CSV file via hadoop.
- readCsvRecords(String, FileSystem, CSVFormat) - Static method in class net.sansa_stack.hadoop.format.commons_csv.csv.CsvUtils
-
Create a flowable to a CSV file via hadoop.
- readCsvRecords(Callable<? extends InputStream>, UnivocityParserFactory) - Static method in class net.sansa_stack.hadoop.format.univocity.csv.csv.UnivocityRxUtils
-
Create a flowable to a CSV file from a supplier of input streams
- readCsvRecords(Callable<? extends InputStream>, CSVFormat) - Static method in class net.sansa_stack.hadoop.format.commons_csv.csv.CsvUtils
-
Create a flowable to a CSV file from a supplier of input streams
- readCsvRecords(Path, FileSystem, UnivocityParserFactory) - Static method in class net.sansa_stack.hadoop.format.univocity.csv.csv.UnivocityRxUtils
-
Create a flowable to a CSV file via hadoop.
- readCsvRecords(Path, FileSystem, CSVFormat) - Static method in class net.sansa_stack.hadoop.format.commons_csv.csv.CsvUtils
-
Create a flowable to a CSV file via hadoop.
- readCsvRows(String, FileSystem, UnivocityParserFactory) - Static method in class net.sansa_stack.hadoop.format.univocity.csv.csv.UnivocityRxUtils
- readCsvRows(String, FileSystem, CSVFormat) - Static method in class net.sansa_stack.hadoop.format.commons_csv.csv.CsvUtils
- readCsvRows(Path, FileSystem, UnivocityParserFactory) - Static method in class net.sansa_stack.hadoop.format.univocity.csv.csv.UnivocityRxUtils
- readCsvRows(Path, FileSystem, CSVFormat) - Static method in class net.sansa_stack.hadoop.format.commons_csv.csv.CsvUtils
- reader - Variable in class net.sansa_stack.hadoop.format.gson.json.JsonElementArrayIterator
- reader - Variable in class net.sansa_stack.hadoop.format.gson.json.JsonElementSequenceIterator
- readInternal(byte[], int, int) - Method in class net.sansa_stack.hadoop.util.DeferredSeekablePushbackInputStream
-
This method is assumed to be invoked with len >= 2
- readMode - Variable in class net.sansa_stack.hadoop.util.DeferredSeekablePushbackInputStream
-
Unsafe reads modify the byte after the reported number of read bytes in the read buffer.
- readPrefixes(Callable<InputStream>, Configuration) - Method in class net.sansa_stack.hadoop.format.jena.base.FileInputFormatRdfBase
- readPrefixesIntoModel(PrefixMap, InputStream, Lang) - Static method in class net.sansa_stack.hadoop.format.jena.base.FileInputFormatRdfBase
- readPrefixesIntoModel(PrefixMap, InputStream, Lang, Long) - Static method in class net.sansa_stack.hadoop.format.jena.base.FileInputFormatRdfBase
-
Read prefixes from an input stream.
- readPrefixesIntoModel(PrefixMap, Callable<InputStream>, Lang, Long) - Static method in class net.sansa_stack.hadoop.format.jena.base.FileInputFormatRdfBase
- readRecursively(JsonNode, JsonNode, Path<String>, Function<Path<String>, String>) - Static method in class net.sansa_stack.hadoop.util.JsonHadoopBridge
-
If dst is non-null, then the return value will be dst; otherwise, the return value is either and object or textual node matching the prototype
- ReadTooFarException() - Constructor for exception net.sansa_stack.hadoop.core.RecordReaderGenericBase.ReadTooFarException
- RECORD_MAXLENGTH_KEY - Static variable in class net.sansa_stack.hadoop.format.commons_csv.csv.RecordReaderCsv
- RECORD_MAXLENGTH_KEY - Static variable in class net.sansa_stack.hadoop.format.gson.json.RecordReaderJsonArray
- RECORD_MAXLENGTH_KEY - Static variable in class net.sansa_stack.hadoop.format.gson.json.RecordReaderJsonSequence
- RECORD_MAXLENGTH_KEY - Static variable in class net.sansa_stack.hadoop.format.jena.nquads.RecordReaderRdfNQuads
- RECORD_MAXLENGTH_KEY - Static variable in class net.sansa_stack.hadoop.format.jena.ntriples.RecordReaderRdfNTriples
- RECORD_MAXLENGTH_KEY - Static variable in class net.sansa_stack.hadoop.format.jena.trig.RecordReaderRdfTrigDataset
- RECORD_MAXLENGTH_KEY - Static variable in class net.sansa_stack.hadoop.format.jena.trig.RecordReaderRdfTrigQuad
- RECORD_MAXLENGTH_KEY - Static variable in class net.sansa_stack.hadoop.format.jena.turtle.RecordReaderRdfTurtleTriple
- RECORD_MAXLENGTH_KEY - Static variable in class net.sansa_stack.hadoop.format.univocity.csv.csv.RecordReaderCsvUnivocity
- RECORD_MINLENGTH_KEY - Static variable in class net.sansa_stack.hadoop.format.commons_csv.csv.RecordReaderCsv
- RECORD_MINLENGTH_KEY - Static variable in class net.sansa_stack.hadoop.format.gson.json.RecordReaderJsonArray
- RECORD_MINLENGTH_KEY - Static variable in class net.sansa_stack.hadoop.format.gson.json.RecordReaderJsonSequence
- RECORD_MINLENGTH_KEY - Static variable in class net.sansa_stack.hadoop.format.jena.nquads.RecordReaderRdfNQuads
- RECORD_MINLENGTH_KEY - Static variable in class net.sansa_stack.hadoop.format.jena.ntriples.RecordReaderRdfNTriples
- RECORD_MINLENGTH_KEY - Static variable in class net.sansa_stack.hadoop.format.jena.trig.RecordReaderRdfTrigDataset
- RECORD_MINLENGTH_KEY - Static variable in class net.sansa_stack.hadoop.format.jena.trig.RecordReaderRdfTrigQuad
- RECORD_MINLENGTH_KEY - Static variable in class net.sansa_stack.hadoop.format.jena.turtle.RecordReaderRdfTurtleTriple
- RECORD_MINLENGTH_KEY - Static variable in class net.sansa_stack.hadoop.format.univocity.csv.csv.RecordReaderCsvUnivocity
- RECORD_PROBECOUNT_KEY - Static variable in class net.sansa_stack.hadoop.format.commons_csv.csv.RecordReaderCsv
- RECORD_PROBECOUNT_KEY - Static variable in class net.sansa_stack.hadoop.format.gson.json.RecordReaderJsonArray
- RECORD_PROBECOUNT_KEY - Static variable in class net.sansa_stack.hadoop.format.gson.json.RecordReaderJsonSequence
- RECORD_PROBECOUNT_KEY - Static variable in class net.sansa_stack.hadoop.format.jena.nquads.RecordReaderRdfNQuads
- RECORD_PROBECOUNT_KEY - Static variable in class net.sansa_stack.hadoop.format.jena.ntriples.RecordReaderRdfNTriples
- RECORD_PROBECOUNT_KEY - Static variable in class net.sansa_stack.hadoop.format.jena.trig.RecordReaderRdfTrigDataset
- RECORD_PROBECOUNT_KEY - Static variable in class net.sansa_stack.hadoop.format.jena.trig.RecordReaderRdfTrigQuad
- RECORD_PROBECOUNT_KEY - Static variable in class net.sansa_stack.hadoop.format.jena.turtle.RecordReaderRdfTurtleTriple
- RECORD_PROBECOUNT_KEY - Static variable in class net.sansa_stack.hadoop.format.univocity.csv.csv.RecordReaderCsvUnivocity
- recordFlowCloseable - Variable in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- RecordReaderConf - Class in net.sansa_stack.hadoop.format.jena.base
- RecordReaderConf(String, String, String, String, CustomPattern) - Constructor for class net.sansa_stack.hadoop.format.jena.base.RecordReaderConf
- RecordReaderCsv - Class in net.sansa_stack.hadoop.format.commons_csv.csv
-
A generic parser implementation for CSV with the offset-seeking condition that CSV rows must all have the same length.
- RecordReaderCsv() - Constructor for class net.sansa_stack.hadoop.format.commons_csv.csv.RecordReaderCsv
- RecordReaderCsv(RecordReaderConf) - Constructor for class net.sansa_stack.hadoop.format.commons_csv.csv.RecordReaderCsv
- RecordReaderCsvUnivocity - Class in net.sansa_stack.hadoop.format.univocity.csv.csv
-
A generic parser implementation for CSV with the offset-seeking condition that CSV rows must all have the same length.
- RecordReaderCsvUnivocity() - Constructor for class net.sansa_stack.hadoop.format.univocity.csv.csv.RecordReaderCsvUnivocity
-
Create a regex for matching csv record starts.
- RecordReaderCsvUnivocity(RecordReaderConf) - Constructor for class net.sansa_stack.hadoop.format.univocity.csv.csv.RecordReaderCsvUnivocity
- RecordReaderGenericBase<U,
G, A, T> - Class in net.sansa_stack.hadoop.core -
A generic record reader that uses a callback mechanism to detect a consecutive sequence of records that must start in the current split and which may extend over any number of successor splits.
- RecordReaderGenericBase(RecordReaderConf, Accumulating<U, G, A, T>) - Constructor for class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- RecordReaderGenericBase.ReadTooFarException - Exception in net.sansa_stack.hadoop.core
-
Remove buffering from a channel.
- RecordReaderGenericBaseStatsWrapper - Class in net.sansa_stack.hadoop.core
- RecordReaderGenericBaseStatsWrapper(RecordReaderGenericBase<?, ?, ?, ?>) - Constructor for class net.sansa_stack.hadoop.core.RecordReaderGenericBaseStatsWrapper
- RecordReaderGenericRdfAccumulatingBase<U,
G, A, T> - Class in net.sansa_stack.hadoop.format.jena.base - RecordReaderGenericRdfAccumulatingBase(RecordReaderRdfConf, Accumulating<U, G, A, T>) - Constructor for class net.sansa_stack.hadoop.format.jena.base.RecordReaderGenericRdfAccumulatingBase
- RecordReaderGenericRdfBase<U,
G, A, T> - Class in net.sansa_stack.hadoop.format.jena.base - RecordReaderGenericRdfBase(RecordReaderRdfConf, Accumulating<U, G, A, T>) - Constructor for class net.sansa_stack.hadoop.format.jena.base.RecordReaderGenericRdfBase
- RecordReaderGenericRdfNonAccumulatingBase<T> - Class in net.sansa_stack.hadoop.format.jena.base
- RecordReaderGenericRdfNonAccumulatingBase(RecordReaderRdfConf) - Constructor for class net.sansa_stack.hadoop.format.jena.base.RecordReaderGenericRdfNonAccumulatingBase
- RecordReaderGenericRdfQuadBase - Class in net.sansa_stack.hadoop.format.jena.base
- RecordReaderGenericRdfQuadBase(RecordReaderRdfConf) - Constructor for class net.sansa_stack.hadoop.format.jena.base.RecordReaderGenericRdfQuadBase
- RecordReaderGenericRdfTripleBase - Class in net.sansa_stack.hadoop.format.jena.base
- RecordReaderGenericRdfTripleBase(RecordReaderRdfConf) - Constructor for class net.sansa_stack.hadoop.format.jena.base.RecordReaderGenericRdfTripleBase
- RecordReaderJsonArray - Class in net.sansa_stack.hadoop.format.gson.json
- RecordReaderJsonArray() - Constructor for class net.sansa_stack.hadoop.format.gson.json.RecordReaderJsonArray
- RecordReaderJsonArray(Gson) - Constructor for class net.sansa_stack.hadoop.format.gson.json.RecordReaderJsonArray
- RecordReaderJsonArray(RecordReaderConf, Gson) - Constructor for class net.sansa_stack.hadoop.format.gson.json.RecordReaderJsonArray
- RecordReaderJsonSequence - Class in net.sansa_stack.hadoop.format.gson.json
- RecordReaderJsonSequence() - Constructor for class net.sansa_stack.hadoop.format.gson.json.RecordReaderJsonSequence
- RecordReaderJsonSequence(Gson) - Constructor for class net.sansa_stack.hadoop.format.gson.json.RecordReaderJsonSequence
- RecordReaderJsonSequence(RecordReaderConf, Gson) - Constructor for class net.sansa_stack.hadoop.format.gson.json.RecordReaderJsonSequence
- RecordReaderRdfConf - Class in net.sansa_stack.hadoop.format.jena.base
- RecordReaderRdfConf(String, String, String, CustomPattern, String, Lang) - Constructor for class net.sansa_stack.hadoop.format.jena.base.RecordReaderRdfConf
- RecordReaderRdfNQuads - Class in net.sansa_stack.hadoop.format.jena.nquads
- RecordReaderRdfNQuads() - Constructor for class net.sansa_stack.hadoop.format.jena.nquads.RecordReaderRdfNQuads
- RecordReaderRdfNTriples - Class in net.sansa_stack.hadoop.format.jena.ntriples
- RecordReaderRdfNTriples() - Constructor for class net.sansa_stack.hadoop.format.jena.ntriples.RecordReaderRdfNTriples
- RecordReaderRdfTrigDataset - Class in net.sansa_stack.hadoop.format.jena.trig
-
RecordReader for the Trig RDF format that groups consecutive quads having the same IRI for the graph component into Datasets.
- RecordReaderRdfTrigDataset() - Constructor for class net.sansa_stack.hadoop.format.jena.trig.RecordReaderRdfTrigDataset
- RecordReaderRdfTrigDataset.AccumulatingDataset - Class in net.sansa_stack.hadoop.format.jena.trig
- RecordReaderRdfTrigQuad - Class in net.sansa_stack.hadoop.format.jena.trig
- RecordReaderRdfTrigQuad() - Constructor for class net.sansa_stack.hadoop.format.jena.trig.RecordReaderRdfTrigQuad
- RecordReaderRdfTurtleTriple - Class in net.sansa_stack.hadoop.format.jena.turtle
- RecordReaderRdfTurtleTriple() - Constructor for class net.sansa_stack.hadoop.format.jena.turtle.RecordReaderRdfTurtleTriple
- recordSearchPattern - Variable in class net.sansa_stack.hadoop.format.jena.base.RecordReaderConf
- recordStartPattern - Variable in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
-
Regex pattern to search for candidate record starts used to avoid having to invoke the actual parser (which may start a new thread) on each single character
- RecordWriterRowSetStream - Class in net.sansa_stack.hadoop.output.jena.base
-
RecordWriter implementation over
RowSetStreamWriter. - RecordWriterRowSetStream(RowSetStreamWriter, FragmentOutputSpec) - Constructor for class net.sansa_stack.hadoop.output.jena.base.RecordWriterRowSetStream
- RecordWriterStreamRDF<T> - Class in net.sansa_stack.hadoop.output.jena.base
- RecordWriterStreamRDF(StreamRDF, BiConsumer<StreamRDF, T>, AutoCloseable) - Constructor for class net.sansa_stack.hadoop.output.jena.base.RecordWriterStreamRDF
- region(int, int) - Method in interface net.sansa_stack.hadoop.core.pattern.CustomMatcher
- region(int, int) - Method in class net.sansa_stack.hadoop.core.pattern.CustomMatcherBase
- region(int, int) - Method in class net.sansa_stack.hadoop.core.pattern.CustomMatcherDecorator
- region(int, int) - Method in class net.sansa_stack.hadoop.core.pattern.CustomMatcherJava
- region(int, int) - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternFiltered.CustomMatcherFilter
- region(int, int) - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternReplay.CustomMatcherReplay
- Region(int, int) - Constructor for class net.sansa_stack.hadoop.core.pattern.CustomPatternReplay.Region
- regionEnd - Variable in class net.sansa_stack.hadoop.core.pattern.CustomMatcherBase
- regionEndProbeResult - Variable in class net.sansa_stack.hadoop.core.Stats
- regionStart - Variable in class net.sansa_stack.hadoop.core.pattern.CustomMatcherBase
- regionStartProbeResult - Variable in class net.sansa_stack.hadoop.core.Stats
- regionStartSearchReadOverRegionEnd - Variable in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- regionStartSearchReadOverRegionEnd - Variable in class net.sansa_stack.hadoop.core.Stats
-
If the search read over the region end it means that the parser had to be restarted with the detected region start and end offsets.
- regionStartSearchReadOverSplitEnd - Variable in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- regionStartSearchReadOverSplitEnd - Variable in class net.sansa_stack.hadoop.core.Stats
- requestedCsvFormat - Variable in class net.sansa_stack.hadoop.format.commons_csv.csv.RecordReaderCsv
- requestedDialect - Variable in class net.sansa_stack.hadoop.format.univocity.csv.csv.RecordReaderCsvUnivocity
- reset() - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv.CustomMatcherCsv2
- reset() - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternReplay.CustomMatcherReplay
- reverse(CharSequence, int) - Static method in class net.sansa_stack.hadoop.core.pattern.CharSequences
- reverse(CharSequence, int, int) - Static method in class net.sansa_stack.hadoop.core.pattern.CharSequences
- reverseEnd - Variable in class net.sansa_stack.hadoop.core.pattern.CharSequenceReverse
- reverseStart - Variable in class net.sansa_stack.hadoop.core.pattern.CharSequenceReverse
- Row(long, long, List<CustomPatternCsv.Field>, int) - Constructor for class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv.Row
- rows - Variable in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv.MatchState
- RowSetStreamWriterUtils - Class in net.sansa_stack.hadoop.output.jena.base
- RowSetStreamWriterUtils() - Constructor for class net.sansa_stack.hadoop.output.jena.base.RowSetStreamWriterUtils
- RS_LANG - Static variable in class net.sansa_stack.hadoop.output.jena.base.RdfOutputUtils
- RS_VARS - Static variable in class net.sansa_stack.hadoop.output.jena.base.RdfOutputUtils
S
- SansaHadoopConstants - Class in net.sansa_stack.hadoop.core
- SansaHadoopConstants() - Constructor for class net.sansa_stack.hadoop.core.SansaHadoopConstants
- SCHEME_NAMES - Static variable in class net.sansa_stack.hadoop.jena.locator.LocatorHdfs
- seek(long) - Method in class net.sansa_stack.hadoop.util.DeferredSeekablePushbackInputStream
- seek(long) - Method in interface net.sansa_stack.hadoop.util.SeekableDecorator
- seek(long) - Method in class net.sansa_stack.hadoop.util.SeekablePushbackInputStream
- seekable - Variable in class net.sansa_stack.hadoop.util.DeferredSeekablePushbackInputStream
- seekable - Variable in class net.sansa_stack.hadoop.util.SeekableByteChannelFromSeekableInputStream
- seekable - Variable in class net.sansa_stack.hadoop.util.SeekableInputStream
- seekable - Variable in class net.sansa_stack.hadoop.util.SeekablePushbackInputStream
- SeekableByteChannelFromSeekableInputStream - Class in net.sansa_stack.hadoop.util
- SeekableByteChannelFromSeekableInputStream(InputStream) - Constructor for class net.sansa_stack.hadoop.util.SeekableByteChannelFromSeekableInputStream
- SeekableByteChannelFromSeekableInputStream(InputStream, Seekable) - Constructor for class net.sansa_stack.hadoop.util.SeekableByteChannelFromSeekableInputStream
- SeekableDecorator - Interface in net.sansa_stack.hadoop.util
- SeekableInputStream - Class in net.sansa_stack.hadoop.util
-
A basic wrapper that combines Hadoop's Seekable and InputStream into one class.
- SeekableInputStream(InputStream) - Constructor for class net.sansa_stack.hadoop.util.SeekableInputStream
- SeekableInputStream(InputStream, Seekable) - Constructor for class net.sansa_stack.hadoop.util.SeekableInputStream
-
Constructs a new ProxyInputStream.
- SeekablePushbackInputStream - Class in net.sansa_stack.hadoop.util
- SeekablePushbackInputStream(InputStream, int) - Constructor for class net.sansa_stack.hadoop.util.SeekablePushbackInputStream
- SeekablePushbackInputStream(InputStream, Seekable, int) - Constructor for class net.sansa_stack.hadoop.util.SeekablePushbackInputStream
- SeekableSourceOverSplit - Class in net.sansa_stack.hadoop.core
-
A seekable source over a split (usually a hadoop input split).
- SeekableSourceOverSplit(ReadableChannel<byte[]>, BufferOverReadableChannel<byte[]>, BufferOverReadableChannel<byte[]>, BufferOverReadableChannel<byte[]>, NavigableMap<Long, Long>) - Constructor for class net.sansa_stack.hadoop.core.SeekableSourceOverSplit
-
If true then the headStream can no longer be used.
- seekToNewSource(long) - Method in class net.sansa_stack.hadoop.util.DeferredSeekablePushbackInputStream
- seekToNewSource(long) - Method in interface net.sansa_stack.hadoop.util.SeekableDecorator
- seekToNewSource(long) - Method in class net.sansa_stack.hadoop.util.SeekablePushbackInputStream
- sendRecordToStreamRdf - Variable in class net.sansa_stack.hadoop.output.jena.base.RecordWriterStreamRDF
- sendRecordToStreamRdf(StreamRDF, Triple) - Method in class net.sansa_stack.hadoop.output.jena.base.OutputFormatStreamRdfTriple
- sendRecordToStreamRdf(StreamRDF, Quad) - Method in class net.sansa_stack.hadoop.output.jena.base.OutputFormatStreamRdfQuad
- sendRecordToStreamRdf(StreamRDF, T) - Method in class net.sansa_stack.hadoop.output.jena.base.OutputFormatStreamRdfBase
- setCandidatePos(Long) - Method in interface net.sansa_stack.hadoop.core.ProbeStats
- setCsvFormat(Configuration, CSVFormat) - Static method in class net.sansa_stack.hadoop.format.commons_csv.csv.FileInputFormatCsv
- setErrorMessage(String) - Method in interface net.sansa_stack.hadoop.core.Stats2
- setFirstBlock(Long) - Method in interface net.sansa_stack.hadoop.core.Stats2
- setInQuotedField(boolean) - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv.CustomMatcherCsv2
- setJson(Configuration, Gson, String, Object) - Static method in class net.sansa_stack.hadoop.util.ConfigurationUtils
- setLang(Configuration, Lang) - Static method in class net.sansa_stack.hadoop.output.jena.base.RdfOutputUtils
- setMapQuadsToTriplesForTripleLangs(Configuration, boolean) - Static method in class net.sansa_stack.hadoop.output.jena.base.RdfOutputUtils
- setPrefixes(Configuration, PrefixMapping) - Static method in class net.sansa_stack.hadoop.output.jena.base.RdfOutputUtils
- setProbeCount(Long) - Method in interface net.sansa_stack.hadoop.core.ProbeStats
- setRdfFormat(Configuration, RDFFormat) - Static method in class net.sansa_stack.hadoop.output.jena.base.RdfOutputUtils
- setRecordCount(long) - Method in class net.sansa_stack.hadoop.core.Stats
- setRecordCount(Long) - Method in interface net.sansa_stack.hadoop.core.Stats2
- setRegionEndProbeResult(ProbeResult) - Method in class net.sansa_stack.hadoop.core.Stats
- setRegionEndProbeResult(ProbeStats) - Method in interface net.sansa_stack.hadoop.core.Stats2
- setRegionStartProbeResult(ProbeResult) - Method in class net.sansa_stack.hadoop.core.Stats
- setRegionStartProbeResult(ProbeStats) - Method in interface net.sansa_stack.hadoop.core.Stats2
- setRegionStartSearchReadOverRegionEnd(Boolean) - Method in class net.sansa_stack.hadoop.core.Stats
- setRegionStartSearchReadOverRegionEnd(Boolean) - Method in interface net.sansa_stack.hadoop.core.Stats2
- setRegionStartSearchReadOverSplitEnd(Boolean) - Method in class net.sansa_stack.hadoop.core.Stats
- setRegionStartSearchReadOverSplitEnd(Boolean) - Method in interface net.sansa_stack.hadoop.core.Stats2
- setSerializable(Configuration, String, Serializable) - Static method in class net.sansa_stack.hadoop.util.ConfigurationUtils
-
Set a serializable object as a base64 url encoded string
- setSplitCount(Configuration, int) - Static method in class net.sansa_stack.hadoop.output.jena.base.OutputUtils
- setSplitEnd(long) - Method in class net.sansa_stack.hadoop.core.Stats
- setSplitSize(Long) - Method in interface net.sansa_stack.hadoop.core.Stats2
- setSplitStart(long) - Method in class net.sansa_stack.hadoop.core.Stats
- setSplitStart(Long) - Method in interface net.sansa_stack.hadoop.core.Stats2
- setStreamToInterval(long, long) - Method in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
-
Seek to a given offset and prepare to read up to the 'end' position (exclusive) For non-encoded streams this is just performs a seek on th stream and returns start/end unchanged.
- setTailElementCount(long) - Method in class net.sansa_stack.hadoop.core.Stats
- setTailElementCount(Integer) - Method in interface net.sansa_stack.hadoop.core.Stats2
- setTotalBytesRead(Long) - Method in class net.sansa_stack.hadoop.core.Stats
- setTotalBytesRead(Long) - Method in interface net.sansa_stack.hadoop.core.Stats2
- setTotalDuration(Double) - Method in interface net.sansa_stack.hadoop.core.ProbeStats
- setTotalElementCount(long) - Method in class net.sansa_stack.hadoop.core.Stats
- setTotalElementCount(Long) - Method in interface net.sansa_stack.hadoop.core.Stats2
- setTotalRecordCount(long) - Method in class net.sansa_stack.hadoop.core.Stats
- setTotalRecordCount(Long) - Method in interface net.sansa_stack.hadoop.core.Stats2
- setTotalTime(Double) - Method in interface net.sansa_stack.hadoop.core.Stats2
- setUnivocityConfig(Configuration, UnivocityCsvwConf) - Static method in class net.sansa_stack.hadoop.format.univocity.csv.csv.FileInputFormatCsvUnivocity
- setupParser(InputStream, boolean) - Method in class net.sansa_stack.hadoop.format.jena.base.RecordReaderGenericRdfBase
- setupTailBuffer() - Method in class net.sansa_stack.hadoop.core.SeekableSourceOverSplit
- setVars(Configuration, List<Var>) - Static method in class net.sansa_stack.hadoop.output.jena.base.RdfOutputUtils
- size() - Method in class net.sansa_stack.hadoop.core.SeekableSourceOverSplit
- size() - Method in class net.sansa_stack.hadoop.util.SeekableByteChannelFromSeekableInputStream
- skipRecordCount - Variable in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- split - Variable in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- splitEnd - Variable in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- splitEnd - Variable in class net.sansa_stack.hadoop.core.Stats
- splitId - Variable in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- splitIdFn(long, long) - Static method in class net.sansa_stack.hadoop.core.TailBufferChannel
- splitLength - Variable in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- splitName - Variable in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- splitStart - Variable in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- splitStart - Variable in class net.sansa_stack.hadoop.core.Stats
- stackTraceConsumer - Variable in class net.sansa_stack.hadoop.util.InputStreamWithCloseLogging
- start - Variable in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv.MatchRegion
- start() - Method in class net.sansa_stack.hadoop.core.pattern.CustomMatcherDecorator
- start() - Method in class net.sansa_stack.hadoop.core.pattern.CustomMatcherJava
- start() - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv.CustomMatcherCsv2
- start() - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsvOld.CustomMatcherCsv
-
Deprecated.
- start() - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternFiltered.CustomMatcherFilter
- start() - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternReplay.CustomMatcherReplay
- start() - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternTrigGraph.CustomMatcherTrigGraph
- start() - Method in class net.sansa_stack.hadoop.core.plugin.JenaPluginSansaHadoop
- start(int) - Method in class net.sansa_stack.hadoop.core.pattern.CustomMatcherBase
- start(int) - Method in class net.sansa_stack.hadoop.core.pattern.CustomMatcherDecorator
- start(int) - Method in class net.sansa_stack.hadoop.core.pattern.CustomMatcherJava
- start(int) - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternReplay.CustomMatcherReplay
- start(String) - Method in interface net.sansa_stack.hadoop.core.pattern.CustomMatcher
- start(String) - Method in class net.sansa_stack.hadoop.core.pattern.CustomMatcherBase
- start(String) - Method in class net.sansa_stack.hadoop.core.pattern.CustomMatcherJava
- start(String) - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv.CustomMatcherCsv2
- start(String) - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternReplay.CustomMatcherReplay
- startOfQuotedFieldBwdPattern - Variable in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsvOld
-
Deprecated.
- startPos - Variable in class net.sansa_stack.hadoop.core.OffsetSeekResult
- stats - Variable in class net.sansa_stack.hadoop.core.RecordReaderGenericBaseStatsWrapper
- Stats - Class in net.sansa_stack.hadoop.core
- Stats() - Constructor for class net.sansa_stack.hadoop.core.Stats
- Stats2 - Interface in net.sansa_stack.hadoop.core
- stop() - Method in class net.sansa_stack.hadoop.core.plugin.JenaPluginSansaHadoop
- stream - Variable in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- streamFileSplits(Path, long, long) - Static method in class net.sansa_stack.hadoop.util.FileSplitUtils
-
Utility method to create a specific number of splits for a file.
- streamRdf - Variable in class net.sansa_stack.hadoop.output.jena.base.RecordWriterStreamRDF
- StreamRDFUtils - Class in net.sansa_stack.hadoop.output.jena.base
- StreamRDFUtils() - Constructor for class net.sansa_stack.hadoop.output.jena.base.StreamRDFUtils
- subSequence(int, int) - Method in class net.sansa_stack.hadoop.core.pattern.CharSequenceReverse
- substitute(String, CustomPatternCsvOld.Config) - Static method in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsvOld
-
Deprecated.
- success - Variable in class net.sansa_stack.hadoop.core.OffsetSeekResult
- suffix - Variable in class net.sansa_stack.hadoop.core.pattern.CustomPatternCsv.Field
T
- tailBuffer - Variable in class net.sansa_stack.hadoop.core.SeekableSourceOverSplit
- TailBufferChannel - Class in net.sansa_stack.hadoop.core
- TailBufferChannel() - Constructor for class net.sansa_stack.hadoop.core.TailBufferChannel
- tailByteBuffer - Variable in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- tailBytes - Variable in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- tailElementCount - Variable in class net.sansa_stack.hadoop.core.Stats
- tailEltBuffer - Variable in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- tailElts - Variable in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- tailEltsTime - Variable in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- tailRecordOffset - Variable in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- toString() - Method in class net.sansa_stack.hadoop.core.pattern.CharSequenceReverse
- toString() - Method in class net.sansa_stack.hadoop.core.pattern.CustomPatternReplay.Region
- toString() - Method in class net.sansa_stack.hadoop.core.Stats
- toString(CharSequence) - Static method in class net.sansa_stack.hadoop.core.pattern.CharSequences
- totalBytesRead - Variable in class net.sansa_stack.hadoop.core.Stats
- totalElementCount - Variable in class net.sansa_stack.hadoop.core.Stats
- totalEltCount - Variable in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- totalRecordCount - Variable in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- totalRecordCount - Variable in class net.sansa_stack.hadoop.core.Stats
- TRIG_GRAPH_REVERSE_PATTERN - Static variable in class net.sansa_stack.hadoop.core.pattern.CustomPatternTrigGraph
- trigFwdPattern - Static variable in class net.sansa_stack.hadoop.format.jena.trig.RecordReaderRdfTrigDataset
- trigFwdPattern - Static variable in class net.sansa_stack.hadoop.format.jena.trig.RecordReaderRdfTrigQuad
- trigFwdPatternGraphFollowedByCurlyBrace - Static variable in class net.sansa_stack.hadoop.format.jena.trig.RecordReaderRdfTrigDataset
-
This is the pattern for directives or trig data where graphs are separated by '{'.
- trigFwdPatternNew - Static variable in class net.sansa_stack.hadoop.format.jena.trig.RecordReaderRdfTrigDataset
-
This is the pattern for directives or (compact) IRIs.
- truncate(long) - Method in class net.sansa_stack.hadoop.util.SeekableByteChannelFromSeekableInputStream
- turtleRecordStartPattern - Static variable in class net.sansa_stack.hadoop.format.jena.turtle.RecordReaderRdfTurtleTriple
-
Syntatic constructs in Turtle can start with: TODO Anything missing? base / @base prefix / @prefix @lt;foo;> - an IRI [ ] - a blank node foo: - a CURIE
U
- unbufferedStream(BufferOverReadableChannel<T[]>) - Static method in class net.sansa_stack.hadoop.core.RecordReaderGenericBase
- UnivocityRxUtils - Class in net.sansa_stack.hadoop.format.univocity.csv.csv
- UnivocityRxUtils() - Constructor for class net.sansa_stack.hadoop.format.univocity.csv.csv.UnivocityRxUtils
W
- wrap(InputStream, Function<? super Throwable, String>, Consumer<? super String>) - Static method in class net.sansa_stack.hadoop.util.InputStreamWithCloseLogging
-
Convenience method for e.g.
- wrap(CustomPattern) - Static method in class net.sansa_stack.hadoop.core.pattern.CustomPatternReplay
- write(Long, Binding) - Method in class net.sansa_stack.hadoop.output.jena.base.RecordWriterRowSetStream
- write(Long, T) - Method in class net.sansa_stack.hadoop.output.jena.base.RecordWriterStreamRDF
- write(Object) - Static method in class net.sansa_stack.hadoop.util.JsonHadoopBridge
- write(ByteBuffer) - Method in class net.sansa_stack.hadoop.util.SeekableByteChannelFromSeekableInputStream
- write(Configuration, JsonNode) - Method in class net.sansa_stack.hadoop.util.JsonHadoopBridge
- write(Configuration, Object) - Method in class net.sansa_stack.hadoop.util.JsonHadoopBridge
- writer - Variable in class net.sansa_stack.hadoop.output.jena.base.RecordWriterRowSetStream
- writeRecursively(JsonNode, JsonNode, Path<String>, BiConsumer<Path<String>, String>) - Static method in class net.sansa_stack.hadoop.util.JsonHadoopBridge
X
- xschemeNames - Variable in class net.sansa_stack.hadoop.jena.locator.LocatorHdfs
All Classes and Interfaces|All Packages|Constant Field Values|Serialized Form