Class RecordReaderRdfNTriples
java.lang.Object
org.apache.hadoop.mapreduce.RecordReader<org.apache.hadoop.io.LongWritable,T>
net.sansa_stack.hadoop.core.RecordReaderGenericBase<U,G,A,T>
net.sansa_stack.hadoop.format.jena.base.RecordReaderGenericRdfBase<T,T,T,T>
net.sansa_stack.hadoop.format.jena.base.RecordReaderGenericRdfNonAccumulatingBase<org.apache.jena.graph.Triple>
net.sansa_stack.hadoop.format.jena.base.RecordReaderGenericRdfTripleBase
net.sansa_stack.hadoop.format.jena.ntriples.RecordReaderRdfNTriples
- All Implemented Interfaces:
Closeable,AutoCloseable
-
Nested Class Summary
Nested classes/interfaces inherited from class net.sansa_stack.hadoop.core.RecordReaderGenericBase
RecordReaderGenericBase.ReadTooFarException -
Field Summary
FieldsModifier and TypeFieldDescriptionprotected static final CustomPatternMatch the first character after a newlinestatic final Stringstatic final Stringstatic final StringFields inherited from class net.sansa_stack.hadoop.format.jena.base.RecordReaderGenericRdfBase
baseIri, baseIriKey, headerBytesKey, lang, prefixesMaxLengthKey, prefixMapFields inherited from class net.sansa_stack.hadoop.core.RecordReaderGenericBase
accumulating, codec, currentKey, currentValue, datasetFlow, decompressor, EMPTY_BYTE_ARRAY, enableStats, isEncoded, isFirstSplit, maxExtraByteCount, maxRecordLength, maxRecordLengthKey, minRecordLength, minRecordLengthKey, postambleBytes, preambleBytes, probeElementCount, probeElementCountKey, probeRecordCount, probeRecordCountKey, rawStream, recordFlowCloseable, recordStartPattern, regionStartSearchReadOverRegionEnd, regionStartSearchReadOverSplitEnd, skipRecordCount, split, splitEnd, splitId, splitLength, splitName, splitStart, stream, tailByteBuffer, tailBytes, tailEltBuffer, tailElts, tailEltsTime, tailRecordOffset, totalEltCount, totalRecordCount -
Constructor Summary
Constructors -
Method Summary
Modifier and TypeMethodDescriptionprotected io.reactivex.rxjava3.core.Flowable<org.apache.jena.graph.Triple>parse(Callable<InputStream> inputStreamSupplier) Methods inherited from class net.sansa_stack.hadoop.format.jena.base.RecordReaderGenericRdfTripleBase
parseMethods inherited from class net.sansa_stack.hadoop.format.jena.base.RecordReaderGenericRdfBase
initialize, setupParserMethods inherited from class net.sansa_stack.hadoop.core.RecordReaderGenericBase
abbreviate, abbreviate, abbreviateAsUTF8, aggregate, aggregate, close, convert, createMatcherFactory, createRecordFlow, detectTail, didHitSplitBound, effectiveInputStream, effectiveInputStreamSupp, findFirstPositionWithProbeSuccess, findNextRegion, getCurrentKey, getCurrentValue, getPos, getPosition, getProgress, getStats, initRecordFlow, lines, logClose, logUnexpectedClose, nextKeyValue, parseFromSeekable, prober, setStreamToInterval, unbufferedStream
-
Field Details
-
RECORD_MINLENGTH_KEY
- See Also:
-
RECORD_MAXLENGTH_KEY
- See Also:
-
RECORD_PROBECOUNT_KEY
- See Also:
-
nTriplesRecordStartPattern
Match the first character after a newline
-
-
Constructor Details
-
RecordReaderRdfNTriples
public RecordReaderRdfNTriples()
-
-
Method Details
-
parse
protected io.reactivex.rxjava3.core.Flowable<org.apache.jena.graph.Triple> parse(Callable<InputStream> inputStreamSupplier)
-