Class RecordReaderRdfTurtleTriple
- java.lang.Object
-
- org.apache.hadoop.mapreduce.RecordReader<org.apache.hadoop.io.LongWritable,T>
-
- net.sansa_stack.hadoop.core.RecordReaderGenericBase<U,G,A,T>
-
- net.sansa_stack.hadoop.format.jena.base.RecordReaderGenericRdfBase<T,T,T,T>
-
- net.sansa_stack.hadoop.format.jena.base.RecordReaderGenericRdfNonAccumulatingBase<org.apache.jena.graph.Triple>
-
- net.sansa_stack.hadoop.format.jena.turtle.RecordReaderRdfTurtleTriple
-
- All Implemented Interfaces:
Closeable
,AutoCloseable
public class RecordReaderRdfTurtleTriple extends RecordReaderGenericRdfNonAccumulatingBase<org.apache.jena.graph.Triple>
-
-
Field Summary
Fields Modifier and Type Field Description static String
PREFIXES_MAXLENGTH_KEY
static String
RECORD_MAXLENGTH_KEY
static String
RECORD_MINLENGTH_KEY
static String
RECORD_PROBECOUNT_KEY
protected static Pattern
turtleRecordStartPattern
Syntatic constructs in Turtle can start with: TODO Anything missing? base / @base prefix / @prefix @lt;foo;> - an IRI [ ] - a blank node foo: - a CURIE-
Fields inherited from class net.sansa_stack.hadoop.format.jena.base.RecordReaderGenericRdfBase
baseIri, baseIriKey, headerBytesKey, lang, prefixesMaxLengthKey
-
Fields inherited from class net.sansa_stack.hadoop.core.RecordReaderGenericBase
accumulating, codec, currentKey, currentValue, datasetFlow, decompressor, EMPTY_BYTE_ARRAY, isEncoded, isFirstSplit, maxRecordLength, maxRecordLengthKey, minRecordLength, minRecordLengthKey, postambleBytes, preambleBytes, probeRecordCount, probeRecordCountKey, raisedThrowable, rawStream, recordStartPattern, split, splitEnd, splitLength, splitStart, stream
-
-
Constructor Summary
Constructors Constructor Description RecordReaderRdfTurtleTriple()
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description protected io.reactivex.rxjava3.core.Flowable<org.apache.jena.graph.Triple>
parse(Callable<InputStream> inputStreamSupplier)
Create a flowable from the input stream.-
Methods inherited from class net.sansa_stack.hadoop.format.jena.base.RecordReaderGenericRdfBase
initialize
-
Methods inherited from class net.sansa_stack.hadoop.core.RecordReaderGenericBase
aggregate, close, createRecordFlow, didHitSplitBound, effectiveInputStream, effectiveInputStreamSupp, findFirstPositionWithProbeSuccess, findNextRecord, getCurrentKey, getCurrentValue, getPos, getProgress, initRecordFlow, lines, logClose, logUnexpectedClose, nextKeyValue, parseFromSeekable, prober, setStreamToInterval
-
-
-
-
Field Detail
-
RECORD_MINLENGTH_KEY
public static final String RECORD_MINLENGTH_KEY
- See Also:
- Constant Field Values
-
RECORD_MAXLENGTH_KEY
public static final String RECORD_MAXLENGTH_KEY
- See Also:
- Constant Field Values
-
RECORD_PROBECOUNT_KEY
public static final String RECORD_PROBECOUNT_KEY
- See Also:
- Constant Field Values
-
PREFIXES_MAXLENGTH_KEY
public static final String PREFIXES_MAXLENGTH_KEY
- See Also:
- Constant Field Values
-
turtleRecordStartPattern
protected static final Pattern turtleRecordStartPattern
Syntatic constructs in Turtle can start with: TODO Anything missing?- base / @base
- prefix / @prefix
- @lt;foo;> - an IRI
- [ ] - a blank node
- foo: - a CURIE
-
-
Method Detail
-
parse
protected io.reactivex.rxjava3.core.Flowable<org.apache.jena.graph.Triple> parse(Callable<InputStream> inputStreamSupplier)
Description copied from class:RecordReaderGenericBase
Create a flowable from the input stream. The input stream may be incorrectly positioned in which case the Flowable is expected to indicate this by raising an error event.- Specified by:
parse
in classRecordReaderGenericBase<org.apache.jena.graph.Triple,org.apache.jena.graph.Triple,org.apache.jena.graph.Triple,org.apache.jena.graph.Triple>
- Parameters:
inputStreamSupplier
- A supplier of input streams. May supply the same underlying stream on each call hence only at most a single stream should be taken from the supplier. Supplied streams are safe to use in try-with-resources blocks (possibly using CloseShieldInputStream). Taken streams should be closed by the client code.- Returns:
-
-