Class RecordReaderGenericRdfBase<U,G,A,T>
java.lang.Object
org.apache.hadoop.mapreduce.RecordReader<org.apache.hadoop.io.LongWritable,T>
net.sansa_stack.hadoop.core.RecordReaderGenericBase<U,G,A,T>
net.sansa_stack.hadoop.format.jena.base.RecordReaderGenericRdfBase<U,G,A,T>
- All Implemented Interfaces:
Closeable, AutoCloseable
- Direct Known Subclasses:
RecordReaderGenericRdfAccumulatingBase, RecordReaderGenericRdfNonAccumulatingBase
-
Nested Class Summary
Nested classes/interfaces inherited from class net.sansa_stack.hadoop.core.RecordReaderGenericBase
RecordReaderGenericBase.ReadTooFarException
-
Field Summary
Fields
Modifier and Type                                  Field                   Description
protected String                                   baseIri
protected final String                             baseIriKey
protected final String                             headerBytesKey
protected org.apache.jena.riot.Lang                lang
protected String                                   prefixesMaxLengthKey
protected org.apache.jena.riot.system.PrefixMap    prefixMap
Fields inherited from class net.sansa_stack.hadoop.core.RecordReaderGenericBase
accumulating, codec, currentKey, currentValue, datasetFlow, decompressor, EMPTY_BYTE_ARRAY, enableStats, isEncoded, isFirstSplit, maxExtraByteCount, maxRecordLength, maxRecordLengthKey, minRecordLength, minRecordLengthKey, postambleBytes, preambleBytes, probeElementCount, probeElementCountKey, probeRecordCount, probeRecordCountKey, rawStream, recordFlowCloseable, recordStartPattern, regionStartSearchReadOverRegionEnd, regionStartSearchReadOverSplitEnd, skipRecordCount, split, splitEnd, splitId, splitLength, splitName, splitStart, stream, tailByteBuffer, tailBytes, tailEltBuffer, tailElts, tailEltsTime, tailRecordOffset, totalEltCount, totalRecordCount
-
Constructor Summary
Constructors
Constructor    Description
RecordReaderGenericRdfBase(RecordReaderRdfConf conf, Accumulating<U, G, A, T> accumulating)
-
Method Summary
Modifier and Type    Method    Description
void    initialize(org.apache.hadoop.mapreduce.InputSplit inputSplit, org.apache.hadoop.mapreduce.TaskAttemptContext context)
    Reads the configuration parameters (prefixes, length thresholds, ...) and examines the codec to set an internal flag indicating whether the stream is encoded.
protected org.apache.jena.riot.system.AsyncParserBuilder    setupParser(InputStream in, boolean isProbe)
Methods inherited from class net.sansa_stack.hadoop.core.RecordReaderGenericBase
abbreviate, abbreviate, abbreviateAsUTF8, aggregate, aggregate, close, convert, createMatcherFactory, createRecordFlow, detectTail, didHitSplitBound, effectiveInputStream, effectiveInputStreamSupp, findFirstPositionWithProbeSuccess, findNextRegion, getCurrentKey, getCurrentValue, getPos, getPosition, getProgress, getStats, initRecordFlow, lines, logClose, logUnexpectedClose, nextKeyValue, parse, parseFromSeekable, prober, setStreamToInterval, unbufferedStream
-
Field Details
-
baseIriKey
-
headerBytesKey
-
prefixesMaxLengthKey
-
baseIri
-
lang
protected org.apache.jena.riot.Lang lang
-
prefixMap
protected org.apache.jena.riot.system.PrefixMap prefixMap
-
-
Constructor Details
-
RecordReaderGenericRdfBase
-
-
Method Details
-
initialize
public void initialize(org.apache.hadoop.mapreduce.InputSplit inputSplit, org.apache.hadoop.mapreduce.TaskAttemptContext context) throws IOException
Description copied from class: RecordReaderGenericBase
Reads the configuration parameters (prefixes, length thresholds, ...) and examines the codec to set an internal flag indicating whether the stream is encoded.
- Overrides:
initialize in class RecordReaderGenericBase<U,G,A,T>
- Throws:
IOException
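The initialize description above amounts to two steps: read tuning parameters from the job configuration, falling back to defaults, and decide from the codec whether the input is encoded. The following is a minimal JDK-only sketch of that pattern, not the SANSA implementation. A Map<String, String> stands in for Hadoop's Configuration, a file-suffix check stands in for codec detection, and the class SplitSettings, the key names, and the default values are all hypothetical.

```java
import java.util.Locale;
import java.util.Map;
import java.util.Set;

/** Hypothetical holder for the state a record reader might derive during initialize(). */
final class SplitSettings {
    // Illustrative key names and defaults (assumptions). The real keys live in inherited
    // fields such as maxRecordLengthKey and probeRecordCountKey, whose values this page
    // does not list.
    static final String MAX_RECORD_LENGTH_KEY = "example.record.maxlength";
    static final String PROBE_RECORD_COUNT_KEY = "example.record.probecount";
    static final long DEFAULT_MAX_RECORD_LENGTH = 10_000L;
    static final int DEFAULT_PROBE_RECORD_COUNT = 100;

    // Suffixes treated as "encoded" in this sketch; a stand-in for real codec lookup.
    private static final Set<String> ENCODED_SUFFIXES = Set.of(".gz", ".bz2");

    final long maxRecordLength;
    final int probeRecordCount;
    final boolean isEncoded;

    private SplitSettings(long maxRecordLength, int probeRecordCount, boolean isEncoded) {
        this.maxRecordLength = maxRecordLength;
        this.probeRecordCount = probeRecordCount;
        this.isEncoded = isEncoded;
    }

    /** Reads thresholds from the configuration map and flags compressed input by file suffix. */
    static SplitSettings from(Map<String, String> conf, String fileName) {
        // Step 1: configuration parameters, with defaults when a key is absent.
        long maxLen = Long.parseLong(
                conf.getOrDefault(MAX_RECORD_LENGTH_KEY, Long.toString(DEFAULT_MAX_RECORD_LENGTH)));
        int probeCount = Integer.parseInt(
                conf.getOrDefault(PROBE_RECORD_COUNT_KEY, Integer.toString(DEFAULT_PROBE_RECORD_COUNT)));

        // Step 2: decide whether the stream is encoded (case-insensitive suffix match).
        String lower = fileName.toLowerCase(Locale.ROOT);
        boolean encoded = ENCODED_SUFFIXES.stream().anyMatch(lower::endsWith);

        return new SplitSettings(maxLen, probeCount, encoded);
    }
}
```

In an actual Hadoop job the values would come from context.getConfiguration(), and the codec would typically be resolved with org.apache.hadoop.io.compress.CompressionCodecFactory rather than by suffix matching.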
-
setupParser
protected org.apache.jena.riot.system.AsyncParserBuilder setupParser(InputStream in, boolean isProbe)
-