Class FileInputFormatRdfBase<T>

    • Nested Class Summary

      • Nested classes/interfaces inherited from class org.apache.hadoop.mapreduce.lib.input.FileInputFormat

        org.apache.hadoop.mapreduce.lib.input.FileInputFormat.Counter
    • Field Summary

      Fields 
      Modifier and Type Field Description
      static String BASE_IRI_KEY  
      protected org.apache.jena.riot.Lang lang
      Input language
      static long PARSED_PREFIXES_LENGTH_DEFAULT  
      static String PREFIXES_KEY  
      protected String prefixesLengthMaxKey  
      • Fields inherited from class org.apache.hadoop.mapreduce.lib.input.FileInputFormat

        DEFAULT_LIST_STATUS_NUM_THREADS, INPUT_DIR, INPUT_DIR_RECURSIVE, LIST_STATUS_NUM_THREADS, NUM_INPUT_FILES, PATHFILTER_CLASS, SPLIT_MAXSIZE, SPLIT_MINSIZE
    • Constructor Summary

      Constructors 
      Constructor Description
      FileInputFormatRdfBase(org.apache.jena.riot.Lang lang, String prefixesLengthMaxKey)
    • Method Summary

      Modifier and Type Method Description
      org.apache.hadoop.mapreduce.RecordReader<org.apache.hadoop.io.LongWritable,T> createRecordReader(org.apache.hadoop.mapreduce.InputSplit inputSplit, org.apache.hadoop.mapreduce.TaskAttemptContext context)
      abstract org.apache.hadoop.mapreduce.RecordReader<org.apache.hadoop.io.LongWritable,T> createRecordReaderActual(org.apache.hadoop.mapreduce.InputSplit inputSplit, org.apache.hadoop.mapreduce.TaskAttemptContext context)
      static org.apache.jena.rdf.model.Model getModel(org.apache.hadoop.conf.Configuration conf)
      Extract a Model from a Hadoop Configuration using PREFIXES_KEY.
      static org.apache.jena.rdf.model.Model getModel(org.apache.hadoop.conf.Configuration conf, String key)
      Extract a Model from a Hadoop Configuration.
      List<org.apache.hadoop.mapreduce.InputSplit> getSplits(org.apache.hadoop.mapreduce.JobContext job)
      boolean isSplitable(org.apache.hadoop.mapreduce.JobContext context, org.apache.hadoop.fs.Path file)
      • Methods inherited from class org.apache.hadoop.mapreduce.lib.input.FileInputFormat

        addInputPath, addInputPathRecursively, addInputPaths, computeSplitSize, getBlockIndex, getFormatMinSplitSize, getInputDirRecursive, getInputPathFilter, getInputPaths, getMaxSplitSize, getMinSplitSize, listStatus, makeSplit, makeSplit, setInputDirRecursive, setInputPathFilter, setInputPaths, setInputPaths, setMaxInputSplitSize, setMinInputSplitSize
    • Constructor Detail

      • FileInputFormatRdfBase

        public FileInputFormatRdfBase(org.apache.jena.riot.Lang lang,
                                      String prefixesLengthMaxKey)
    • Method Detail

      • isSplitable

        public boolean isSplitable(org.apache.hadoop.mapreduce.JobContext context,
                                   org.apache.hadoop.fs.Path file)
        Overrides:
        isSplitable in class org.apache.hadoop.mapreduce.lib.input.FileInputFormat<org.apache.hadoop.io.LongWritable,T>
      • createRecordReader

        public final org.apache.hadoop.mapreduce.RecordReader<org.apache.hadoop.io.LongWritable,T> createRecordReader(org.apache.hadoop.mapreduce.InputSplit inputSplit,
                                                                                                                            org.apache.hadoop.mapreduce.TaskAttemptContext context)
        Specified by:
        createRecordReader in class org.apache.hadoop.mapreduce.InputFormat<org.apache.hadoop.io.LongWritable,T>
      • createRecordReaderActual

        public abstract org.apache.hadoop.mapreduce.RecordReader<org.apache.hadoop.io.LongWritable,T> createRecordReaderActual(org.apache.hadoop.mapreduce.InputSplit inputSplit,
                                                                                                                                     org.apache.hadoop.mapreduce.TaskAttemptContext context)
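        The pairing of a final createRecordReader with an abstract createRecordReaderActual suggests a template-method arrangement, in which the final method is the fixed entry point and the abstract one is the per-format extension point. The page does not say what the final method does before delegating, so the sketch below is only an illustration under that assumption. It uses plain Java stand-ins rather than the Hadoop types: SketchReader, BaseFormat, TurtleFormat, and the "language prefix" step are all hypothetical names invented for this example.

```java
// Hypothetical stand-in for a record reader; NOT the Hadoop RecordReader API.
interface SketchReader {
    String describe();
}

// Hypothetical base class mirroring the final/abstract method pairing.
abstract class BaseFormat {
    // Stands in for the protected `lang` field of the documented class.
    protected final String lang;

    BaseFormat(String lang) {
        this.lang = lang;
    }

    // Final entry point: subclasses cannot override it,
    // so any shared preparation is guaranteed to run.
    public final SketchReader createReader(String split) {
        // Assumed shared step (not confirmed by the source):
        // tag the split with the input language before delegating.
        String prepared = lang + ":" + split;
        return createReaderActual(prepared);
    }

    // Extension point that every concrete format must implement.
    public abstract SketchReader createReaderActual(String preparedSplit);
}

// Hypothetical concrete format.
class TurtleFormat extends BaseFormat {
    TurtleFormat() {
        super("TURTLE");
    }

    @Override
    public SketchReader createReaderActual(String preparedSplit) {
        // SketchReader has a single method, so a lambda can implement it.
        return () -> "reader for " + preparedSplit;
    }
}

class TemplateMethodDemo {
    public static void main(String[] args) {
        SketchReader reader = new TurtleFormat().createReader("part-0");
        System.out.println(reader.describe()); // prints "reader for TURTLE:part-0"
    }
}
```

        Under this reading, a subclass only supplies createRecordReaderActual, and callers (including the Hadoop framework) always go through createRecordReader.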
      • getSplits

        public List<org.apache.hadoop.mapreduce.InputSplit> getSplits(org.apache.hadoop.mapreduce.JobContext job)
                                                               throws IOException
        Overrides:
        getSplits in class org.apache.hadoop.mapreduce.lib.input.FileInputFormat<org.apache.hadoop.io.LongWritable,T>
        Throws:
        IOException
      • getModel

        public static org.apache.jena.rdf.model.Model getModel(org.apache.hadoop.conf.Configuration conf)
        Extract a Model from a Hadoop Configuration using PREFIXES_KEY.
      • getModel

        public static org.apache.jena.rdf.model.Model getModel(org.apache.hadoop.conf.Configuration conf,
                                                               String key)
        Extract a Model from a Hadoop Configuration. Result is never null; empty if there was no entry for the key or exception on parse error.
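        The "never null, empty when the key is absent" contract can be illustrated without Hadoop or Jena on the classpath. The following is a minimal sketch in which a Map stands in for the Configuration and another Map of prefix to IRI stands in for the Model. The key value "example.prefixes", the "prefix=iri" line encoding, and the class name PrefixConfDemo are assumptions made for the example; the real value of PREFIXES_KEY and the real serialization are not given on this page. The sketch throws on a malformed entry, which is one possible reading of the description, not a confirmed behaviour.

```java
import java.util.HashMap;
import java.util.LinkedHashMap;
import java.util.Map;

class PrefixConfDemo {
    // Hypothetical key; the actual value of PREFIXES_KEY is not documented here.
    static final String PREFIXES_KEY = "example.prefixes";

    // `conf` stands in for a Hadoop Configuration,
    // the returned map stands in for a Jena Model holding prefixes.
    static Map<String, String> getPrefixes(Map<String, String> conf, String key) {
        Map<String, String> result = new LinkedHashMap<>();
        String raw = conf.get(key);
        if (raw == null) {
            // No entry for the key: return an empty result, never null.
            return result;
        }
        for (String line : raw.split("\n")) {
            if (line.trim().isEmpty()) {
                continue; // skip blank lines
            }
            // Assumed encoding: one "prefix=iri" pair per line.
            String[] parts = line.split("=", 2);
            if (parts.length != 2) {
                // Assumption: malformed input is reported, not silently dropped.
                throw new IllegalArgumentException("Malformed prefix entry: " + line);
            }
            result.put(parts[0].trim(), parts[1].trim());
        }
        return result;
    }

    // Convenience overload mirroring getModel(conf), which uses PREFIXES_KEY.
    static Map<String, String> getPrefixes(Map<String, String> conf) {
        return getPrefixes(conf, PREFIXES_KEY);
    }

    public static void main(String[] args) {
        Map<String, String> conf = new HashMap<>();
        System.out.println(getPrefixes(conf).isEmpty()); // prints "true"

        conf.put(PREFIXES_KEY, "ex=http://example.org/\nfoaf=http://xmlns.com/foaf/0.1/");
        System.out.println(getPrefixes(conf).get("ex")); // prints "http://example.org/"
    }
}
```

        Because the result is never null, callers can iterate over or merge the returned value without a null check, whether or not prefixes were configured for the job.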