Class FileInputFormatCsv


  • public class FileInputFormatCsv
    extends org.apache.hadoop.mapreduce.lib.input.FileInputFormat<org.apache.hadoop.io.LongWritable,List>
    • Nested Class Summary

      • Nested classes/interfaces inherited from class org.apache.hadoop.mapreduce.lib.input.FileInputFormat

        org.apache.hadoop.mapreduce.lib.input.FileInputFormat.Counter
    • Field Summary

      • Fields inherited from class org.apache.hadoop.mapreduce.lib.input.FileInputFormat

        DEFAULT_LIST_STATUS_NUM_THREADS, INPUT_DIR, INPUT_DIR_RECURSIVE, LIST_STATUS_NUM_THREADS, NUM_INPUT_FILES, PATHFILTER_CLASS, SPLIT_MAXSIZE, SPLIT_MINSIZE
    • Method Summary

      Modifier and Type Method Description
      org.apache.hadoop.mapreduce.RecordReader<org.apache.hadoop.io.LongWritable,List> createRecordReader(org.apache.hadoop.mapreduce.InputSplit inputSplit, org.apache.hadoop.mapreduce.TaskAttemptContext context)
      static org.apache.commons.csv.CSVFormat getCsvFormat(org.apache.hadoop.conf.Configuration conf, org.apache.commons.csv.CSVFormat defaultValue)
      boolean isSplitable(org.apache.hadoop.mapreduce.JobContext context, org.apache.hadoop.fs.Path file)
      static void setCsvFormat(org.apache.hadoop.conf.Configuration conf, org.apache.commons.csv.CSVFormat csvFormat)
      • Methods inherited from class org.apache.hadoop.mapreduce.lib.input.FileInputFormat

        addInputPath, addInputPathRecursively, addInputPaths, computeSplitSize, getBlockIndex, getFormatMinSplitSize, getInputDirRecursive, getInputPathFilter, getInputPaths, getMaxSplitSize, getMinSplitSize, getSplits, listStatus, makeSplit, makeSplit, setInputDirRecursive, setInputPathFilter, setInputPaths, setInputPaths, setMaxInputSplitSize, setMinInputSplitSize
    • Constructor Detail

      • FileInputFormatCsv

        public FileInputFormatCsv()
    • Method Detail

      • isSplitable

        public boolean isSplitable(org.apache.hadoop.mapreduce.JobContext context,
                                   org.apache.hadoop.fs.Path file)
        Overrides:
        isSplitable in class org.apache.hadoop.mapreduce.lib.input.FileInputFormat<org.apache.hadoop.io.LongWritable,List>
      • createRecordReader

        public org.apache.hadoop.mapreduce.RecordReader<org.apache.hadoop.io.LongWritable,List> createRecordReader(org.apache.hadoop.mapreduce.InputSplit inputSplit,
                                                                                                                         org.apache.hadoop.mapreduce.TaskAttemptContext context)
        Specified by:
        createRecordReader in class org.apache.hadoop.mapreduce.InputFormat<org.apache.hadoop.io.LongWritable,List>
      • setCsvFormat

        public static void setCsvFormat(org.apache.hadoop.conf.Configuration conf,
                                        org.apache.commons.csv.CSVFormat csvFormat)
      • getCsvFormat

        public static org.apache.commons.csv.CSVFormat getCsvFormat(org.apache.hadoop.conf.Configuration conf,
                                                                    org.apache.commons.csv.CSVFormat defaultValue)
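
        The setCsvFormat/getCsvFormat pair has the shape of a common Hadoop idiom: a static setter stores a setting in the job configuration, and a static getter reads it back or returns the caller's default when nothing was stored. This page does not document how FileInputFormatCsv implements either method, so the following is only a minimal sketch of that idiom using stand-in types. The class name CsvFormatConfigSketch, the key "sketch.csv.delimiter", and the use of a Map in place of Configuration and a delimiter char in place of CSVFormat are all assumptions made for illustration.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in, NOT the real FileInputFormatCsv: a Map plays the role of
// org.apache.hadoop.conf.Configuration and a delimiter char plays the role of CSVFormat.
class CsvFormatConfigSketch {
    // Hypothetical key name; the key used by the real class is not documented on this page.
    static final String DELIMITER_KEY = "sketch.csv.delimiter";

    // Same shape as setCsvFormat(conf, csvFormat): write the setting into the configuration.
    static void setDelimiter(Map<String, String> conf, char delimiter) {
        conf.put(DELIMITER_KEY, String.valueOf(delimiter));
    }

    // Same shape as getCsvFormat(conf, defaultValue): read the setting back,
    // or fall back to the caller's default when nothing was stored.
    static char getDelimiter(Map<String, String> conf, char defaultValue) {
        String stored = conf.get(DELIMITER_KEY);
        return (stored == null || stored.isEmpty()) ? defaultValue : stored.charAt(0);
    }

    public static void main(String[] args) {
        Map<String, String> conf = new HashMap<>();
        System.out.println(getDelimiter(conf, ',')); // nothing stored yet: prints the default ","
        setDelimiter(conf, ';');
        System.out.println(getDelimiter(conf, ',')); // prints the stored value ";"
    }
}
```

        With the real class, the analogous calls would presumably be setCsvFormat on the job's Configuration before submission and getCsvFormat wherever the format is needed, for example inside the record reader. That usage is inferred from the signatures alone.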