pyspark.SparkContext.binaryRecords

SparkContext.binaryRecords(path, recordLength)[source]

Load data from a flat binary file, assuming each record is a set of numbers with the specified numerical format (see ByteBuffer), and the number of bytes per record is constant.

Parameters
  • path – Directory to the input data files

  • recordLength – The length at which to split the records