App Engine Python SDK  v1.6.9 rev.445
The Python runtime is available as an experimental Preview feature.
Public Member Functions | List of all members
google.appengine.ext.mapreduce.shuffler.ShufflePipeline Class Reference
Inheritance diagram for google.appengine.ext.mapreduce.shuffler.ShufflePipeline:
google.appengine.ext.mapreduce.pipeline_base.PipelineBase

Public Member Functions

def run
 
- Public Member Functions inherited from google.appengine.ext.mapreduce.pipeline_base.PipelineBase
def start
 

Detailed Description

A pipeline to shuffle multiple key-value files.

Args:
  job_name: The descriptive name of the overall job.
  filenames: list of file names to sort. Files have to be of records format
    defined by Files API and contain serialized file_service_pb.KeyValue
    protocol messages.
  shards: Optional. Number of output shards to generate. Defaults
    to the number of input files.

Returns:
  default: a list of filenames as string. Resulting files contain
    serialized file_service_pb.KeyValues protocol messages with
    all values collated to a single key. When there is no output,
    an empty list from shuffle service or a list of empty files from
    in memory shuffler.

The documentation for this class was generated from the following file: