SummaryBuilder

class pyspark.ml.stat.SummaryBuilder(jSummaryBuilder)[source]

A builder object that provides summary statistics about a given column.

Users should not directly create such builders, but instead use one of the methods in pyspark.ml.stat.Summarizer

New in version 2.4.0.

Methods

Methods Documentation

summary(featuresCol, weightCol=None)[source]

Returns an aggregate object that contains the summary of the column with the requested metrics.

Parameters
  • featuresCol – a column that contains features Vector object.

  • weightCol – a column that contains weight value. Default weight is 1.0.

Returns

an aggregate column that contains the statistics. The exact content of this structure is determined during the creation of the builder.

New in version 2.4.0.