apache_beam.transforms.combiners module
A library of basic combiner PTransform subclasses.
- 
class apache_beam.transforms.combiners.Mean
- Bases: object
- Combiners for computing arithmetic means of elements.
  - class Globally(has_defaults=True)
    - Bases: apache_beam.transforms.combiners.CombinerWithoutDefaults
    - combiners.Mean.Globally computes the arithmetic mean of the elements.
  - class PerKey(label=None)
    - Bases: apache_beam.transforms.ptransform.PTransform
    - combiners.Mean.PerKey finds the means of the values for each key.
 
- 
class apache_beam.transforms.combiners.MeanCombineFn(*unused_args, **unused_kwargs)
- Bases: apache_beam.transforms.core.CombineFn
- CombineFn for computing an arithmetic mean.
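A mean cannot be merged across workers directly, so a combiner of this kind typically carries a (sum, count) pair and divides only at the end. The following is a minimal plain-Python sketch of that idea using the four CombineFn method names (create_accumulator, add_input, merge_accumulators, extract_output). It does not import Beam and is not Beam's implementation; `MeanSketch`, `run_bundle`, and the NaN result for empty input are assumptions made for illustration.

```python
class MeanSketch:
    """Plain-Python model of a mean combiner (hypothetical, not Beam's MeanCombineFn)."""

    def create_accumulator(self):
        # (running sum, number of elements seen)
        return (0, 0)

    def add_input(self, accumulator, element):
        total, count = accumulator
        return (total + element, count + 1)

    def merge_accumulators(self, accumulators):
        # Partial results from separate bundles are summed component-wise.
        total, count = 0, 0
        for partial_total, partial_count in accumulators:
            total += partial_total
            count += partial_count
        return (total, count)

    def extract_output(self, accumulator):
        total, count = accumulator
        # Assumption of this sketch: an empty input yields NaN.
        return total / count if count else float("nan")


def run_bundle(fn, elements):
    """Hypothetical helper: fold one bundle of elements into an accumulator."""
    accumulator = fn.create_accumulator()
    for element in elements:
        accumulator = fn.add_input(accumulator, element)
    return accumulator


fn = MeanSketch()
partials = [run_bundle(fn, [1, 2, 3]), run_bundle(fn, [4, 5])]
mean = fn.extract_output(fn.merge_accumulators(partials))
print(mean)  # 3.0
```

Keeping the sum and count separate until extract_output is what lets the partial results be merged in any order.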
- 
class apache_beam.transforms.combiners.Count
- Bases: object
- Combiners for counting elements.
  - class Globally(has_defaults=True)
    - Bases: apache_beam.transforms.combiners.CombinerWithoutDefaults
    - combiners.Count.Globally counts the total number of elements.
  - class PerKey(label=None)
    - Bases: apache_beam.transforms.ptransform.PTransform
    - combiners.Count.PerKey counts how many elements each unique key has.
  - class PerElement(label=None)
    - Bases: apache_beam.transforms.ptransform.PTransform
    - combiners.Count.PerElement counts how many times each element occurs.
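The three Count variants differ only in what they group by. The sketch below models the distinction with `collections.Counter` on in-memory lists. It is plain Python rather than a Beam pipeline, and the variable names and sample data are illustrative assumptions.

```python
from collections import Counter

# Input for the global and per-key counts: (key, value) pairs.
pairs = [("a", 1), ("b", 2), ("a", 3)]

# Globally: one number for the whole collection.
count_globally = len(pairs)

# PerKey: one count per distinct key; the values are ignored.
count_per_key = sorted(Counter(key for key, _ in pairs).items())

# PerElement: one count per distinct element of an unkeyed collection.
words = ["x", "y", "x"]
count_per_element = sorted(Counter(words).items())

print(count_globally)     # 3
print(count_per_key)      # [('a', 2), ('b', 1)]
print(count_per_element)  # [('x', 2), ('y', 1)]
```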
 
- 
class apache_beam.transforms.combiners.CountCombineFn(*unused_args, **unused_kwargs)
- Bases: apache_beam.transforms.core.CombineFn
- CombineFn for computing PCollection size.
- 
class apache_beam.transforms.combiners.Top
- Bases: object
- Combiners for obtaining extremal elements.
  - class Of(n, key=None, reverse=False)
    - Bases: apache_beam.transforms.combiners.CombinerWithoutDefaults
    - Returns the n greatest elements in the PCollection.
    - This transform retrieves the n greatest elements in the PCollection to which it is applied, where “greatest” is determined by the optional key function and the reverse flag.
    - Creates a global Top operation.
    - The arguments ‘key’ and ‘reverse’ may be passed as keyword arguments, and have the same meaning as for Python’s sort functions.
    - Parameters:
      - n – number of elements to extract from pcoll.
      - key – (optional) a mapping of elements to a comparable key, similar to the key argument of Python’s sorting methods.
      - reverse – (optional) whether to order things smallest to largest, rather than largest to smallest.
  - class PerKey(n, key=None, reverse=False)
    - Bases: apache_beam.transforms.ptransform.PTransform
    - Identifies the n greatest elements associated with each key.
    - This transform produces a PCollection mapping unique keys in the input PCollection to the n greatest elements with which they are associated, where “greatest” is determined by the optional key function and the reverse flag.
    - Creates a per-key Top operation.
    - The arguments ‘key’ and ‘reverse’ may be passed as keyword arguments, and have the same meaning as for Python’s sort functions.
    - Parameters:
      - n – number of elements to extract from pcoll.
      - key – (optional) a mapping of elements to a comparable key, similar to the key argument of Python’s sorting methods.
      - reverse – (optional) whether to order things smallest to largest, rather than largest to smallest.
  - static Largest(n, has_defaults=True, key=None)
    - Obtain a list of the greatest n elements in a PCollection.
  - static Smallest(n, has_defaults=True, key=None)
    - Obtain a list of the least n elements in a PCollection.
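To make the effect of n, key, and reverse concrete, here is a small plain-Python model of the global and per-key Top semantics described above, built on `heapq`. The functions `top_of` and `top_per_key` are hypothetical stand-ins, not Beam calls, and the order of the returned list (best element first) is an assumption of the sketch.

```python
import heapq
from collections import defaultdict


def top_of(elements, n, key=None, reverse=False):
    """Model of a global top-n: greatest first, or least first when reverse=True."""
    if reverse:
        return heapq.nsmallest(n, elements, key=key)
    return heapq.nlargest(n, elements, key=key)


def top_per_key(pairs, n, key=None, reverse=False):
    """Model of a per-key top-n over (key, value) pairs."""
    grouped = defaultdict(list)
    for group_key, value in pairs:
        grouped[group_key].append(value)
    return {
        group_key: top_of(values, n, key=key, reverse=reverse)
        for group_key, values in grouped.items()
    }


scores = [5, 1, 9, 3, 7]
largest_two = top_of(scores, 2)
smallest_two = top_of(scores, 2, reverse=True)
longest = top_of(["kiwi", "fig", "banana"], 1, key=len)
per_key = top_per_key([("a", 4), ("b", 8), ("a", 6), ("a", 2)], 2)

print(largest_two)   # [9, 7]
print(smallest_two)  # [1, 3]
print(longest)       # ['banana']
print(per_key)       # {'a': [6, 4], 'b': [8]}
```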
 
- 
class apache_beam.transforms.combiners.TopCombineFn(n, key=None, reverse=False)
- Bases: apache_beam.transforms.core.CombineFn
- CombineFn doing the combining for all of the Top transforms.
- This CombineFn uses a key or reverse operator to rank the elements.
- Parameters:
  - key – (optional) a mapping of elements to a comparable key, similar to the key argument of Python’s sorting methods.
  - reverse – (optional) whether to order things smallest to largest, rather than largest to smallest.
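A top-n combiner can keep its accumulator bounded: each bundle retains at most n candidates, and merging only has to rank the retained candidates. The sketch below illustrates that with the CombineFn method names. `TopSketch` and `run_bundle` are hypothetical, the code does not import Beam, and sorting and truncating on every input is chosen for clarity rather than efficiency.

```python
class TopSketch:
    """Plain-Python model of a bounded top-n combiner (not Beam's TopCombineFn)."""

    def __init__(self, n, key=None, reverse=False):
        self.n = n
        self.key = key if key is not None else (lambda element: element)
        self.reverse = reverse

    def _trim(self, elements):
        # Default ranking is greatest first; reverse=True ranks least first.
        ranked = sorted(elements, key=self.key, reverse=not self.reverse)
        return ranked[: self.n]

    def create_accumulator(self):
        return []

    def add_input(self, accumulator, element):
        return self._trim(accumulator + [element])

    def merge_accumulators(self, accumulators):
        merged = [element for acc in accumulators for element in acc]
        return self._trim(merged)

    def extract_output(self, accumulator):
        return list(accumulator)


def run_bundle(fn, elements):
    """Hypothetical helper: fold one bundle of elements into an accumulator."""
    accumulator = fn.create_accumulator()
    for element in elements:
        accumulator = fn.add_input(accumulator, element)
    return accumulator


fn = TopSketch(2)
partials = [run_bundle(fn, [5, 1, 9]), run_bundle(fn, [3, 7])]
top_two = fn.extract_output(fn.merge_accumulators(partials))

shortest_fn = TopSketch(1, key=len, reverse=True)
shortest = shortest_fn.extract_output(run_bundle(shortest_fn, ["kiwi", "fig"]))

print(partials)  # [[9, 5], [7, 3]]
print(top_two)   # [9, 7]
print(shortest)  # ['fig']
```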
 
- 
class apache_beam.transforms.combiners.Sample
- Bases: object
- Combiners for sampling n elements without replacement.
- 
class apache_beam.transforms.combiners.SampleCombineFn(n)
- Bases: apache_beam.transforms.core.CombineFn
- CombineFn for all Sample transforms.
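This section does not say how the sample is drawn. One common way to sample n elements without replacement in a combinable fashion is to tag every element with a random number and keep the n elements with the largest tags. The sketch below assumes that technique. `sample_fixed_size` and its `seed` parameter are hypothetical and exist only to make the example reproducible.

```python
import heapq
import random


def sample_fixed_size(elements, n, seed=None):
    """Return up to n elements chosen uniformly without replacement."""
    rng = random.Random(seed)
    # The index breaks ties between equal tags, so elements are never compared.
    tagged = [(rng.random(), index, element) for index, element in enumerate(elements)]
    return [element for _, _, element in heapq.nlargest(n, tagged)]


population = list(range(10))
sample = sample_fixed_size(population, 3, seed=0)
everything = sample_fixed_size([1, 2], 5, seed=0)

print(len(sample))         # 3
print(sorted(everything))  # [1, 2]
```

Because each position in the input is tagged once, no input position can appear twice in the result, and asking for more elements than exist simply returns them all.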
- 
class apache_beam.transforms.combiners.ToList(has_defaults=True)
- Bases: apache_beam.transforms.combiners.CombinerWithoutDefaults
- A global CombineFn that condenses a PCollection into a single list.
- 
class apache_beam.transforms.combiners.ToListCombineFn(*unused_args, **unused_kwargs)
- Bases: apache_beam.transforms.core.CombineFn
- CombineFn for ToList.
- 
class apache_beam.transforms.combiners.ToDict(has_defaults=True)
- Bases: apache_beam.transforms.combiners.CombinerWithoutDefaults
- A global CombineFn that condenses a PCollection into a single dict.
- PCollections should consist of 2-tuples, notionally (key, value) pairs. If multiple values are associated with the same key, only one of the values will be present in the resulting dict.
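The duplicate-key behaviour is easy to overlook, so here is a plain-Python illustration of it. The helper `to_dict_sketch` is hypothetical and does not use Beam. In this sketch the last pair seen wins, but the text above only promises that one of the values survives, so callers should not rely on which.

```python
def to_dict_sketch(pairs):
    """Collapse (key, value) pairs into a dict; later pairs overwrite earlier ones here."""
    result = {}
    for key, value in pairs:
        result[key] = value
    return result


pairs = [("a", 1), ("b", 2), ("a", 3)]
result = to_dict_sketch(pairs)

print(sorted(result))  # ['a', 'b']
print(len(result))     # 2
```

If every value per key must be kept, group the values by key before building the dict rather than relying on this collapse.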
- 
class apache_beam.transforms.combiners.ToDictCombineFn(*unused_args, **unused_kwargs)
- Bases: apache_beam.transforms.core.CombineFn
- CombineFn for ToDict.
- 
class apache_beam.transforms.combiners.ToSet(has_defaults=True)
- Bases: apache_beam.transforms.combiners.CombinerWithoutDefaults
- A global CombineFn that condenses a PCollection into a set.
- 
class apache_beam.transforms.combiners.ToSetCombineFn(*unused_args, **unused_kwargs)
- Bases: apache_beam.transforms.core.CombineFn
- CombineFn for ToSet.
- 
class apache_beam.transforms.combiners.Latest
- Bases: object
- Combiners for computing the latest element.
- 
class apache_beam.transforms.combiners.LatestCombineFn(*unused_args, **unused_kwargs)
- Bases: apache_beam.transforms.core.CombineFn
- CombineFn to get the element with the latest timestamp from a PCollection.
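Selecting the latest element fits the accumulator pattern naturally: the accumulator is just the best (value, timestamp) pair seen so far. The following plain-Python sketch assumes the input arrives as explicit (value, timestamp) pairs and that an empty input yields None. `LatestSketch` and `run_bundle` are hypothetical names, and the code does not import Beam.

```python
class LatestSketch:
    """Plain-Python model of a latest-by-timestamp combiner (not Beam's LatestCombineFn)."""

    def create_accumulator(self):
        # (value, timestamp); negative infinity loses to any real timestamp.
        return (None, float("-inf"))

    def add_input(self, accumulator, element):
        _, timestamp = element
        # On a tie the earlier-seen element is kept (an assumption of this sketch).
        return element if timestamp > accumulator[1] else accumulator

    def merge_accumulators(self, accumulators):
        return max(accumulators, key=lambda acc: acc[1], default=self.create_accumulator())

    def extract_output(self, accumulator):
        return accumulator[0]


def run_bundle(fn, elements):
    """Hypothetical helper: fold one bundle of elements into an accumulator."""
    accumulator = fn.create_accumulator()
    for element in elements:
        accumulator = fn.add_input(accumulator, element)
    return accumulator


fn = LatestSketch()
events = [("start", 10), ("stop", 30), ("pause", 20)]
partials = [run_bundle(fn, events[:2]), run_bundle(fn, events[2:])]
latest = fn.extract_output(fn.merge_accumulators(partials))
nothing = fn.extract_output(fn.merge_accumulators([]))

print(latest)   # stop
print(nothing)  # None
```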