apache_beam.transforms.combiners module¶
A library of basic combiner PTransform subclasses.
-
class
apache_beam.transforms.combiners.
Mean
[source]¶ Bases:
object
Combiners for computing arithmetic means of elements.
-
class
Globally
(has_defaults=True)[source]¶ Bases:
apache_beam.transforms.combiners.CombinerWithoutDefaults
combiners.Mean.Globally computes the arithmetic mean of the elements.
-
class
PerKey
(label=None)[source]¶ Bases:
apache_beam.transforms.ptransform.PTransform
combiners.Mean.PerKey finds the means of the values for each key.
-
class
-
class
apache_beam.transforms.combiners.
MeanCombineFn
(*unused_args, **unused_kwargs)[source]¶ Bases:
apache_beam.transforms.core.CombineFn
CombineFn for computing an arithmetic mean.
-
class
apache_beam.transforms.combiners.
Count
[source]¶ Bases:
object
Combiners for counting elements.
-
class
Globally
(has_defaults=True)[source]¶ Bases:
apache_beam.transforms.combiners.CombinerWithoutDefaults
combiners.Count.Globally counts the total number of elements.
-
class
PerKey
(label=None)[source]¶ Bases:
apache_beam.transforms.ptransform.PTransform
combiners.Count.PerKey counts how many elements each unique key has.
-
class
PerElement
(label=None)[source]¶ Bases:
apache_beam.transforms.ptransform.PTransform
combiners.Count.PerElement counts how many times each element occurs.
-
class
-
class
apache_beam.transforms.combiners.
CountCombineFn
(*unused_args, **unused_kwargs)[source]¶ Bases:
apache_beam.transforms.core.CombineFn
CombineFn for computing PCollection size.
-
class
apache_beam.transforms.combiners.
Top
[source]¶ Bases:
object
Combiners for obtaining extremal elements.
-
class
Of
(n, key=None, reverse=False)[source]¶ Bases:
apache_beam.transforms.combiners.CombinerWithoutDefaults
Returns the n greatest elements in the PCollection.
This transform will retrieve the n greatest elements in the PCollection to which it is applied, where “greatest” is determined by a function supplied as the key or reverse arguments.
Creates a global Top operation.
The arguments ‘key’ and ‘reverse’ may be passed as keyword arguments, and have the same meaning as for Python’s sort functions.
Parameters: - n – number of elements to extract from pcoll.
- key – (optional) a mapping of elements to a comparable key, similar to the key argument of Python’s sorting methods.
- reverse – (optional) whether to order things smallest to largest, rather than largest to smallest
-
class
PerKey
(n, key=None, reverse=False)[source]¶ Bases:
apache_beam.transforms.ptransform.PTransform
Identifies the N greatest elements associated with each key.
This transform will produce a PCollection mapping unique keys in the input PCollection to the n greatest elements with which they are associated, where “greatest” is determined by a function supplied as the key or reverse arguments.
Creates a per-key Top operation.
The arguments ‘key’ and ‘reverse’ may be passed as keyword arguments, and have the same meaning as for Python’s sort functions.
Parameters: - n – number of elements to extract from pcoll.
- key – (optional) a mapping of elements to a comparable key, similar to the key argument of Python’s sorting methods.
- reverse – (optional) whether to order things smallest to largest, rather than largest to smallest
-
static
Largest
(n, has_defaults=True, key=None)[source]¶ Obtain a list of the greatest N elements in a PCollection.
-
static
Smallest
(n, has_defaults=True, key=None)[source]¶ Obtain a list of the least N elements in a PCollection.
-
class
-
class
apache_beam.transforms.combiners.
TopCombineFn
(n, key=None, reverse=False)[source]¶ Bases:
apache_beam.transforms.core.CombineFn
CombineFn doing the combining for all of the Top transforms.
This CombineFn uses a key or reverse operator to rank the elements.
Parameters: - key – (optional) a mapping of elements to a comparable key, similar to the key argument of Python’s sorting methods.
- reverse – (optional) whether to order things smallest to largest, rather than largest to smallest
-
class
apache_beam.transforms.combiners.
Sample
[source]¶ Bases:
object
Combiners for sampling n elements without replacement.
-
class
apache_beam.transforms.combiners.
SampleCombineFn
(n)[source]¶ Bases:
apache_beam.transforms.core.CombineFn
CombineFn for all Sample transforms.
-
class
apache_beam.transforms.combiners.
ToList
(has_defaults=True)[source]¶ Bases:
apache_beam.transforms.combiners.CombinerWithoutDefaults
A global CombineFn that condenses a PCollection into a single list.
-
class
apache_beam.transforms.combiners.
ToListCombineFn
(*unused_args, **unused_kwargs)[source]¶ Bases:
apache_beam.transforms.core.CombineFn
CombineFn for to_list.
-
class
apache_beam.transforms.combiners.
ToDict
(has_defaults=True)[source]¶ Bases:
apache_beam.transforms.combiners.CombinerWithoutDefaults
A global CombineFn that condenses a PCollection into a single dict.
PCollections should consist of 2-tuples, notionally (key, value) pairs. If multiple values are associated with the same key, only one of the values will be present in the resulting dict.
-
class
apache_beam.transforms.combiners.
ToDictCombineFn
(*unused_args, **unused_kwargs)[source]¶ Bases:
apache_beam.transforms.core.CombineFn
CombineFn for to_dict.
-
class
apache_beam.transforms.combiners.
ToSet
(has_defaults=True)[source]¶ Bases:
apache_beam.transforms.combiners.CombinerWithoutDefaults
A global CombineFn that condenses a PCollection into a set.
-
class
apache_beam.transforms.combiners.
ToSetCombineFn
(*unused_args, **unused_kwargs)[source]¶ Bases:
apache_beam.transforms.core.CombineFn
CombineFn for ToSet.
-
class
apache_beam.transforms.combiners.
Latest
[source]¶ Bases:
object
Combiners for computing the latest element
-
class
apache_beam.transforms.combiners.
LatestCombineFn
(*unused_args, **unused_kwargs)[source]¶ Bases:
apache_beam.transforms.core.CombineFn
CombineFn to get the element with the latest timestamp from a PCollection.