apache_beam.yaml.yaml_combine module

This module defines the basic Combine operation.

apache_beam.yaml.yaml_combine.normalize_combine(spec)[source]

Expands various shorthand specs for combine (which can otherwise be quite verbose for simple cases.) We do this here so that it doesn’t need to be done per language. The following are all equivalent:

dest: fn_type

dest:
  value: dest
  fn: fn_type

dest:
  value: dest
  fn:
    type: fn_type
class apache_beam.yaml.yaml_combine.PyJsYamlCombine(group_by: Iterable[str], combine: Mapping[str, Mapping[str, Any]], language: str | None = None)[source]

Bases: PTransform

Groups and combines records sharing common fields.

Built-in combine functions are sum, max, min, all, any, mean, count, group, concat but custom aggregation functions can be used as well.

See also the documentation on [YAML Aggregation](https://beam.apache.org/documentation/sdks/yaml-combine/).

Parameters:
  • group_by – The field(s) to aggregate on.

  • combine – The aggregation function to use.

  • language – The language used to define (and execute) the custom callables in combine. Defaults to generic.

expand(pcoll)[source]
apache_beam.yaml.yaml_combine.create_combine_providers()[source]