Class Combine.Globally<InputT,OutputT>
- Type Parameters:
InputT- type of input valuesOutputT- type of output values
- All Implemented Interfaces:
Serializable,HasDisplayData
- Enclosing class:
Combine
Combine.Globally<InputT, OutputT> takes a PCollection<InputT> and returns a
PCollection<OutputT> whose elements are the result of combining all the elements in
each window of the input PCollection, using a specified CombineFn<InputT, AccumT, OutputT>. It is common for InputT == OutputT, but not
required. Common combining functions include sums, mins, maxes, and averages of numbers,
conjunctions and disjunctions of booleans, statistical aggregations, etc.
Example of use:
PCollection<Integer> pc = ...;
PCollection<Integer> sum = pc.apply(
Combine.globally(new Sum.SumIntegerFn()));
Combining can happen in parallel, with different subsets of the input PCollection
being combined separately, and their intermediate results combined further, in an arbitrary
tree reduction pattern, until a single result value is produced.
If the input PCollection is windowed into GlobalWindows, a default value in
the GlobalWindow will be output if the input PCollection is empty. To use this
with inputs with other windowing, either withoutDefaults() or asSingletonView()
must be called, as the default value cannot be automatically assigned to any single window.
By default, the Coder of the output PValue<OutputT> is inferred from the
concrete type of the CombineFn<InputT, AccumT, OutputT>'s output type OutputT.
See also Combine.perKey(org.apache.beam.sdk.transforms.SerializableFunction<java.lang.Iterable<V>, V>)/Combine.PerKey and Combine.groupedValues(org.apache.beam.sdk.transforms.SerializableFunction<java.lang.Iterable<V>, V>)/Combine.GroupedValues, which are useful for combining values associated with
each key in a PCollection of KVs.
- See Also:
-
Field Summary
Fields inherited from class org.apache.beam.sdk.transforms.PTransform
annotations, displayData, name, resourceHints -
Method Summary
Modifier and TypeMethodDescriptionReturns aPTransformthat produces aPCollectionViewwhose elements are the result of combining elements per-window in the inputPCollection.expand(PCollection<InputT> input) Override this method to specify how thisPTransformshould be expanded on the givenInputT.Returns the side inputs of thisCombine, tagged with the tag of thePCollectionView.CombineFnBase.GlobalCombineFn<? super InputT, ?, OutputT> getFn()Returns theCombineFnBase.GlobalCombineFnused by this Combine operation.protected StringReturns the name to use by default for thisPTransform(not including the names of any enclosingPTransforms).List<PCollectionView<?>> Returns the side inputs used by this Combine operation.booleanReturns whether or not this transformation applies a default value.voidpopulateDisplayData(DisplayData.Builder builder) Register display data for the given transform or component.withFanout(int fanout) Returns aPTransformidentical to this, but that uses an intermediate node to combine parts of the data to reduce load on the final global combine step.Returns aPTransformidentical to this, but that does not attempt to provide a default value in the case of empty input.withSideInputs(Iterable<? extends PCollectionView<?>> sideInputs) Returns aPTransformidentical to this, but with the specified side inputs to use inCombineWithContext.CombineFnWithContext.withSideInputs(PCollectionView<?>... sideInputs) Returns aPTransformidentical to this, but with the specified side inputs to use inCombineWithContext.CombineFnWithContext.Methods inherited from class org.apache.beam.sdk.transforms.PTransform
addAnnotation, compose, compose, getAnnotations, getDefaultOutputCoder, getDefaultOutputCoder, getDefaultOutputCoder, getName, getResourceHints, setDisplayData, setResourceHints, toString, validate, validate
-
Method Details
-
getKindString
Description copied from class:PTransformReturns the name to use by default for thisPTransform(not including the names of any enclosingPTransforms).By default, returns the base name of this
PTransform's class.The caller is responsible for ensuring that names of applied
PTransforms are unique, e.g., by adding a uniquifying suffix when needed.- Overrides:
getKindStringin classPTransform<PCollection<InputT>,PCollection<OutputT>>
-
asSingletonView
Returns aPTransformthat produces aPCollectionViewwhose elements are the result of combining elements per-window in the inputPCollection. If a value is requested from the view for a window that is not present, the result of applying theCombineFnto an empty input set will be returned. -
withoutDefaults
Returns aPTransformidentical to this, but that does not attempt to provide a default value in the case of empty input. Required when the input is not globally windowed and the output is not being used as a side input. -
withFanout
Returns aPTransformidentical to this, but that uses an intermediate node to combine parts of the data to reduce load on the final global combine step.The
fanoutparameter determines the number of intermediate keys that will be used. -
withSideInputs
Returns aPTransformidentical to this, but with the specified side inputs to use inCombineWithContext.CombineFnWithContext. -
withSideInputs
public Combine.Globally<InputT,OutputT> withSideInputs(Iterable<? extends PCollectionView<?>> sideInputs) Returns aPTransformidentical to this, but with the specified side inputs to use inCombineWithContext.CombineFnWithContext. -
getFn
Returns theCombineFnBase.GlobalCombineFnused by this Combine operation. -
getSideInputs
Returns the side inputs used by this Combine operation. -
getAdditionalInputs
Returns the side inputs of thisCombine, tagged with the tag of thePCollectionView. The values of the returned map will be equal to the result ofgetSideInputs().- Overrides:
getAdditionalInputsin classPTransform<PCollection<InputT>,PCollection<OutputT>>
-
isInsertDefault
public boolean isInsertDefault()Returns whether or not this transformation applies a default value. -
expand
Description copied from class:PTransformOverride this method to specify how thisPTransformshould be expanded on the givenInputT.NOTE: This method should not be called directly. Instead apply the
PTransformshould be applied to theInputTusing theapplymethod.Composite transforms, which are defined in terms of other transforms, should return the output of one of the composed transforms. Non-composite transforms, which do not apply any transforms internally, should return a new unbound output and register evaluators (via backend-specific registration methods).
- Specified by:
expandin classPTransform<PCollection<InputT>,PCollection<OutputT>>
-
populateDisplayData
Description copied from class:PTransformRegister display data for the given transform or component.populateDisplayData(DisplayData.Builder)is invoked by Pipeline runners to collect display data viaDisplayData.from(HasDisplayData). Implementations may callsuper.populateDisplayData(builder)in order to register display data in the current namespace, but should otherwise usesubcomponent.populateDisplayData(builder)to use the namespace of the subcomponent.By default, does not register any display data. Implementors may override this method to provide their own display data.
- Specified by:
populateDisplayDatain interfaceHasDisplayData- Overrides:
populateDisplayDatain classPTransform<PCollection<InputT>,PCollection<OutputT>> - Parameters:
builder- The builder to populate with display data.- See Also:
-