Class CoGroupByKey<K>
java.lang.Object
org.apache.beam.sdk.transforms.PTransform<KeyedPCollectionTuple<K>,PCollection<KV<K,CoGbkResult>>>
org.apache.beam.sdk.transforms.join.CoGroupByKey<K>
- Type Parameters:
K- the type of the keys in the input and outputPCollections
- All Implemented Interfaces:
Serializable,HasDisplayData
public class CoGroupByKey<K>
extends PTransform<KeyedPCollectionTuple<K>,PCollection<KV<K,CoGbkResult>>>
A
PTransform that performs a CoGroupByKey on a tuple of tables. A CoGroupByKey groups results from all tables by like keys into CoGbkResults, from which
the results for any specific table can be accessed by the TupleTag supplied with the initial table.
Example of performing a CoGroupByKey followed by a ParDo that consumes the
results:
PCollection<KV<K, V1>> pt1 = ...; PCollection<KV<K, V2>> pt2 = ...; final TupleTag<V1> t1 = new TupleTag<>(); final TupleTag<V2> t2 = new TupleTag<>(); PCollection<KV<K, CoGbkResult>> coGbkResultCollection = KeyedPCollectionTuple.of(t1, pt1) .and(t2, pt2) .apply(CoGroupByKey.<K>create()); PCollection<T> finalResultCollection = coGbkResultCollection.apply(ParDo.of( new DoFn<KV<K, CoGbkResult>, T>(){@ProcessElement public void processElement(ProcessContext c) { KV<K, CoGbkResult> e = c.element(); Iterable<V1> pt1Vals = e.getValue().getAll(t1); V2 pt2Val = e.getValue().getOnly(t2); ... Do Something .... c.output(...some T...); }}));
- See Also:
-
Field Summary
Fields inherited from class org.apache.beam.sdk.transforms.PTransform
annotations, displayData, name, resourceHints -
Method Summary
Modifier and TypeMethodDescriptionstatic <K> CoGroupByKey<K> create()Returns aCoGroupByKey<K>PTransform.expand(KeyedPCollectionTuple<K> input) Override this method to specify how thisPTransformshould be expanded on the givenInputT.Methods inherited from class org.apache.beam.sdk.transforms.PTransform
addAnnotation, compose, compose, getAdditionalInputs, getAnnotations, getDefaultOutputCoder, getDefaultOutputCoder, getDefaultOutputCoder, getKindString, getName, getResourceHints, populateDisplayData, setDisplayData, setResourceHints, toString, validate, validate
-
Method Details
-
create
Returns aCoGroupByKey<K>PTransform.- Type Parameters:
K- the type of the keys in the input and outputPCollections
-
expand
Description copied from class:PTransformOverride this method to specify how thisPTransformshould be expanded on the givenInputT.NOTE: This method should not be called directly. Instead apply the
PTransformshould be applied to theInputTusing theapplymethod.Composite transforms, which are defined in terms of other transforms, should return the output of one of the composed transforms. Non-composite transforms, which do not apply any transforms internally, should return a new unbound output and register evaluators (via backend-specific registration methods).
- Specified by:
expandin classPTransform<KeyedPCollectionTuple<K>,PCollection<KV<K, CoGbkResult>>>
-