Type Parameters:
T - the type of input and output element
IdT - the type of representative values used to dedup

public static final class Deduplicate.WithRepresentativeValues<T,IdT> extends PTransform<PCollection<T>,PCollection<T>>
PTransform that uses a SerializableFunction to obtain a representative value
for each input element used for deduplication.
Construct via Deduplicate.withRepresentativeValueFn(org.apache.beam.sdk.transforms.SerializableFunction<T, IdT>).
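
To make the idea of a representative value concrete, the following plain-Java sketch keeps the first element seen for each id. It is a simplified in-memory model and not Beam's implementation; the class `RepresentativeDedupSketch`, the method `dedupByRepresentative`, and the sample event strings are hypothetical names chosen for illustration.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;
import java.util.function.Function;

public class RepresentativeDedupSketch {

    // Keeps the first element seen for each representative value (id).
    // Hypothetical helper: an in-memory model, not part of the Beam SDK.
    public static <T, IdT> List<T> dedupByRepresentative(List<T> input, Function<T, IdT> idFn) {
        Set<IdT> seenIds = new HashSet<>();
        List<T> output = new ArrayList<>();
        for (T element : input) {
            // Set.add returns false when the id is already present.
            if (seenIds.add(idFn.apply(element))) {
                output.add(element);
            }
        }
        return output;
    }

    public static void main(String[] args) {
        List<String> events = Arrays.asList("42:login", "42:login-retry", "7:logout");
        // Hypothetical id function: the text before the first ':'.
        List<String> unique = dedupByRepresentative(events, e -> e.substring(0, e.indexOf(':')));
        System.out.println(unique); // prints [42:login, 7:logout]
    }
}
```

In a pipeline, the same kind of id function would be passed to Deduplicate.withRepresentativeValueFn(...) as a SerializableFunction<T, IdT>, with IdT being the type the function returns.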
Fields inherited from class PTransform: name, resourceHints

| Modifier and Type | Method and Description |
|---|---|
| PCollection<T> | expand(PCollection<T> input): Override this method to specify how this PTransform should be expanded on the given InputT. |
| Deduplicate.WithRepresentativeValues<T,IdT> | withDuration(Duration duration): Return a WithRepresentativeValues PTransform that is like this one, but with the specified deduplication duration. |
| Deduplicate.WithRepresentativeValues<T,IdT> | withRepresentativeCoder(Coder<IdT> coder): Return a WithRepresentativeValues PTransform that is like this one, but with the specified id type coder. |
| Deduplicate.WithRepresentativeValues<T,IdT> | withRepresentativeType(TypeDescriptor<IdT> type): Return a WithRepresentativeValues PTransform that is like this one, but with the specified id type descriptor. |
| Deduplicate.WithRepresentativeValues<T,IdT> | withTimeDomain(TimeDomain timeDomain): Returns a WithRepresentativeValues PTransform like this one but with the specified time domain. |
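
As a rough model of what the deduplication duration controls, the sketch below assumes that an id is remembered for a fixed duration after the element that was emitted for it, and that later elements with the same id are dropped until that marker expires. This is an assumption about the behaviour written with plain Java collections, not Beam's implementation; `TimedDedupSketch`, `offer`, and `durationMillis` are hypothetical names, and the `nowMillis` argument stands in for the clock of whichever time domain is selected.

```java
import java.util.HashMap;
import java.util.Map;

public class TimedDedupSketch {
    private final long durationMillis;
    // id -> time (in ms) at which the "seen" marker for that id expires.
    private final Map<String, Long> expiryById = new HashMap<>();

    public TimedDedupSketch(long durationMillis) {
        this.durationMillis = durationMillis;
    }

    // Returns true if the element should be emitted, false if it is
    // dropped as a duplicate of an id seen within the duration.
    public boolean offer(String id, long nowMillis) {
        Long expiry = expiryById.get(id);
        if (expiry != null && nowMillis < expiry) {
            return false; // marker still live: duplicate
        }
        // First sighting, or the previous marker has expired.
        expiryById.put(id, nowMillis + durationMillis);
        return true;
    }

    public static void main(String[] args) {
        TimedDedupSketch dedup = new TimedDedupSketch(1000L);
        System.out.println(dedup.offer("a", 0L));    // prints true
        System.out.println(dedup.offer("a", 500L));  // prints false
        System.out.println(dedup.offer("a", 1000L)); // prints true
    }
}
```

A real runner tracks time through watermarks and timers rather than an explicit timestamp argument, so this only illustrates the trade-off: a longer duration catches later duplicates at the cost of remembering ids for longer.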
Methods inherited from class PTransform: compose, compose, getAdditionalInputs, getDefaultOutputCoder, getDefaultOutputCoder, getDefaultOutputCoder, getKindString, getName, getResourceHints, populateDisplayData, setResourceHints, toString, validate, validate

public Deduplicate.WithRepresentativeValues<T,IdT> withRepresentativeType(TypeDescriptor<IdT> type)
Return a WithRepresentativeValues PTransform that is like this one, but with
the specified id type descriptor.
Either withRepresentativeCoder(org.apache.beam.sdk.coders.Coder<IdT>) or this method must be invoked if using Deduplicate.withRepresentativeValueFn(org.apache.beam.sdk.transforms.SerializableFunction<T, IdT>) in Java 8 with a lambda as the fn.
Parameters:
type - a TypeDescriptor describing the representative type of this WithRepresentativeValues
Returns:
a WithRepresentativeValues PTransform that is like this one, but with
the specified representative value type descriptor. Any previously set representative
value coder will be cleared.

public Deduplicate.WithRepresentativeValues<T,IdT> withRepresentativeCoder(Coder<IdT> coder)
Return a WithRepresentativeValues PTransform that is like this one, but with
the specified id type coder.
Either withRepresentativeType(org.apache.beam.sdk.values.TypeDescriptor<IdT>) or this method must be invoked if using Deduplicate.withRepresentativeValueFn(org.apache.beam.sdk.transforms.SerializableFunction<T, IdT>) in Java 8 with a lambda
as the fn.
Parameters:
coder - a Coder capable of encoding the representative type of this WithRepresentativeValues
Returns:
a WithRepresentativeValues PTransform that is like this one, but with
the specified representative value coder. Any previously set representative value type
descriptor will be cleared.

public Deduplicate.WithRepresentativeValues<T,IdT> withTimeDomain(TimeDomain timeDomain)
Returns a WithRepresentativeValues PTransform like this one but with the
specified time domain.

public Deduplicate.WithRepresentativeValues<T,IdT> withDuration(Duration duration)
Return a WithRepresentativeValues PTransform that is like this one, but with
the specified deduplication duration.

public PCollection<T> expand(PCollection<T> input)
Override this method to specify how this PTransform should be expanded on the given
InputT.
NOTE: This method should not be called directly. Instead, the PTransform should
be applied to the InputT using the apply method.
Composite transforms, which are defined in terms of other transforms, should return the output of one of the composed transforms. Non-composite transforms, which do not apply any transforms internally, should return a new unbound output and register evaluators (via backend-specific registration methods).
Specified by:
expand in class PTransform<PCollection<T>,PCollection<T>>
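
The relationship between apply and expand described above can be pictured with a toy model in plain Java. Everything in it is hypothetical and deliberately much simpler than the SDK: `ToyTransform` stands in for a transform, the static `apply` method stands in for the apply method, and lists stand in for collections. The point is only that callers go through `apply`, and that a composite's `expand` returns the output of the last transform it composes.

```java
import java.util.ArrayList;
import java.util.LinkedHashSet;
import java.util.List;
import java.util.Locale;

public class ExpandSketch {

    // Toy stand-in for a transform: expand describes how input becomes output.
    public interface ToyTransform<InT, OutT> {
        OutT expand(InT input);
    }

    // Toy stand-in for apply: the single place that calls expand.
    public static <InT, OutT> OutT apply(InT input, ToyTransform<InT, OutT> transform) {
        return transform.expand(input);
    }

    // A composite transform defined in terms of two other transforms.
    public static class NormalizeThenDistinct implements ToyTransform<List<String>, List<String>> {
        private final ToyTransform<List<String>, List<String>> lowerCase = in -> {
            List<String> out = new ArrayList<>();
            for (String s : in) {
                out.add(s.toLowerCase(Locale.ROOT));
            }
            return out;
        };
        // LinkedHashSet removes repeats while keeping first-seen order.
        private final ToyTransform<List<String>, List<String>> distinct =
            in -> new ArrayList<>(new LinkedHashSet<>(in));

        @Override
        public List<String> expand(List<String> input) {
            // Return the output of the last composed transform.
            return apply(apply(input, lowerCase), distinct);
        }
    }

    public static void main(String[] args) {
        List<String> input = new ArrayList<>();
        input.add("A");
        input.add("a");
        input.add("B");
        // Callers use apply rather than calling expand directly.
        System.out.println(apply(input, new NormalizeThenDistinct())); // prints [a, b]
    }
}
```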